Publications

International Conference Proceedings (Refereed)

Taiga Someya, Anej Svete, Brian DuSell, Timothy J. O’Donnell, Mario Giulianelli, Ryan Cotterell. 2025. Information Locality as an Inductive Bias for Neural Language Models. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL).
Ryo Yoshida, Shinnosuke Isono, Kohei Kajikawa, Taiga Someya, Yushi Sugimoto, Yohei Oseki. 2025. If Attention Serves as a Cognitive Model of Human Memory Retrieval, What is the Plausible Memory Representation? The 63rd Annual Meeting of the Association for Computational Linguistics (ACL).
Taiga Someya, Ryo Yoshida, Hitomi Yanaka, Yohei Oseki. 2025. Derivational Probing: Unveiling the Layer-wise Construction of Syntactic Structures in Neural Language Models. The 29th Conference on Computational Natural Language Learning (CoNLL).
Ryo Yoshida, Taiga Someya, Yohei Oseki. 2024. Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision. The 62nd Annual Meeting of the Association for Computational Linguistics (ACL), Findings.
Taiga Someya, Tatsuya Ishigaki, Yohei Oseki, Ryo Nagata, Hiroya Takamura. 2024. Leveraging Player Embeddings for Soccer Event Prediction. The 2nd International Workshop on Intelligent Technologies for Precision Sports Science (IT4PSS) in Conjunction with the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024).
Taiga Someya, Yushi Sugimoto, Yohei Oseki. 2024. JCoLA: Japanese Corpus of Linguistic Acceptability, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING).
Taiga Someya, Ryo Yoshida, Yohei Oseki. 2024. Targeted Syntactic Evaluations on the Chomsky Hierarchy, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING).
Taiga Someya, Yohei Oseki. 2023. JBLiMP: Japanese Benchmark of Linguistic Minimal Pairs, Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Findings, 1581–1594.

Presentations at International Conferences (Refereed)

Kenjiro Ide, Taiga Someya, Kohei Kawaguchi, Keisuke Fujii, Interpretable Low-Dimensional Modeling of Spatiotemporal Agent States for Decision Making in Football Tactics, 12th International Conference on Sport Sciences Research and Technology Support (icSPORTS 2024), 2024. (Abstract)
Taiga Someya, Atom Scott, Keisuke Fujii, Hidehisa Akiyama, Tomoharu Nakashima, Hitomi Yanaka. 2024. FootballGPT: Counterfactual Evaluation With a Foundation Model for Football, Opta Forum. (Poster)

Awards

KAKUSEI Project “Ha (破)” Outstanding Achievement Award, 2025.03.
2025 The Association for Natural Language Processing Annual Conference Committee Special Award, 2025.03.
2023 年度未踏 IT 人材発掘・育成事業『スーパークリエータ』, 2024.06.
🏆 LREC-COLING 2024 Best Paper Award, 2024.05
2024 The Association for Natural Language Processing Annual Conference Committee Special Award, 2024.03
2024 The Association for Natural Language Processing Annual Conference Fujitsu Award, 2024.03
2023 Sports Data Science Competition Soccer Division Excellence Award, 2024.01.
2023 The University of Tokyo Todai to Texas Demo Day Award, 2023.11.
JSAI Annual Conference (37th) Student Incentive Award, 2023.09.

Domestic Conference Proceedings (Refereed)

染谷大河, 石垣達也, 大関洋平, 永田亮, 高村大也. 2024. 言語モデリングによる行動選択・状態推移確率の推定に基づく選手定量評価指標, 人工知能学会第 38 回全国大会.
染谷大河, スコットアトム, 藤井慶輔, 秋山英久, 中島智晴, 谷中瞳. 2024. シミュレーションデータを用いたサッカー基盤モデルの構築に向けて, 人工知能学会第 38 回全国大会.
染谷大河, 石垣達也, 大関洋平, 永田亮, 高村大也. 2023. サッカーイベント予測における選手ベクトルの利用, 人工知能学会第 37 回全国大会, 1875-1878.

Domestic Conference Proceedings (Non-refereed)

染谷大河, 吉田遼, 谷中瞳, 大関洋平. Derivational Probing：言語モデルにおける統語構造構築の解明. 2025. 言語処理学会第 31 回年次大会.
染谷大河, 石垣達也, 高村大也. トラッキングデータからのサッカー実況生成. 2025. 言語処理学会第 31 回年次大会.
吉田遼, 磯野真之介, 梶川康平, 染谷大河, 杉本侑嗣, 大関洋平. アテンションが記憶想起の認知モデルたりうるならば、記憶の表現としては何が妥当か？ 2025. 言語処理学会第 31 回年次大会．
盧捷, 金杜, 柴田行輝, 土井智暉, 染谷大河, 谷中瞳. 大規模言語モデルは日本語・中国語の状態パーフェクトを理解できるか? 2025. 言語処理学会第 31 回年次大会.
指田昌樹, 鈴木彩音, 安田卓矢, 染谷大河, 谷中瞳. 従属節が分断された不可能言語を言語モデルは学習するのか. 2025. 言語処理学会第 31 回年次大会．
染谷大河, 大関洋平. 認知ファインチューニング：眼球運動による大規模言語モデルのファインチューニング. 2024. 言語処理学会第 30 回年次大会.
山田祐真, 染谷大河, 大関洋平. 小規模言語モデルによる統語パラメータの獲得. 2024. 言語処理学会第 30 回年次大会.
吉田遼, 染谷大河, 大関洋平. Tree Planted Transformer: 統語的大規模言語モデルの構築に向けて. 2024. 言語処理学会第 30 回年次大会.
染谷大河, 川口康平, 藤井慶輔. 2024. 言語モデリングによる行動選択・状態推移確率の推定に基づく選手定量評価指標, 2023 年度スポーツデータサイエンスコンペティション.
山田裕真, 染谷大河, 大関洋平. 2023. 統語パラメータの生得性：言語モデルからの知見, NLP 若手の会（YANS）第 18 回シンポジウム.
染谷大河, 吉田遼, 中石海, 大関洋平. 2023. チョムスキー階層とニューラル言語モデル, 言語処理学会第 29 回年次大会, 2973-2977.
染谷大河*, 吉田遼*, 中石海*, 濱西祐之介*, 大関洋平. 2022. チョムスキー階層とニューラル言語モデル, NLP 若手の会（YANS）第 17 回シンポジウム.（*は同等の貢献を表す）
染谷大河, 大関洋平. 2022. 日本語版 CoLA の構築, 言語処理学会第 28 回年次大会, 1872-1877.
染谷大河, 進藤裕之, 大関洋平. 2022. 情報抽出技術を用いた JCoLA の拡張に向けて, 言語処理学会第 28 回年次大会, 290-295.
染谷大河, 大関洋平. 2022. 日本語版 CoLA の構築の舞台裏, 言語処理学会ワークショップ「日本語における評価用データセットの構築と利用性の向上」.

Invited talk

染谷大河, 大関洋平. 2022. JBLiMP: 言語モデルの統語的評価のための日本語データセット, 国立国語研究所ワークショップ「日本語における評価用データセットの構築と利用性の向上」.

Taiga Someya（染谷大河）

Publications

International Conference Proceedings (Refereed)

Presentations at International Conferences (Refereed)

Awards

Domestic Conference Proceedings (Refereed)

Domestic Conference Proceedings (Non-refereed)

Invited talk