Research
Publications
-
Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words
Gouki Minegishi, Hiroki Furuta, Yusuke Iwasawa, Yutaka Matsuo
International Conference on Learning Representations (ICLR 2025)
-
Lost in the Distance: Large Language Models Struggle to Capture Long-Distance Relational Knowledge
Meiyun Wang, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo
The 2025 Annual Conference of the Nations of the Americas Chapter of the ACL (NAACL 2025)
-
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur
Transactions on Machine Learning Research (TMLR)
-
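FY2024, 19th Symposium for Young Researchers in Natural Language Processing (YANS2024), Sponsor Award (Hitachi, Ltd. Award): “Negative Impact of Increasing the Number of Instructions on the Instruction-Following Performance of Large Language Models”
Takeshi Kojima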
-
Slender-Mamba: Fully Quantized Mamba From Head to Toe
Zhenxuan Yu, Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa
Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025)
-
“Geometric-Averaged Preference Optimization for Soft Preference Labels”
Hiroki Furuta, Kuang-Huei Lee, Shixiang Shane Gu, Yutaka Matsuo, Aleksandra Faust, Heiga Zen, Izzeddin Gur
Advances in Neural Information Processing Systems 37 (NeurIPS 2024)
-
“Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?”
Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
-
“Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4”
Jiaxian Guo*, Bo Yang*, Paul Yoo, Yuchen Lin, Yutaka Matsuo, Yusuke Iwasawa
AAAI RL+LLM, 2024 (Oral)
-
“Decoupling Noise and Toxic Parameters for Language Model Detoxification by Task Vector Merging”
Yongmin Kim, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo
2024 First Conference on Language Modeling (COLM 2024)
-
“Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks”
Andrew Gambardella, Yusuke Iwasawa, Yutaka Matsuo
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)