Research

  • Publications

    • “Slender-Mamba: Fully Quantized Mamba From Head to Toe.”

      Zhenxuan Yu, Takeshi Kojima, Yutaka Matsuo and Yusuke Iwasawa.

      Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025).

    • “Geometric-Averaged Preference Optimization for Soft Preference Labels.”

      Hiroki Furuta, Kuang-Huei Lee, Shixiang Shane Gu, Yutaka Matsuo, Aleksandra Faust, Heiga Zen, Izzeddin Gur

      Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

    • “ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate”

      Shohei Taniguchi, Keno Harada, Gouki Minegishi, Yuta Oshima, Seong Cheol Jeong, Go Nagahara, Tomoshi Iiyama, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo

      Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

    • “Which Programming Language and What Features at the Pre-training Stage Affect Downstream Logical Inference Performance?”

      Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo

      The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

    • “Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4.”

      Jiaxian Guo*, Bo Yang*, Paul Yoo, Yuchen Lin, Yutaka Matsuo, Yusuke Iwasawa

      AAAI RL+LLM, 2024 (Oral).

    • “Decoupling Noise and Toxic Parameters for Language Model Detoxification by Task Vector Merging.”

      Yongmin Kim, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo

      2024 First Conference on Language Modeling (COLM 2024).

    • “Aligning Superintelligence Goals with Societal Welfare: An Evolutionary Perspective.”

      Hiroshi Yamakawa

      9th International Conference on Robot Ethics and Standards (ICRES 2024).

    • “Sustainability of Digital Life Form Societies.”

      Hiroshi Yamakawa

      9th International Conference on Robot Ethics and Standards (ICRES 2024)

    • “Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks.”

      Andrew Gambardella, Yusuke Iwasawa, Yutaka Matsuo

      Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

    • “KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques.”

      Edison Marrese-Taylor

      Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)