Research

  • Reinforcement Learning
  • Publications

    • Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

      Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno, Wataru Kumagai, Kazumi Kasaura, Kenta Hoshino, Yohei Hosoe, Yutaka Matsuo

      Advances in Neural Information Processing Systems (NeurIPS 2025, Spotlight)

    • Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

      Toshinori Kitamura, Tadashi Kozuno, Wataru Kumagai, Kenta Hoshino, Yohei Hosoe, Kazumi Kasaura, Masashi Hamaya, Paavo Parmas, Yutaka Matsuo

      International Conference on Learning Representations (ICLR 2025)

    • JSAI2018 Excellence Award: “Improving Robustness to Long Action Sequences by Partitioning into Subtasks and Predicting Abstracted Actions in Instruction Following.”

      篠田 一聡,竹澤 祐貴,鈴木 雅大,岩澤 有祐,松尾 豊