Jaewoo Ahn
jaewoo.ahn AT vision.snu.ac.kr

Hi, I’m a Ph.D. candidate in the Department of Computer Science and Engineering at Seoul National University where I am advised by Prof. Gunhee Kim as a member of Vision & Learning Lab. In addition, I am currently a Research Scientist Intern at KRAFTON AI.
I am broadly interested in NLP and multimodal AI, with a particular focus on human-like embodied conversational agents that interact naturally in real-world environments. To this end, my work has advanced consistent persona modeling (MPChat, TimeChara), robust perception (MAC), and embodied action (Orak, FlashAdventure).
Currently, my research focuses on integrating multisensory perception into LLMs to support multimodal interactions in diverse (e.g., computer-use, video game, embodied) environments. In particular, I focus on enhancing decision-making capabilities of LLM/VLM agents across 2D/3D environments.
You can refer to my Research Statement: Towards Coherent Embodied Conversational Agent, if interested in.
News
Sep 27, 2025 | Our FlashAdventure & Orak papers were accepted (as Spotlight & Outstanding, respectively) and will be presented at the Wordplay Workshop @ EMNLP 2025! |
---|---|
Aug 20, 2025 | Our FlashAdventure paper got accepted to EMNLP 2025! |
Jul 24, 2025 | Our ChartCap paper got accepted to ICCV 2025 as a Highlight Poster! |
Jun 05, 2025 | Our Orak benchmark for video game agents is released! |
May 15, 2025 | Our MAC paper got accepted to ACL 2025! |