I'm interested in where perception meets understanding.

I'm a PhD student at Yonsei University MIR Lab, advised by Youngjae Yu. Prior to this, I received my MS in Computer Engineering from Seoul National University, advised by Gunhee Kim. During my PhD, I did research-oriented internships at Microsoft Research, LG AI Research, and Naver.

My research goal is understanding how knowledge emerges and develops by replicating these processes in machines. Arguably, the most fundamental source of human knowledge lies in perception, which forms the basis of how we interpret and learn from the world around us. To reflect this, my current focus is on multimodal artificial intelligence, where I explore challenges that arise from introducing multimodal inputs and outputs to large language models (LLMs).

News

Publications (* equal contribution)

Education

Experience

  • Summer 2025, Microsoft Research Redmond (Topic: Multimodal Reasoning)
  • Summer 2024, LG AI Research (Topic: Multimodal LLMs)
  • Summer 2023, Naver (Topic: Any-to-Any Foundational Models)