Multimodal AI Lab
Dept. of AI, Ewha Womans University. Seoul, Korea.
Multimodal AI Lab @ EWHA (PI: Jiyoung Lee) focuses on developing robust and generalizable AI models that reason and generate information across multiple modalitiesβvision, audio, language, and robotics. Our research spans multimodal large language models (MLLMs), agentic systems, robotics, multimodal generation, video understanding, 3D perception, cross-modal grounding, but not limited to. We build multimodal systems that learn with minimal supervision and perform reliably in diverse, real-world settings. We aim to push the boundaries of multimodal learning and create AGI that is creative, effective, and efficient.
Recruiting Undergraduate Interns/ Graduate Students / Postdoctoral Researchers: We are looking for undergraduate interns, graduate students, and postdoctoral researchers to research with! If you are interested in doing cool multimodal learning research, please send your CV and GPA to .
Β
News
| Feb 2026 | One paper is accepted at CVPR 2026! π |
|---|---|
| Jan 2026 | Two papers are accepted at ICASSP 2026! π |
| Aug 2025 | One paper is accepted at ICCV Workshop@Gen4AVC! π |
| Jul 2025 | One paper is accepted in International Journal of Computer Vision (IJCV) [Q1, IF:9.3]! π |
| Apr 2025 | Multimodal AI Lab @ EWHA website is now open! π |
| Mar 2025 | Prof.Jiyoung Lee joins in Dept. of AI, Ewha Womans University π |
| Feb 2025 | One paper is accepted at CVPR 2025! π |
| Dec 2024 | Prof.Jiyoung Lee presented at Postech AI day (topic: Read, Watch and Scream! Sound Generation from Text and Video). |
| Dec 2024 | Prof.Jiyoung Lee presented at HUST, Vietnam (topic: Audio Generation from Visual Contents). |
| Dec 2024 | One paper is accepted at AAAI 2025! π |
| Oct 2024 | One paper is accepted at NeurIPS 2024 Workshop on Video-Language Models 2024! π |
| Sep 2024 | One paper is accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) [Q1, IF:20.8]! π |
| Sep 2024 | Prof.Jiyoung Lee serves a lecture, Topics in Artificial Intelligence: Multimodal Deep Learning Theories and Applications, at Seoul National University (Fall 2024) |
| Jun 2024 | One paper is accepted in Pattern Recognition (PR) [Q1, IF:7.5]! π |
| Jan 2024 | Two papers are accepted at ICLR 2024! π |
| Before 2024 | You can find our older news in here |