4/20 World Model: Spatial Physical Generative AI (김태균 교수/KAIST 전산학부)

작성자
kaistsoftware
작성일
2026-04-17 17:58
조회
345
  • 강사 : 김태균 교수 (KAIST 전산학부)
  • 일시 : 2026. 4. 20. (월) 16:00~17:30
Recent breakthroughs in large language models (LLMs) and generative AI have demonstrated remarkable capabilities in text, image, and video synthesis. However, despite their scale and fluency, these models remain fundamentally limited in their understanding of 3D space, physical interaction, and embodied reasoning. This talk explores the next frontier beyond LLMs: Spatial Physical Generative AI — systems that not only generate content but understand and reason about the physical world. We begin by examining the evolution from Transformers and large-scale pretraining toward vision-language and world models that attempt grounded intelligence. While current generative video and multimodal models achieve impressive visual realism, they often lack true spatial consistency, physical plausibility, and compositional generalization. Addressing these limitations requires integrating 3D scene representations, physics-based simulation, and generative diffusion frameworks. The talk presents recent advances in 3D spatial AI, motion diffusion, physics-guided video generation, and category-level 6D pose estimation. We introduce methods that combine regression and diffusion modeling with score scaling sampling to capture multi-hypothesis pose distributions efficiently. Furthermore, we highlight MPMAvatar, a hybrid mesh–3D Gaussian Splatting framework that enables physically accurate cloth simulation and photorealistic avatar rendering, demonstrating realistic deformation, collision handling, and zero-shot scene interaction. By unifying generative models with spatial representation and physical simulation, we move toward AI systems capable of embodied reasoning, real-world interaction, and scalable physical intelligence. Spatial Physical Gen AI, also called world models, represents a critical step toward grounded artificial general intelligence (AGI), bridging perception, generation, and physical understanding.
전체 163
번호 제목 작성자 작성일 추천 조회
공지사항
2026년 봄학기 콜로퀴엄 일정 안내
kaistsoftware | 2026.03.04 | 추천 1 | 조회 2712
kaistsoftware 2026.03.04 1 2712
162
5/18 믿고 쓸만한 AI 에이전트를 위하여 (허기홍 교수/KAIST 전산학부)
kaistsoftware | 2026.05.06 | 추천 0 | 조회 254
kaistsoftware 2026.05.06 0 254
161
5/11 LLMServingSim: Bridging AI Hardware Innovation and Real-World LLM Serving (박종세 교수/KAIST 전산학부)
kaistsoftware | 2026.05.06 | 추천 0 | 조회 288
kaistsoftware 2026.05.06 0 288
160
4/27 From Visual Understanding to Embodied Intelligence: Agentic AI and Physical AI in Robotics (윤성의 교수/KAIST 전산학부)
kaistsoftware | 2026.04.17 | 추천 0 | 조회 334
kaistsoftware 2026.04.17 0 334
159
4/20 World Model: Spatial Physical Generative AI (김태균 교수/KAIST 전산학부)
kaistsoftware | 2026.04.17 | 추천 0 | 조회 345
kaistsoftware 2026.04.17 0 345
158
4/13 LLM 시대의 Computer Science 교육과 AI 연구의 미래 (성민혁 교수/KAIST 전산학부)
kaistsoftware | 2026.04.10 | 추천 0 | 조회 402
kaistsoftware 2026.04.10 0 402
157
4/6 Mobile AI Agent (신인식 교수/KAIST 전산학부)
kaistsoftware | 2026.04.10 | 추천 0 | 조회 445
kaistsoftware 2026.04.10 0 445
156
3/30 Keeping up with the abstraction - software engineering for AI (유신 교수/KAIST 전산학부)
kaistsoftware | 2026.03.30 | 추천 0 | 조회 656
kaistsoftware 2026.03.30 0 656
155
3/23 그래프 기계학습을 활용한 인공지능 기술의 최신 연구 동향 (황지영 교수/KAIST 전산학부)
kaistsoftware | 2026.03.19 | 추천 0 | 조회 866
kaistsoftware 2026.03.19 0 866
154
3/16 AGI 시대의 도래, 대학교욱의 역할을 다시 묻는다 (김진형 교수/KAIST 전산학부)
kaistsoftware | 2026.03.19 | 추천 0 | 조회 1150
kaistsoftware 2026.03.19 0 1150
153
12/1 Car as an Experience - Designing for the Driving Context (정세빈 부장/GM Korea)
kaistsoftware | 2025.11.28 | 추천 1 | 조회 1936
kaistsoftware 2025.11.28 1 1936