3/12 Energy-Efficient AI Training Systems (Prof. Youngjin Kwon / KAIST School of Computing)

Author: kaistsoftware
Posted: 2024-03-05 15:13
Views: 10689
  • Speaker: Prof. Youngjin Kwon (KAIST School of Computing)
  • Date and time: Tuesday, March 12, 2024, 16:00~17:30
Energy saving is a crucial mission for data center providers. Among many services, DNN training and inference are significant contributors to energy consumption. This work focuses on saving energy in multi-GPU DNN training. Typically, energy savings come at the cost of some degree of performance degradation. However, determining the acceptable level of performance degradation for a long-running training job can be difficult.

This work proposes ENVPIPE, an energy-saving DNN training framework. ENVPIPE aims to maximize energy saving while maintaining negligible performance slowdown. ENVPIPE takes advantage of slack time created by bubbles in pipeline parallelism. It schedules pipeline units to place bubbles after pipeline units as frequently as possible and then stretches the execution time of pipeline units by lowering the SM frequency. During this process, ENVPIPE does not modify hyperparameters or pipeline dependencies, preserving the original accuracy of the training task. It selectively lowers the SM frequency of pipeline units to avoid performance degradation. We implement ENVPIPE as a library using PyTorch and demonstrate that it can save up to 25.2% energy in single-node training with 4 GPUs and 28.4% in multi-node training with 16 GPUs, while keeping performance degradation to less than 1%.
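
The slack-based frequency selection described above can be illustrated with a minimal sketch. This is not ENVPIPE's actual algorithm or API: the `PipelineUnit` class, the `pick_sm_frequency` function, the frequency list, and the timing numbers are all hypothetical, and the sketch assumes, as a simplification, that a unit's execution time scales inversely with SM frequency. Applying the chosen frequency to a real GPU is out of scope here.

```python
from dataclasses import dataclass


@dataclass
class PipelineUnit:
    """Hypothetical record for one forward or backward pipeline unit."""
    name: str
    duration_ms: float  # runtime measured at the maximum SM frequency
    slack_ms: float     # bubble time immediately following the unit


def pick_sm_frequency(unit, freqs_mhz, max_freq_mhz):
    """Return the lowest frequency whose stretched runtime still fits
    inside the unit's own duration plus the bubble that follows it.

    Assumption (simplified model): runtime is inversely proportional
    to SM frequency.
    """
    budget_ms = unit.duration_ms + unit.slack_ms
    feasible = [
        f for f in freqs_mhz
        if unit.duration_ms * max_freq_mhz / f <= budget_ms
    ]
    # With no slack, only the maximum frequency is feasible, so the
    # unit is left untouched and the iteration is not slowed down.
    return min(feasible) if feasible else max_freq_mhz


# Illustrative numbers only.
units = [
    PipelineUnit("fwd_0", duration_ms=10.0, slack_ms=5.0),
    PipelineUnit("bwd_0", duration_ms=20.0, slack_ms=0.0),
]
freqs = [900, 1050, 1200, 1350, 1410]
plan = {u.name: pick_sm_frequency(u, freqs, max_freq_mhz=1410) for u in units}
print(plan)
```

In this toy setup, the unit followed by a bubble is assigned a lower frequency, while the unit with no slack stays at the maximum frequency. That mirrors the idea of lowering the SM frequency selectively so that the iteration time does not grow.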
