12/2 Multilingual and Cross-Lingual Analysis of Neural Machine Translation Models (김재명, Researcher / NAVER LABS Europe, France)

Author: kaistsoftware
Posted: 2021-12-01 16:51

Speaker: 김재명, Researcher (NAVER LABS Europe, France)
Date and time: Thursday, December 2, 2021, 17:00-18:30

In this talk, we explore and analyze multilinguality and cross-linguality with respect to neural machine translation (NMT).
Recent analyses of multilingual representations focus on whether language-independent representations emerge, or whether a multilingual model partitions its weights among different languages. While most such work has been conducted in a "black-box" manner, in this talk we analyze individual components of a multilingual NMT model. In particular, we identify the encoder self-attention and encoder-decoder attention heads (in a many-to-one NMT model) that are more specific to the translation of a certain language pair than others by (1) employing metrics that quantify aspects of the attention weights, such as "variance" or "confidence", and (2) systematically ranking the importance of attention heads with respect to translation quality. Surprisingly, we observe that the set of most important attention heads is very similar across language pairs, and that nearly one-third of the less important heads can be removed without greatly hurting translation quality.
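As a rough illustration of what attention-head metrics and head pruning can look like, here is a minimal sketch. The abstract does not define "confidence" or "variance", so the definitions below are assumptions: confidence is taken as the mean of the largest attention weight per query position, and variance as the mean variance of the attended key position. The function names, the toy attention matrices, and the importance scores are all hypothetical, and the pruning helper simply drops the lowest-scoring fraction of heads rather than reproducing the ranking procedure used in the talk.

```python
from statistics import mean


def head_confidence(attn):
    """Assumed definition: average of the maximum attention weight in each row."""
    return mean(max(row) for row in attn)


def head_position_variance(attn):
    """Assumed definition: average, over rows, of the variance of the key
    position under that row's attention distribution."""
    variances = []
    for row in attn:
        mu = sum(j * w for j, w in enumerate(row))
        variances.append(sum(w * (j - mu) ** 2 for j, w in enumerate(row)))
    return mean(variances)


def prune_least_important(scores, fraction=1 / 3):
    """Return the head names kept after dropping the lowest-scoring fraction.

    `scores` maps a (hypothetical) head name to an importance score.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)
    n_drop = int(len(ranked) * fraction)
    return ranked[: len(ranked) - n_drop]


# Toy attention matrices (rows = query positions, columns = key positions).
focused = [[0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.1, 0.0, 0.9]]
diffuse = [[1 / 3, 1 / 3, 1 / 3] for _ in range(3)]

# Hypothetical importance scores for six heads.
importance = {"h0": 0.8, "h1": 0.1, "h2": 0.5, "h3": 0.05, "h4": 0.6, "h5": 0.3}
kept = prune_least_important(importance)
```

Under these assumed definitions, a head that attends sharply to single positions scores high on confidence and low on variance, while a uniformly attending head shows the opposite pattern.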
Having examined the internals of multilingual NMT models, we then turn to the bilingual (and cross-lingual) data itself. More specifically, we investigate whether discourse relations are preserved across languages, using openly available discourse corpora derived from TED talks. We find that, on average, 68% and 48% of inter-sentential discourse relations match exactly across 28 language pairs at the first and second levels of the Penn Discourse Treebank hierarchy, respectively. Motivated by these findings, we perform a preliminary study on the effectiveness of discourse relations when applied to context-aware NMT. Experimental results show that adding discourse information can enhance NMT models' ability to adapt to contextual information and to better handle various discourse phenomena. In addition, we show that constraining the type of discourse relation makes it possible to control the target translation by adding appropriate discourse markers while maintaining translation quality.
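To make the "exact match at the first and second level" measurement concrete, the following is a small sketch of how such a rate could be computed. It assumes that senses are written as dot-separated paths in the Penn Discourse Treebank style (for example `Contingency.Cause`) and that aligned relation pairs are already available. The function names and the toy pairs are hypothetical and are not data from the talk.

```python
def sense_prefix(sense, level):
    """Truncate a dot-separated sense label to its first `level` components."""
    return tuple(sense.split(".")[:level])


def match_rate(pairs, level):
    """Fraction of aligned relation pairs whose senses agree down to `level`."""
    matches = sum(
        sense_prefix(src, level) == sense_prefix(tgt, level) for src, tgt in pairs
    )
    return matches / len(pairs)


# Toy aligned relations: (sense in language A, sense in language B).
toy_pairs = [
    ("Contingency.Cause", "Contingency.Cause"),
    ("Contingency.Cause", "Contingency.Condition"),
    ("Expansion.Conjunction", "Comparison.Contrast"),
    ("Temporal.Asynchronous", "Temporal.Asynchronous"),
]

level1 = match_rate(toy_pairs, level=1)  # agreement on the top-level class
level2 = match_rate(toy_pairs, level=2)  # agreement on class and type
```

Because a level-2 match implies a level-1 match, the level-2 rate can never exceed the level-1 rate, which is consistent with the ordering of the two figures reported above.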
