Computer Vision Faculty Publications

Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation

Chenxu Yang, Institute of Information Engineering
Zheng Lin, Institute of Information Engineering
Lanrui Wang, Institute of Information Engineering
Chong Tian, Mohamed Bin Zayed University of Artificial IntelligenceFollow
Liang Pang, Institute of Computing Technology Chinese Academy of Sciences
Jiangnan Li, Institute of Information Engineering
Qirong Ho, Mohamed Bin Zayed University of Artificial IntelligenceFollow
Yanan Cao, Institute of Information Engineering

Document Type

Conference Proceeding

Publication Title

EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings

Abstract

Knowledge-grounded dialogue generation aims to mitigate the issue of text degeneration by incorporating external knowledge to supplement the context. However, the model often fails to internalize this information into responses in a human-like manner. Instead, it simply inserts snippets of the provided knowledge into generic responses. As a result, the generated responses tend to be tedious, incoherent, and in lack of interactivity which means the degeneration problem is still unsolved. In this work, we find that such copying-style degeneration is primarily due to the weak likelihood objective, which allows the model to "cheat" the objective by merely duplicating knowledge snippets in a superficial pattern matching manner based on overlap. To overcome this challenge, we propose a Multi-level Adaptive Contrastive Learning (MACL) framework that dynamically samples negative examples and subsequently penalizes degeneration behaviors at both the token-level and sequence-level. Extensive experiments on the WoW dataset demonstrate the effectiveness of our approach across various pre-trained models and decoding strategies.

First Page

8002

Last Page

8015

Publication Date

1-1-2023

Recommended Citation

C. Yang et al., "Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation," EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings, pp. 8002 - 8015, Jan 2023.

This document is currently not available here.

COinS

Computer Vision Faculty Publications

Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation

Document Type

Publication Title

Abstract

First Page

Last Page

Publication Date

Recommended Citation

Browse

Contribute

Links

Computer Vision Faculty Publications

Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

Publication Date

Recommended Citation

Share

Browse

Contribute

Links