Challenging Instances are Worth Learning: Generating Valuable Negative Samples for Response Selection Training (2109.06538v1)

Published 14 Sep 2021 in cs.CL

Abstract: Retrieval-based chatbot selects the appropriate response from candidates according to the context, which heavily depends on a response selection module. A response selection module is generally a scoring model to evaluate candidates and is usually trained on the annotated positive response and sampled negative responses. Sampling negative responses lead to two risks: a). The sampled negative instances, especially that from random sampling methods, are mostly irrelevant to the dialogue context and too easy to be fitted at the training stage while causing a weak model in the real scenario. b). The so-called negative instances may be positive, which is known as the fake negative problem. To address the above issue, we employ pre-trained LLMs, such as the DialoGPT to construct more challenging negative instances to enhance the model robustness. Specifically, we provide garbled context to the pre-trained model to generate responses and filter the fake negative ones. In this way, our negative instances are fluent, context-related, and more challenging for the model to learn, while can not be positive. Extensive experiments show that our method brings significant and stable improvements on the dialogue response selection capacity.

PDF Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (4)

Yao Qiu (6 papers)
Jinchao Zhang (49 papers)
Huiying Ren (2 papers)
Jie Zhou (687 papers)

Citations (3)

View on Semantic Scholar

Challenging Instances are Worth Learning: Generating Valuable Negative Samples for Response Selection Training (2109.06538v1)

Related Papers