Generating, retrieving persona and generating responses for long-term open-domain dialogue

Dohyun Cha; Dawon Lee; Jihie Kim

doi:10.7717/peerj-cs.2979

PeerJ Computer Science (Jul 2025)

Generating, retrieving persona and generating responses for long-term open-domain dialogue

Dohyun Cha,
Dawon Lee,
Jihie Kim

Affiliations

Dohyun Cha: Department of Computer Science and Artificial Intelligence, Dongguk University, Seoul, Republic of South Korea
Dawon Lee: Division of AI Software Convergence, Dongguk University, Seoul, Republic of South Korea
Jihie Kim: Department of Computer Science and Artificial Intelligence, Dongguk University, Seoul, Republic of South Korea

DOI: https://doi.org/10.7717/peerj-cs.2979
Journal volume & issue: Vol. 11
p. e2979

Abstract

Read online Read online

Open-domain dialogue systems have shown remarkable capabilities in generating natural and consistent responses in short-term conversations. However, in long-term conversations such as multi-session chat (MSC), where the dialogue history exceeds the model’s maximum input length (i.e., 1024 tokens), existing dialogue generation systems often overlook the information from earlier dialogues, leading to the loss of context. To prevent such loss and generate natural, consistent responses, we propose a GRGPerDialogue framework, consisting of three main stages: generating persona from past dialogues, retrieving persona relevant to the current utterance, and generating responses based on both persona and recent dialogues. In the first stage, we generate the persona of each speaker in real-time with diverse expressions, leveraging Llama 2 In-Context Learning (ICL). Subsequently, we propose a new dataset called Persona-Utterance Pair (PUP) and use it to train Facebook dense passage retrieval (DPR) model for retrieving persona sentences relevant to the current utterance. Finally, we train generative models such as Generative Pre-trained Transformer 2 (GPT-2) and Bidirectional and Auto-Regressive Transformers (BART) to generate responses based on retrieved persona sentences and the recent dialogues. Experimental results on a long-term dialogue dataset demonstrate that the GRGPerDialogue framework outperforms baseline models by approximately 0.6% to 1% in terms of the Rouge-1 metric. Furthermore, human evaluation results supported the effectiveness of GRGPerDialogue. These results indicate that GRGPerDialogue can generate responses that are not only more fluent and consistent, but also more relevant to the dialogue history than baseline models.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords