DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation
Abstract
We present a large, tunable neural conversational response generation model, DialoGPT (dialogue generative pre-trained transformer). Trained on 147M conversation-like exchanges extracted from Reddit comment chains spanning 2005 through 2017, DialoGPT extends the Hugging Face PyTorch transformer to attain performance close to human in both automatic and human evaluation in single-turn dialogue settings. We show that conversational systems leveraging DialoGPT generate more relevant, contentful, and context-consistent responses than strong baseline systems. The pre-trained model and training pipeline are publicly released to facilitate research into neural response generation and the development of more intelligent open-domain dialogue systems.
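Since the pre-trained model is publicly released, it can be loaded through the Hugging Face transformers library. Below is a minimal sketch of single-turn response generation; the microsoft/DialoGPT-medium checkpoint name and the decoding settings are illustrative assumptions, not prescribed by the abstract.

```python
# Minimal single-turn generation sketch using the released DialoGPT weights.
# Assumes the `transformers` library and the public "microsoft/DialoGPT-medium"
# checkpoint; max_length and greedy decoding here are illustrative choices.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")
model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")

# DialoGPT separates dialogue turns with the end-of-sequence token.
prompt = "Does money buy happiness?" + tokenizer.eos_token
input_ids = tokenizer.encode(prompt, return_tensors="pt")

# Generate a continuation; tokens after the prompt form the response.
output_ids = model.generate(
    input_ids,
    max_length=100,
    pad_token_id=tokenizer.eos_token_id,
)
response = tokenizer.decode(
    output_ids[0, input_ids.shape[-1]:], skip_special_tokens=True
)
print(response)
```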
Community
This is an automated message from the Librarian Bot. The following similar papers were recommended by the Semantic Scholar API:
- Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs (2025)
- Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs (2025)
- Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation (2025)
- SS-MPC: A Sequence-Structured Multi-Party Conversation System (2025)
- SAGE: Steering and Refining Dialog Generation with State-Action Augmentation (2025)
- Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning (2025)
- Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation (2025)
Models citing this paper: 22
Datasets citing this paper: 0
Spaces citing this paper: 1,332
Collections including this paper: 0