A Simple Language Model for Task-Oriented Dialogue

Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher

Task-oriented dialogue is often decomposed into three tasks: understanding user input, deciding actions, and generating a response. This allows for dedicated models for each sub-task, but we find a simple, unified approach leads to state-of-the-art performance across multiple settings on the MultiWOZ dataset. SimpleTOD is a simple approach to task-oriented dialogue that uses a single causal language model trained on all sub-tasks recast as a single sequence prediction problem. This allows SimpleTOD to fully leverage transfer learning from pre-trained, open domain, causal language models such as GPT-2. SimpleTOD improves over the prior state-of-the-art by 1.22 points in joint goal accuracy for dialogue state tracking. SimpleTOD also improves all three metrics used to evaluate action and response generation in the most complete setting for task-oriented dialog systems: inform rate by 8.1 points, success rate by 9.7 points, and BLEU by 23.5 points.

Knowledge Graph

arrow_drop_up

Comments

Sign up or login to leave a comment