Building an open domain socialbot with self-dialogues

By University of Edinburgh
2017
Edina is a conversational agent whose responses draw on data collected from Amazon Mechanical Turk (AMT) through a new technique we call self-dialogues. These are conversations in which a single AMT Worker plays both participants in a dialogue. Such dialogues are surprisingly natural, efficient to collect, and reflective of relevant and/or trending topics. These self-dialogues provide training data for a generative neural network as well as a basis for soft rules used by a matching score component. We present a methodology for combining rule-based, retrieval, and generative methods to effectively leverage our data. Our hybrid data-driven methodology thus addresses both the coverage limitations of a strictly rule-based approach and the lack of guarantees of a strictly machine-learning approach.
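To make the hybrid idea concrete, the following is a minimal sketch of how rule-based, retrieval, and generative components might be chained, and not a description of Edina's actual implementation. Everything in it is an assumption made for illustration: the toy dialogue pairs, the function names (`rule_based`, `matching_score`, `retrieve`, `generate`, `respond`), the use of token-overlap (Jaccard) similarity as the matching score, the 0.5 threshold, and the fallback order. The generative network is replaced by a fixed placeholder string.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical toy data: (prompt, reply) pairs standing in for consecutive
# turns taken from self-dialogues. These are invented for illustration.
SELF_DIALOGUE_PAIRS: List[Tuple[str, str]] = [
    ("have you seen the new star wars movie",
     "Yes, I loved it. Who is your favorite character?"),
    ("what kind of music do you like",
     "I mostly listen to rock. How about you?"),
]


def tokenize(text: str) -> set:
    """Lowercase the text and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def rule_based(utterance: str) -> Optional[str]:
    """Hand-written rule (assumed): answer greetings, otherwise abstain."""
    if tokenize(utterance) & {"hello", "hi", "hey"}:
        return "Hi there! What would you like to talk about?"
    return None


def matching_score(a: str, b: str) -> float:
    """Jaccard token overlap, used here as a stand-in matching score."""
    ta, tb = tokenize(a), tokenize(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def retrieve(utterance: str,
             pairs: List[Tuple[str, str]],
             threshold: float = 0.5) -> Optional[str]:
    """Return the reply of the best-matching prompt, if it clears the threshold."""
    best_score, best_reply = 0.0, None
    for prompt, reply in pairs:
        score = matching_score(utterance, prompt)
        if score > best_score:
            best_score, best_reply = score, reply
    return best_reply if best_score >= threshold else None


def generate(utterance: str) -> str:
    """Placeholder for a generative neural network trained on self-dialogues."""
    return "That's interesting. Tell me more."


def respond(utterance: str) -> Tuple[str, str]:
    """Try rules, then retrieval, then generation; return (source, reply)."""
    reply = rule_based(utterance)
    if reply is not None:
        return "rule", reply
    reply = retrieve(utterance, SELF_DIALOGUE_PAIRS)
    if reply is not None:
        return "retrieval", reply
    return "generative", generate(utterance)


if __name__ == "__main__":
    for text in ["Hello!",
                 "Have you seen the new Star Wars movie?",
                 "Tell me about quantum physics"]:
        print(text, "->", respond(text))
```

In this sketch the rule and retrieval stages give predictable replies where they apply, while the generative fallback supplies coverage for everything else, which mirrors the trade-off the abstract describes.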

Authors: Ben Krause, Marco Damonte*, Mihai Dobre*, Daniel Duma*, Federico Fancellu*†, Emmanuel Kahembwe*, Jianpeng Cheng, Joachim Fainberg*, Bonnie Webber‡

* equal contribution; † team leader; ‡ faculty advisor
