Conversational AI

NAACL: Industry track offers reality checks, new directions

Industry track chair and Amazon principal research scientist Rashmi Gangadharaiah on trends in industry papers and the challenges of building practical dialogue systems.

July 08, 2022

The annual meeting of the North American chapter of the Association for Computational Linguistics (NAACL) introduced an industry track in 2018, and at this year’s conference, which begins next week, one of the industry track chairs is Amazon principal research scientist Rashmi Gangadharaiah.

Rashmi Gangadharaiah.png — Rashmi Gangadharaiah, a principal research scientist at Amazon and an industry track chair at this year's meeting of the North American chapter of the Association for Computational Linguistics (NAACL).

“The NAACL industry track inspired industry tracks at other conferences such as COLING and EMNLP,” Gangadharaiah says. “The industry track provides a forum for researchers in the industry to exchange ideas and discuss successful deployments of ML [machine learning] and NLP [natural-language processing] technologies, as well as share challenges that arise in deploying such systems in real-world settings.”

For instance, Gangadharaiah explains, “academic research is often done in very controlled settings. It's not a negative thing: people have to do research, and it's useful to start in a controlled setting. But when we put such systems in real-world situations, we typically have to worry about latency, memory, and space. It's not always accuracy that we go for. It's a balance of latency, memory, space, and accuracy — and a question of how we measure accuracy. So I think it makes it more interesting that way.”

Similarly, Gangadharaiah explains, industry track papers often report negative results. “There are lots of papers that get published in academia, but when we try to put it in real-world settings, we notice that many of these methods don't work well,” she says. “So we do have papers on negative results. And it's crucial, because we do want to show that these are the methods that we tried, and they didn't work.”

The case for simplicity

Hierarchical thinking

Of course, not all industry papers report negative results, and in some cases, Gangadharaiah says, industry research has pointed in directions where academic research has followed.

Again, her own research provides an example. The dialogue systems that Gangadharaiah works on are goal directed, meaning that the purpose of each dialogue is that an AI agent should identify and fulfill the goal of a human speaker. Such systems rely on natural-language-understanding models to make sense of customer utterances, but they also include state trackers that assess progress toward the speaker’s goal.

There is some need of semantic parsing in dialogue systems. ... I think the industry kind of motivated all that work.

Rashmi Gangadharaiah

“If you consider restaurant booking, you might say that you want to book a restaurant for six people, and then you might change your mind and say, ‘Hey no, now I want it for eight people,’” Gangadharaiah explains. “The system will have to make appropriate changes.

“We can introduce more complexity. So, for example, if you're ordering a pizza, maybe you would start with toppings of olives, and then you might go to pepperoni. In this case, you're not asking the system to replace olives with pepperoni; multiple values are being provided for the toppings itself.”

Alexa Conversations modeling architecture

Large language models

Recently, the big story in natural-language processing (NLP) has been the power and adaptability of large language models, such as BERT and GPT-3, that encode the probabilities of long sequences of words and can be fine-tuned on particular NLP tasks. They have applications in dialogue management, too, Gangadharaiah says.

“We’ve successfully deployed such models in Amazon,” she says, “and we’ve been actively exploring how to improve these models in order to make our chatbots — such as AWS Chatbot, LEX, and Alexa — more powerful. For example, I can take these large language models and then fine-tune them on, let's say, a restaurant domain, where I want to book certain seats in a certain restaurant for a certain number of people, and so on.