MASSIVE is a parallel dataset of > 1M utterances across 52 languages with annotations for the Natural Language Understanding tasks of intent prediction and slot annotation. Utterances span 60 intents and include 55 slot types. MASSIVE was created by localizing the Spoken Language Understanding Resource Package (SLURP) dataset, composed of general intelligent voice assistant single-shot interactions.
MASSIVE
2022
Last updated July 20, 2023