-
ACM FAccT 20232023Warning: This paper contains examples of gender non-affirmative language which could be offensive, upsetting, and/or triggering. Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life. Given the recent popularity and adoption of language generation technologies, the potential to further marginalize this population only grows. Although a multitude
-
ACL 20232023Recent studies show that sentence-level extractive QA, i.e., based on Answer Sentence Selection (AS2), is outperformed by Generationbased QA (GenQA) models, which generate answers using the top-k answer sentences ranked by AS2 models (a la retrieval-augmented generation style). In this paper, we propose a novel training paradigm for GenQA using supervision from automatic QA evaluation models (GAVA). Specifically
-
AAAI 2023 Workshop on Creative AI Across Modalities2023Automatic song writing is a topic of significant practical interest. However, its research is largely hindered by the lack of training data due to copyright concerns and challenged by its creative nature. Most noticeably, prior works often fall short of modeling the cross-modal correlation between melody and lyrics due to limited parallel data, hence generating lyrics that are less singable. Existing works
-
EMNLP 20232023Product attribute extraction is an emerging field in information extraction and e-commerce, with applications including knowledge base construction, product recommendation, and enhancing customer experiences. In this work, we explore the use of generative models for product attribute extraction. We analyze their utility with hard and soft prompting methods, and demonstrate their ability to generate implicit
-
ASRU 20232023Endpoint (EP) detection is a key component of far-field speech recognition systems that assist the user through voice commands. The endpoint detector has to trade-off between accuracy and latency, since waiting longer reduces the cases of users being cut-off early. We propose a novel two-pass solution for endpointing, where the utterance endpoint detected from a first pass endpointer is verified by a 2nd-pass
Related content
-
September 13, 2021How Amazon intern Michael Saxon uses his experience with automatic speech recognition models to help Alexa answer complex queries.
-
September 10, 2021Data augmentation makes examples more realistic, while continual-learning techniques prevent “catastrophic forgetting”.
-
September 09, 2021Model using ASR hypotheses as extra inputs reduces word error rate of human transcriptions by almost 11%.
-
September 02, 2021Branching encoder networks make operation more efficient, while “neural diffing” reduces bandwidth requirements for model updates.
-
August 27, 2021Liu discusses her work in speech recognition and understanding, prosody modeling, summarization, and natural language processing.
-
August 27, 2021New voice for Alexa’s Reading Sidekick feature avoids the instabilities common to models with variable prosody.