-
EMNLP 20232023Statistical significance testing is used in natural language processing (NLP) to determine whether the results of a study or experiment are likely to be due to chance or if they reflect a genuine relationship. A key step in significance testing is the estimation of confidence interval which is a function of sample variance. Sample variance calculation is straightforward when evaluating against ground truth
-
2023 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)2023To boost training and adaptation of end to end (E2E) automatic speech recognition (ASR) models, several approaches to use paired speech-text input together with unpaired text input have emerged. They aim at improving the model performance on rare words, personalisation, and long tail. In this work, we present a systematic study of the impact of such training/adaptation and compare it to training with synthetic
-
EMNLP 20232023While recent studies have looked into the abilities of large language models in various benchmark tasks, few studies have looked into the controllability of large language models on generation tasks. We present a systematic and extensive analysis of the controllability of large language models on ten benchmarks, including a new simple yet challenging numerical planning benchmark with different granularities
-
NeurIPS 20232023A large body of NLP research has documented the ways gender biases manifest and amplify within large language models (LLMs), though this research has pre- dominantly operated within a gender binary-centric context. A growing body of work has identified the harmful limitations of this gender-exclusive framing; many LLMs cannot correctly and consistently refer to persons outside the gender binary, especially
-
AAAI 20242023Toxic content detection is crucial for online services to remove inappropriate content that violates community standards. To automate the detection process, prior works have proposed varieties of machine learning (ML) approaches to train Language Models (LMs) for toxic content detection. However, both their accuracy and transferability across datasets are limited. Recently, Large Language Models (LLMs)
Related content
-
April 20, 2022MASSIVE dataset and Massively Multilingual NLU (MMNLU-22) competition and workshop will help researchers scale natural-language-understanding technology to every language on Earth.
-
April 07, 2022The JHU + Amazon Initiative for Interactive AI (AI2AI) will be housed in the Whiting School of Engineering.
-
April 04, 2022Thanks to a set of simple abstractions, models with different architectures can be integrated and optimized for particular hardware accelerators.
-
March 23, 2022Amazon researchers optimize the distributed-training tool to run efficiently on the Elastic Fabric Adapter network interface.
-
March 16, 2022A machine learning model learns representations that cluster devices according to their usage patterns.
-
March 03, 2022As an applied science manager at Amazon, Muthu Chandrasekaran works on new tools to automate and build a risk technology.