Group masked autoencoder based density estimator for audio anomaly detection

Ritwik Giri; Fangzhou Cheng; Karim Helwani; Srikanth V. Tenneti; Umut Isik; Arvindh Krishnaswamy

2020

In this paper, we address the problem of detecting previously unseen anomalous audio events when the training dataset itself does not contain any examples of anomalies. While traditional density estimation techniques, such as the Gaussian Mixture Model (GMM), have shown promise for this problem in the past, recent advances in neural density estimation have made neural approaches suitable for the anomaly detection task as well. In this work, we develop a novel neural density estimation technique based on the Group-Masked Autoencoder, which estimates the density of an audio time series by taking into account the intra-frame statistics of the signal. We validate the proposed approach on the DCASE 2020 challenge dataset (Task 2 - Unsupervised Detection of Anomalous Sounds for Machine Condition Monitoring). We demonstrate its effectiveness by comparing it against the baseline autoencoder model and against the recently proposed Interpolating Deep Neural Network (IDNN) model.
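To make the general idea of density-based anomaly scoring concrete, the following is a minimal sketch and not the method proposed in the paper. It swaps in a single diagonal Gaussian for the Group-Masked Autoencoder, fits it to normal-only data, and scores test frames by negative log-likelihood. The function names `fit_gaussian` and `anomaly_score`, the 8-dimensional synthetic features, and the shifted "anomalous" distribution are all assumptions made for illustration.

```python
import numpy as np


def fit_gaussian(train):
    """Fit a diagonal Gaussian to normal-only training features (rows = frames)."""
    mean = train.mean(axis=0)
    var = train.var(axis=0) + 1e-6  # small floor keeps the variance strictly positive
    return mean, var


def anomaly_score(x, mean, var):
    """Negative log-likelihood per frame; higher means less likely under the model."""
    return 0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var, axis=-1)


# Synthetic stand-ins for audio features (assumption: real systems would use
# spectral features such as log-mel frames extracted from recordings).
rng = np.random.default_rng(0)
normal_train = rng.normal(0.0, 1.0, size=(1000, 8))
normal_test = rng.normal(0.0, 1.0, size=(100, 8))
anomalous_test = rng.normal(4.0, 1.0, size=(100, 8))  # toy anomaly: shifted mean

# Training uses normal data only, mirroring the unsupervised setting.
mean, var = fit_gaussian(normal_train)
normal_scores = anomaly_score(normal_test, mean, var)
anomalous_scores = anomaly_score(anomalous_test, mean, var)
```

In this toy setting the anomalous frames receive higher scores on average than the normal ones, so thresholding the score would separate them. A neural density estimator plays the same role as `fit_gaussian` and `anomaly_score` here, but with a far more expressive model of the density.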
