Semi-Supervised NMF-CNN for Sound Event Detection

نویسندگان

چکیده

The lack of strongly labeled data can limit the potential a Sound Event Detection (SED) system trained using deep learning approaches. To address this issue, paper proposes novel method to approximate strong labels for weakly Nonnegative Matrix Factorization (NMF) in supervised manner. Using combinative transfer and semi-supervised framework, two different Convolutional Neural Networks (CNN) are synthetic data, approximated unlabeled where one model will produce audio tags. In contrast, other frame-level prediction. proposed methodology is then evaluated on three subsets Classification Acoustic Scenes Events (DCASE) 2020 dataset: validation dataset, challenge evaluation public YouTube dataset. Based results, our outperforms baseline by minimum 7% across these subsets. addition, also top 3 submissions from DCASE 2019 task 4 datasets. Our performance competitive against submission data. A post-challenge analysis was performed which revealed causes difference between 4. leading that we observed 1) detection threshold tuning 2) augmentation techniques used. We could perform better than first place 1.5% changing method. more robust long-duration clips, outperformed them 37%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

STED: Semi-Supervised Targeted Event Detection

Social microblogs such as Twitter and Weibo are experiencing an explosive growth with billions of global users sharing their daily observations and thoughts. Beyond public interests (e.g., sports, music), microblogs can provide highly detailed information for those interested in public health, homeland security, and financial analysis. However, the language used in Twitter is heavily informal, ...

متن کامل

Robust Sound Event Detection through Noise Estimation and Source Separation Using Nmf

This paper addresses the problem of sound event detection under non-stationary noises and various real-world acoustic scenes. An effective noise reduction strategy is proposed in this paper which can automatically adapt to background variations. The proposed method is based on supervised non-negative matrix factorization (NMF) for separating target events from noise. The event dictionary is tra...

متن کامل

Semi-Supervised Novelty Detection

A common setting for novelty detection assumes that labeled examples from the nominal class are available, but that labeled examples of novelties are unavailable. The standard (inductive) approach is to declare novelties where the nominal density is low, which reduces the problem to density level set estimation. In this paper, we consider the setting where an unlabeled and possibly contaminated...

متن کامل

Max-Margin Semi-NMF

In this paper, we propose a maximum-margin framework for classification using Nonnegative Matrix Factorization. In contrast to previous approaches where the classification and matrix factorization stages are separated, we incorporate the maximum margin constraints within the NMF formulation, i.e we solve for a base matrix that maximizes the margin of the classifier in the low dimensional featur...

متن کامل

Semi-supervised Learning for Anomalous Trajectory Detection

A novel learning framework is proposed for anomalous behaviour detection in a video surveillance scenario, so that a classifier which distinguishes between normal and anomalous behaviour patterns can be incrementally trained with the assistance of a human operator. We consider the behaviour of pedestrians in terms of motion trajectories, and parametrise these trajectories using the control poin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3113903