Latent Autoregressive Source Separation

نویسندگان

چکیده

Autoregressive models have achieved impressive results over a wide range of domains in terms generation quality and downstream task performance. In the continuous domain, key factor behind this success is usage quantized latent spaces (e.g., obtained via VQ-VAE autoencoders), which allow for dimensionality reduction faster inference times. However, using existing pre-trained to perform new non-trivial tasks difficult since it requires additional fine-tuning or extensive training elicit prompting. This paper introduces LASS as way vector-quantized Latent Source Separation (i.e., de-mixing an input signal into its constituent sources) without requiring gradient-based optimization modifications models. Our separation method relies on Bayesian formulation autoregressive are priors, discrete (non-parametric) likelihood function constructed by performing frequency counts sums addend tokens. We test our images audio with several sampling strategies ancestral, beam search) showing competitive approaches while offering at same time significant speedups scalability higher dimensional data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Informed Source Separation Using Latent Components

We address the issue of source separation in a particular informed configuration where both the sources and the mixtures are assumed to be known during a so-called encoding stage. This knowledge enables the computation of a side information which ought to be small enough to be watermarked in the mixtures. At the decoding stage, the sources are no longer assumed to be known, only the mixtures an...

متن کامل

Probabilistic Latent Component Analysis for Gearbox Vibration Source Separation

Probabilistic latent component analysis (PLCA) is applied to the problem of gearbox vibration source separation. A model for the probability distribution of gearbox vibration employs a latent variable intended to correspond to a particular vibration source, with the measured vibration at a particular sensor for each source the product of a marginal distribution of vibration by frequency, a marg...

متن کامل

Audio Source Separation by Probabilistic Latent Component Analysis

The problem of audio source separation from a monophonic sound mixture having known instrument types but unknown timbres is presented. An improvement to the Probabilistic Latent Component Analysis (PLCA) source separation method is proposed. The technique uses a basis function dictionary to produce a first round PLCA source separation. The PLCA weights are then refined by incorporating note ons...

متن کامل

Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation

This paper presents a new statistical model for speech signals, which consists of a time-invariant dictionary incorporating a set of the power spectral densities of excitation signals and a set of all-pole filters where the gain of each pair of excitation and filter elements is allowed to vary over time. We use this model to develop a combined blind separation and dereverberation method for spe...

متن کامل

Source separation

Our group attracted participants from various research areas to discuss aspects of source separation for audio signals, performed either in a blind way or by using additional knowledge about the underlying sources or the mixing process. In many source separation approaches, one assumes that sources are independent, uncorrelated, and do not overlap with regard to a given representation. Also one...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i8.26131