Speaker Identification Using 2-d Dct, Walsh and Haar on Full and Block Spectrogram

نویسندگان

Tanuja K. Sarode

Shachi J. Natu

Prachi J. Natu

چکیده

This paper aims to provide different approaches to text dependent speaker identification using DCT, Walsh and Haar transform along with use of spectrograms. Spectrograms obtained from speech samples are used as image database for the study undertaken. This image database is then subjected to various transforms. Using Euclidean distance as measure of similarity, most appropriate speaker match is obtained and is declared as identified speaker. Each transform is applied to spectrograms in two different ways: on full image and on image blocks. In both the ways, effect of different number of coefficients of transformed image is observed. Haar transform on full image reduces multiplications required by DCT and Walsh by 28 times whereas applying Haar transform on image blocks requires 18 times less mathematical computations as compared to DCT and Walsh on image blocks. Transforms when applied to image blocks, yield better or equal identification rates with reduced computational complexity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT On Row Mean Of Spectrogram For Speaker Identification

The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performa...

متن کامل

Speaker Identification using Frequency Dsitribution in the Transform Domain

In this paper, we propose Speaker Identification using the frequency distribution of various transforms like DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), DST (Discrete Sine Transform), Hartley, Walsh, Haar and Kekre transforms. The speech signal spoken by a particular speaker is converted into frequency domain by applying the different transform techniques. The distributio...

متن کامل

Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes

In this paper, we propose Speaker Identification using two transforms, namely Haar Transform and Kekre’s Transform. The speech signal spoken by a particular speaker is converted into a spectrogram by using 25% and 50% overlap between consecutive sample vectors. The two transforms are applied on the spectrogram. The row mean of the transformed matrix forms the feature vector, which is used in th...

متن کامل

Effect of Varying Embedding Energy and Energy wise Sorting on Watermarking using Wavelet Transforms of Orthogonal Transforms DCT, Walsh and Haar

This paper proposes a robust watermarking technique using wavelets of well-known transforms DCT, Walsh and Haar. HL and LH bands are separately selected for watermark insertion. While inserting watermark, its energy is varied with 40% margin to original energy of region of host selected for insertion of watermark to study effect on robustness. Performance of proposed technique is evaluated agai...

متن کامل

Robust Watermarking in Mid-Frequency Band in Transform Domain using Different Transforms with Full, Row and Column Version and Varying Embedding Energy

This paper proposes a watermarking technique using sinusoidal orthogonal transforms DCT, DST, Real Fourier transform and Sine-cosine transform and non-sinusoidal orthogonal transforms Walsh and Haar. These transforms are used in full, column and row version to embed the watermark and their performance is compared. Also using energy conservation property of transforms, different percentage of ho...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Speaker Identification Using 2-d Dct, Walsh and Haar on Full and Block Spectrogram

نویسندگان

چکیده

منابع مشابه

Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT On Row Mean Of Spectrogram For Speaker Identification

Speaker Identification using Frequency Dsitribution in the Transform Domain

Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes

Effect of Varying Embedding Energy and Energy wise Sorting on Watermarking using Wavelet Transforms of Orthogonal Transforms DCT, Walsh and Haar

Robust Watermarking in Mid-Frequency Band in Transform Domain using Different Transforms with Full, Row and Column Version and Varying Embedding Energy

عنوان ژورنال:

اشتراک گذاری