tdnn

The Tempo 2 Algorithm: Adjusting Time-Delays By Supervised Learning

1990

Ulrich Bodenhausen Alexander H. Waibel

In this work we describe a new method that automatically adjusts time-delays and the widths of time-windows in artificial neural networks. The inputs of the units are weighted by a Gaussian input window over time, which allows the learning rules for the delays and widths to be derived in the same way as those for the weights. Our results on a phoneme classification task compare well with res...
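The Gaussian input window described in this abstract can be illustrated with a minimal sketch. The function name `gaussian_window_input`, the unnormalized window, and the single-input setup are assumptions made for illustration and are not taken from the paper. The point is only that the result is a smooth function of the delay and the width, so both can be trained by gradient descent like an ordinary weight.

```python
import math

def gaussian_window_input(signal, t, weight, delay, width):
    """Weighted input of one unit at time t (hypothetical helper).

    The window is centered at t - delay and has standard deviation `width`.
    No normalization is applied; that choice is an assumption of this sketch.
    """
    center = t - delay
    total = 0.0
    for tau, value in enumerate(signal):
        # Gaussian weighting of the sample at time tau
        total += value * math.exp(-((tau - center) ** 2) / (2.0 * width ** 2))
    return weight * total

# An impulse at time step 3, read at t = 5 with two different delays.
impulse = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0]
on_center = gaussian_window_input(impulse, t=5, weight=2.0, delay=2.0, width=1.0)
off_center = gaussian_window_input(impulse, t=5, weight=2.0, delay=1.0, width=1.0)
```

When the delay places the window exactly on the impulse, the unit receives the full weight; shifting the delay by one step attenuates the input smoothly rather than switching it off.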


Neural Networks vs Genetically Optimized Neural Networks in Time Series Prediction

2010

Ion Railean Sorin Moga Monica Borda Cristina Stolojescu

This paper deals with methods for finding suitable weights in an Artificial Neural Network (ANN) using Genetic Algorithms (GA). We study the weaknesses and strengths of the proposed approach in the case of statistical data forecasting. We describe a different approach to using the input data during the optimization phase. Besides GA, we applied stationary wavelet transform (SWT) as a signal pre...


Recognition of Handwritten ZIP Codes in a Postal Sorting System

Journal: KI 1999

Marcus Pfister Sven Behnke Raúl Rojas

In this article, we describe the OCR and image processing algorithms used to read destination addresses from nonstandard letters (flats) by the Siemens postal automation system currently in use by the Deutsche Post AG. The article concentrates mainly on the two classifiers used to recognize handprinted digits. One of them is a complex time delayed neural network (TDNN) used to classify scaled di...


Recognition of spelled names over the telephone

1996

Hermann Hild Alexander H. Waibel

Recognition of spelled names over the telephone line is essential for applications such as telephone directory assistance, or automatic mail ordering. We present recognition results on the spelling section of the OGI Spelled and Spoken Word Telephone Corpus, using a Multi-State Time Delay Neural Network (MS-TDNN). Many applications allow for strong language modeling constraints. In our experime...


Corrigendum: Gas-bearing prediction of deep reservoir based on DNN embeddings

Journal: Frontiers in Earth Science 2023

Shuying Ma, Junxing Cao, Zhege Liu, Xudong Jiang, Zhaodong Su, Yajuan Xue. Correspondence: [email protected]. Keywords: gas-bearing prediction, Cepstrum, Embedding, TDNN, LSTM, deep reservoirs. Corrigendum on: Ma, S., Cao, J., Liu, Z., Jiang, X., Su, Z., and Xue, Y. (2023). Gas-bearing prediction of deep reservoir based on DNN embe...


Speaker-independent 3D face synthesis driven by speech and text

Journal: Signal Processing 2006

Arman Savran Levent M. Arslan Lale Akarun

In this study, a complete system that generates visual speech by synthesizing 3D face points has been implemented. The estimated face points drive MPEG-4 facial animation. This system is speaker independent and can be driven by audio or both audio and text. The synthesis of visual speech was realized by a codebook-based technique, which is trained with audio-visual data from a speaker. An audio...


Text-to-speech conversion with neural networks: a recurrent TDNN approach

Journal: CoRR 1997

Orhan Karaali Gerald Corrigan Ira A. Gerson Noel Massey

This paper describes the design of a neural network that performs the phonetic-to-acoustic mapping in a speech synthesis system. The use of a time-domain neural network architecture limits discontinuities that occur at phone boundaries. Recurrent data input also helps smooth the output parameter tracks. Independent testing has demonstrated that the voice quality produced by this system compares...


State Space Reconstruction of Chaotic Time Series Using an Intelligent Method

Journal: روش‌های هوشمند در صنعت برق (Intelligent Methods in the Electric Power Industry) 2010

محمد عطایی, مریم پری زنگنه, پیمان معلم

Using time series (that is, our observations of the process over time) is an effective approach to analyzing these systems. In effect, the emphasis is on how to arrive at a finite-dimensional state-space structure from observations in the form of a scalar time series of the process, which is the only information we have about some systems. State-space reconstruction is built on embedding theory, whose application requires determining suitable values for two para...
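As a rough illustration of state-space reconstruction from a scalar time series, the following sketch builds delay-coordinate vectors. The abstract is truncated before it names its two quantities, so the `dimension` and `delay` arguments used here are an assumption based on the usual delay-embedding setup, and `delay_embed` is a hypothetical helper name rather than the authors' method.

```python
def delay_embed(series, dimension, delay):
    """Return delay-coordinate vectors built from a scalar series.

    Each vector is [x(i), x(i + delay), ..., x(i + (dimension - 1) * delay)].
    `dimension` and `delay` are assumed parameter names for this sketch.
    """
    n_vectors = len(series) - (dimension - 1) * delay
    if n_vectors <= 0:
        raise ValueError("series is too short for this dimension and delay")
    return [
        [series[i + k * delay] for k in range(dimension)]
        for i in range(n_vectors)
    ]

# Six scalar observations become two points in a 3-dimensional state space.
vectors = delay_embed([0, 1, 2, 3, 4, 5], dimension=3, delay=2)
```

Choosing these two values is the hard part in practice; the sketch only shows the mechanical step that follows once they are fixed.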


Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing 2022

State-of-the-art automatic speech recognition (ASR) system development is data and computation intensive. The optimal design of deep neural networks (DNNs) for these systems often requires expert knowledge and empirical evaluation. In this paper, a range of neural architecture search (NAS) techniques are used to automatically learn two types of hyper-parameters of factored time delay neural networks (TDNN-Fs): i) the left and right sp...


The Gamma MLP for Speech

1996

Steve Lawrence Ah Chung Tsoi Andrew D. Back

We define a Gamma multi-layer perceptron (MLP) as an MLP with the usual synaptic weights replaced by gamma filters (as proposed by de Vries and Principe (de Vries & Principe 1992)) and associated gain terms throughout all layers. We derive gradient descent update equations and apply the model to the recognition of speech phonemes. We find that both the inclusion of gamma filters in all layers, and the...
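A gamma filter in the sense of de Vries and Principe can be sketched as a cascade of leaky taps, where tap k follows x_k(t) = (1 - mu) x_k(t-1) + mu x_{k-1}(t-1) and x_0(t) is the current input. The function name `gamma_filter`, the zero initial state, and the omission of the gain terms mentioned in the abstract are assumptions of this sketch, not details from the paper.

```python
def gamma_filter(inputs, weights, mu):
    """Run a single gamma filter over a sequence (hypothetical helper).

    weights[k] multiplies tap k; tap 0 is the current input.
    mu = 1.0 reduces the filter to an ordinary tapped delay line.
    """
    order = len(weights) - 1
    taps = [0.0] * (order + 1)  # assumed zero initial state
    outputs = []
    for u in inputs:
        prev = taps[:]  # tap values from the previous time step
        taps[0] = u
        for k in range(1, order + 1):
            taps[k] = (1.0 - mu) * prev[k] + mu * prev[k - 1]
        outputs.append(sum(w * x for w, x in zip(weights, taps)))
    return outputs

# Impulse response of the first tap for two settings of mu.
delay_line = gamma_filter([1.0, 0.0, 0.0], weights=[0.0, 1.0], mu=1.0)
leaky = gamma_filter([1.0, 0.0, 0.0], weights=[0.0, 1.0], mu=0.5)
```

With `mu` below 1 the impulse is smeared over later time steps, which is how a gamma filter trades temporal resolution for memory depth without adding taps.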
