Transcribing broadcast news with the 1997 Abbot System

Authors

  • Gary D. Cook
  • Tony Robinson
Abstract

Recent DARPA CSR evaluations have focused on the transcription of broadcast news from both television and radio programmes [17]. This is a challenging task because the data includes a variety of speaking styles and channel conditions. This paper describes the development of a connectionist-hidden Markov model (HMM) system, and the enhancements designed to improve performance on broadcast news data. Both multilayer perceptron (MLP) and recurrent neural network acoustic models have been investigated. We assess the effect of using gender-dependent acoustic models, and the impact on performance of varying both the number of parameters and the amount of training data used for acoustic modelling. The use of context-dependent phone models is described, and the effect of the number of context classes is investigated. We also describe a method for incorporating syllable boundary information during search. Results are reported on the 1997 DARPA Hub-4 development test set.

Similar resources

Transcription of broadcast television and radio news: the 1996 ABBOT system

This paper describes the development of the cu-con system which participated in the 1996 ARPA Hub 4 Evaluations. The system is based on Abbot, a hybrid connectionist-HMM large vocabulary continuous speech recognition system developed at the Cambridge University Engineering Department [4]. The Hub 4 Evaluation task involves the transcription of broadcast television and radio news programmes. Thi...

Transcription of Broadcast Television and Radio News : The

This paper describes the development of the cu-con system which participated in the 1996 ARPA Hub 4 Evaluations. The system is based on Abbot, a hybrid connectionist-HMM large vocabulary continuous speech recognition system developed at the Cambridge University Engineering Department [4]. The Hub 4 Evaluation task involves the transcription of broadcast television and radio news programmes. Thi...

Toward Automatic Recognition of Japanese Broadcast News

In this paper we report on automatic recognition of Japanese broadcast-news speech. We have been working on large-vocabulary continuous speech recognition (LVCSR) for Japanese newspaper speech transcription and achieved reasonably good performance. We have recently applied our LVCSR system to transcribing Japanese broadcast-news speech. We extended the vocabulary to 20k words and trained the lan...

Toward automatic transcription of Japanese broadcast news

In this paper, we report on the automatic recognition of Japanese broadcast-news speech. We have been working on large-vocabulary continuous speech recognition (LVCSR) for Japanese newspaper speech transcription and have achieved good performance. We have recently applied our LVCSR system to transcribing Japanese broadcast-news speech. We extended the vocabulary from 7k words to 20k words and tr...

The THISL Spoken Document Retrieval System

THISL is an ESPRIT Long Term Research Project focused on the development and construction of a system to retrieve items from an archive of television and radio news broadcasts. In this paper we outline our spoken document retrieval system based on the ABBOT speech recognizer and a text retrieval system based on Okapi term-weighting. The system has been evaluated as part of the TREC-6 and TREC-7 spoken doc...

Publication date: 1998