Exploiting Syntactic Structure for Language Modeling

نویسندگان

  • Ciprian Chelba
  • Frederick Jelinek
چکیده

The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words–binary-parse-structure with headword annotation and operates in a left-to-right manner — therefore usable for automatic speech recognition. The model, its probabilistic parameterization, and a set of experiments meant to evaluate its predictive power are presented; an improvement over standard trigram modeling is achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Syntactic Structure for Natural Language Modeling

The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parser. A maximum likelihood reestimation procedure belonging to the class of expectation-maximization...

متن کامل

Syntactic Properties of Language of Scientific Communication in Persian Scientific Works

Purpose: The language of science is one of the social types of Persian language, which is used by the educated classes in scientific works and contexts. The purpose of this research is to present an overall picture of the syntactic properties of the Persian scientific language. The types of sentences, types of tenses, verb tenses, and syntactic constructions have been identified in the scientif...

متن کامل

Gender-Based investigation of the Syntactic Development of Iranian EFL Learners: A Focus on Processabilty Theory

Pienemann (1998, 2015) put forward Processability Theory to enlighten why language learners follow definite developmental paths. The aim of the present study was to run a comparative investigation into the difficulty order of different grammatical structures for male and female Iranian EFL learners predicted by Processability Theory. 185 Iranian university students took part in this study. They...

متن کامل

Language Model Adaptation Using Dirichlet Class Language Model Based on Part-of-Speech

Language modeling has many applications in a large variety of domains. Performance of this model depends on its adaptation to a particular style of data. Accordingly, adaptation methods endeavour to apply syntactic and semantic characteristics of the language for language modeling. The previous adaptation methods such as family of Dirichlet class language model (DCLM) extract class of history w...

متن کامل

Maximum entropy techniques for exploiting syntactic, semantic and collocational dependencies in language modeling

A new statistical language model is presented which combines collocational dependencies with two important sources of long-range statistical dependence: the syntactic structure and the topic of a sentence. These dependencies or constraints are integrated using the maximum entropy technique. Substantial improvements are demonstrated over a trigram model in both perplexity and speech recognition ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998