Search results for: statistical language model
Number of results: 2,689,345
In this paper, we propose a new language model based on dependent word sequences organized in a multi-level hierarchy. We call this model MC n, where n is the maximum number of words in a sequence and the second parameter is the maximum number of levels. The originality of this model lies in its capacity to take dependent variable-length sequences into account for very large vocabularies. In order to discover the variab...
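The abstract above is truncated before the sequence-discovery procedure is described. As a purely illustrative sketch of what "discovering variable-length word sequences" could look like, the snippet below counts all word sequences up to a maximum length and keeps the frequent ones; the function name, thresholds, and toy corpus are assumptions, not the paper's MC n algorithm.

```python
# Hypothetical sketch: discover frequent variable-length word sequences
# (up to max_len words) from a corpus by frequency counting.
from collections import Counter

def discover_sequences(sentences, max_len=4, min_count=2):
    """Count every word sequence of length 1..max_len and keep frequent ones."""
    counts = Counter()
    for sent in sentences:
        words = sent.split()
        for length in range(1, max_len + 1):
            for i in range(len(words) - length + 1):
                counts[tuple(words[i:i + length])] += 1
    return {seq: c for seq, c in counts.items() if c >= min_count}

corpus = [
    "the language model assigns a probability",
    "the language model is trained on text",
    "a statistical language model assigns a probability to text",
]
for seq, c in sorted(discover_sequences(corpus).items(), key=lambda x: -x[1]):
    print(" ".join(seq), c)
```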
Reordering is a challenge for machine translation (MT) systems. In MT, the widely used approach is to apply a word-based language model (LM), which treats the constituent units of a sentence as words. In speech recognition (SR), some phrase-based LMs have been proposed. However, those LMs are not necessarily suitable or optimal for reordering. We propose two phrase-based LMs which consider the c...
A neural probabilistic language model (NPLM) offers a way to achieve better perplexity than an n-gram language model and its smoothed variants. This paper investigates its application in bilingual NLP, specifically Statistical Machine Translation (SMT). We focus on the perspective that an NPLM has the potential to complement potentially ‘huge’ monolingual resour...
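For context, a Bengio-style NPLM predicts the next word by embedding the context words, passing them through a hidden layer, and taking a softmax over the vocabulary. The sketch below shows only that forward pass with random, untrained weights; the toy vocabulary and dimensions are assumptions for illustration.

```python
# Minimal NPLM forward pass: embeddings -> tanh hidden layer -> softmax.
# Weights are random here; a real model would be trained with SGD.
import numpy as np

rng = np.random.default_rng(0)
vocab = ["<s>", "the", "language", "model", "is", "trained", "</s>"]
V, d, h, context = len(vocab), 16, 32, 2        # vocab, embedding, hidden, n-1

C = rng.normal(0, 0.1, (V, d))                  # word embedding matrix
H = rng.normal(0, 0.1, (context * d, h))        # input-to-hidden weights
U = rng.normal(0, 0.1, (h, V))                  # hidden-to-output weights

def next_word_probs(context_ids):
    x = np.concatenate([C[i] for i in context_ids])  # concatenated embeddings
    a = np.tanh(x @ H)                               # hidden activations
    logits = a @ U
    e = np.exp(logits - logits.max())                # numerically stable softmax
    return e / e.sum()

w2i = {w: i for i, w in enumerate(vocab)}
p = next_word_probs([w2i["<s>"], w2i["the"]])
print("P(next = 'language') =", p[w2i["language"]])
```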
This paper describes a thesaurus-based class n-gram model for broadcast news transcription. The most important issue for class n-gram models is how to develop the word classification. We construct a word classification mapping based on a thesaurus so as to maximize the average mutual information function on a training corpus. To examine the effectiveness of the new method, we compare i...
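In a class n-gram model, the word bigram probability is factored through word classes: P(w_i | w_{i-1}) ≈ P(w_i | c(w_i)) · P(c(w_i) | c(w_{i-1})). The sketch below illustrates that factorization only; the hand-made word-to-class map and toy corpus are assumptions, whereas the paper derives the classification from a thesaurus by maximizing average mutual information.

```python
# Class bigram probability: emission P(w|c) times class transition P(c|c_prev).
from collections import Counter

word2class = {"monday": "DAY", "tuesday": "DAY", "rain": "WEATHER",
              "snow": "WEATHER", "on": "FUNC", "expect": "FUNC"}

corpus = "expect rain on monday expect snow on tuesday".split()

class_uni = Counter(word2class[w] for w in corpus)
class_bi = Counter((word2class[a], word2class[b]) for a, b in zip(corpus, corpus[1:]))
word_counts = Counter(corpus)

def p_class_bigram(prev_word, word):
    c_prev, c = word2class[prev_word], word2class[word]
    p_w_given_c = word_counts[word] / class_uni[c]               # emission
    prev_total = sum(n for (a, _), n in class_bi.items() if a == c_prev)
    p_c_given_cprev = class_bi[(c_prev, c)] / prev_total         # transition
    return p_w_given_c * p_c_given_cprev

print(p_class_bigram("on", "monday"))   # P(monday | on) under the class model
```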
A novel variation of the modified Kneser-Ney model using monomial discounting is presented and integrated into the Moses statistical machine translation toolkit. The language model is trained on a large training set as usual, but its new discount parameters are tuned on the small development set. An in-domain and cross-domain evaluation of the language model is performed based on perplexity, in whi...
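As a baseline point of reference, the sketch below shows a standard interpolated, absolutely discounted bigram in the Kneser-Ney family, where a single discount D plays the role of the tunable parameters the abstract says are fit on a development set. The exact monomial discounting form is specific to the paper and is not reproduced here; the toy corpus and fixed D are assumptions.

```python
# Interpolated Kneser-Ney-style bigram with a single absolute discount D.
from collections import Counter

train = "the cat sat on the mat the cat ate".split()
bigrams = Counter(zip(train, train[1:]))
unigrams = Counter(train)

# Continuation counts: in how many distinct contexts does each word appear?
continuation = Counter(w for (_, w) in bigrams)
total_bigram_types = len(bigrams)

def p_kn(prev, word, D=0.75):
    c_bi = bigrams[(prev, word)]
    c_prev = unigrams[prev]
    # Back-off weight: probability mass freed by discounting all bigrams
    # that start with `prev`, redistributed via the continuation distribution.
    lam = D * sum(1 for (a, _) in bigrams if a == prev) / c_prev
    p_cont = continuation[word] / total_bigram_types
    return max(c_bi - D, 0) / c_prev + lam * p_cont

print(p_kn("the", "cat"))
```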
This paper presents a novel method to segment and decode DNA sequences based on an n-gram statistical language model. First, by analyzing the genomes of 12 model species, we find that most DNA “words” are 12 to 15 bp long. The language entropy of DNA sequences is bounded at about 1.5674 bits. After building an n-gram biological language model, we design an unsupervised ‘probability approach...
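The abstract is cut off before the segmentation procedure is given. One common way to segment a string under a language model is dynamic programming over all splits, choosing the segmentation with the highest probability; the sketch below illustrates that idea with a toy inventory of DNA "words" and made-up probabilities, which are assumptions rather than the paper's learned model.

```python
# Viterbi-style DP segmentation: best[i] = best log-prob of segmenting seq[:i].
import math

word_logprob = {                       # hypothetical DNA "words"
    "ATG": math.log(0.20), "GATTACA": math.log(0.05),
    "CG": math.log(0.10), "TTAG": math.log(0.08),
    "A": math.log(0.01), "T": math.log(0.01),
    "G": math.log(0.01), "C": math.log(0.01),
}
max_word_len = max(len(w) for w in word_logprob)

def segment(seq):
    best = [0.0] + [float("-inf")] * len(seq)
    back = [0] * (len(seq) + 1)
    for i in range(1, len(seq) + 1):
        for j in range(max(0, i - max_word_len), i):
            w = seq[j:i]
            if w in word_logprob and best[j] + word_logprob[w] > best[i]:
                best[i], back[i] = best[j] + word_logprob[w], j
    words, i = [], len(seq)
    while i > 0:
        words.append(seq[back[i]:i])
        i = back[i]
    return words[::-1]

print(segment("ATGGATTACACGTTAG"))   # -> ['ATG', 'GATTACA', 'CG', 'TTAG']
```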
The language model (LM) is a critical component in most statistical machine translation (SMT) systems, serving to establish a probability distribution over the hypothesis space. Most SMT systems use a static LM, independent of the source language input. While previous work has shown that adapting LMs based on the input improves SMT performance, none of the techniques has thus far been shown to ...
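One simple form of input-dependent LM adaptation is to interpolate a static background model with a small model estimated from text relevant to the current source sentence. The sketch below shows that interpolation with toy unigram models and a fixed weight; all of it is illustrative and is not claimed to be the technique the paper proposes.

```python
# Interpolate a static background LM with an input-adapted component.
from collections import Counter

def unigram_model(tokens):
    counts = Counter(tokens)
    total = sum(counts.values())
    return lambda w: counts[w] / total if total else 0.0

background = unigram_model("the bank approved the loan yesterday".split())
input_adapted = unigram_model("the river bank was flooded".split())

def p_adapted(word, lam=0.7):
    # lam weights the static LM; (1 - lam) weights the input-adapted component
    return lam * background(word) + (1 - lam) * input_adapted(word)

print(p_adapted("bank"), p_adapted("river"))
```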
Culture is an inseparable part of a language. In other words, mastering a language and being able to communicate through it inevitably entails integrating with the culture of its speakers, which is a reflection of people's identity. The aim of the present study was to design a model of Iranian cultural identity. Initially, to select a homogeneous sample of learners at the adva...
We consider phrase-based language models (LMs), which generalize the commonly used word-level models. A similar concept of phrase-based LMs appears in speech recognition, but it is rather specialized and thus less suitable for machine translation (MT). In contrast to the dependency LM, we first introduce exhaustive phrase-based LMs tailored for MT use. Preliminary experimental results show that...
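To make the "exhaustive" idea concrete, one reading is that a sentence is scored by summing over all segmentations into known phrases, with each phrase treated as a single LM unit. The sketch below does exactly that with toy phrase probabilities; the inventory and values are assumptions, not the paper's model.

```python
# Sum sentence probability over every segmentation into known phrases.
phrase_prob = {
    ("new", "york"): 0.02, ("in",): 0.05, ("i", "live"): 0.01,
    ("i",): 0.04, ("live",): 0.01, ("new",): 0.005, ("york",): 0.003,
}
max_phrase_len = max(len(p) for p in phrase_prob)

def sentence_prob(words, start=0):
    """Sum of products of phrase probabilities over every segmentation."""
    if start == len(words):
        return 1.0
    total = 0.0
    for end in range(start + 1, min(len(words), start + max_phrase_len) + 1):
        phrase = tuple(words[start:end])
        if phrase in phrase_prob:
            total += phrase_prob[phrase] * sentence_prob(words, end)
    return total

print(sentence_prob("i live in new york".split()))
```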