نتایج جستجو برای: phrase based treatment

تعداد نتایج: 4131590  

2004
Stephan Vogel Sanjika Hewavitharana Muntsin Kolss Alexander H. Waibel

In this paper we describe the components of our statistical machine translation system used for the spoken language translation evaluation campaign. This system is based on phrase-to-phrase translations extracted from a bilingual corpus. A new phrase alignment approaches will be introduced, which finds the target phrase by optimizing the overall word-to-word alignment for the sentence pair unde...

2016
Roald Eiselen

Phrase chunking remains an important natural language processing (NLP) technique for intermediate syntactic processing. This paper describes the development of protocols, annotated phrase chunking data sets and automatic phrase chunkers for ten South African languages. Various problems with adapting the existing annotation protocols of English are discussed as well as an overview of the annotat...

2007
Jesús Tomás Jaime Lloret Mauri Francisco Casacuberta

Phrase-based statistical models constitute one of the most competitive pattern-recognition approaches to machine translation. In this case, the source sentence is fragmented into phrases, then, each phrase is translated by using a stochastic dictionary. One shortcoming of this phrase-based model is that it does not have an adequate generalization capability. If a sequence of words has not been ...

2010
Carlos A. Henríquez Q. Marta R. Costa-Jussà Vidas Daudaravicius Rafael E. Banchs José B. Mariño

This paper describes the 2010 phrase-based statistical machine translation system developed at the TALP Research Center of the UPC in cooperation with BMIC and VMU. In phrase-based SMT, the phrase table is the main tool in translation. It is created extracting phrases from an aligned parallel corpus and then computing translation model scores with them. Performing a collocation segmentation ove...

Journal: :Journal of the International Phonetic Association 2021

The article analyses violations of the Obligatory Contour Principle (OCP) above word level in Tswana, a Southern Bantu language, by investigating realization adjacent lexical high tones across boundaries. results show that boundaries downstep (i.e. lowering second series tones) only takes place within phonological phrase. A phrase break blocks downstep, even when necessary tonal configuration i...

2011
Mridul Gupta Sanjika Hewavitharana Stephan Vogel

Phrase alignment is a crucial step in phrase-based statistical machine translation. We explore a way of improving phrase alignment by adding syntactic information in the form of chunks as soft constraints guided by an in-depth and detailed analysis on a hand-aligned data set. We extend a probabilistic phrase alignment model that extracts phrase pairs by optimizing phrase pair boundaries over th...

1999
Tavs Bjerre

Support Verb Constructions (SVCs) pose interesting problems because of their mixed nature, with idiomatic properties in the sense that they do not seem to comply with general rules, and yet displaying a large number of regularities. The problems we will be concerned with in this paper are the following: • What is event structure, and how is expressed in SVCs? • What verbs may function as suppor...

2014
Spyros Martzoukos Christof Monz Christophe Costa Florêncio

Given a pair of source and target language sentences which are translations of each other with known word alignments between them, we extract bilingual phrase-level segmentations of such a pair. This is done by identifying two appropriate measures that assess the quality of phrase segments, one on the monolingual level for both language sides, and one on the bilingual level. The monolingual mea...

2010
Sankaranarayanan Ananthakrishnan Rohit Prasad Premkumar Natarajan

The performance of phrase-based statistical machine translation (SMT) systems is crucially dependent on the quality of the extracted phrase pairs, which is in turn a function of word alignment quality. Data sparsity, an inherent problem in SMT even with large training corpora, often has an adverse impact on the reliability of the extracted phrase translation pairs. In this paper, we present a n...

Journal: :Lexicographica 2022

Abstract This article is intended as the first of a series papers designing an electronic linguistic resource made up three modules: (1) phrase-based active dictionary thought attempt to implement John Sinclair’s vision “ultimate dictionary”; (2) grammar / construction describing not only morphologic and syntactic rules language but also its systematic (semantic) alternations derivations genera...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید