نتایج جستجو برای: bilingual lexicon

تعداد نتایج: 20633  

2011
Ingemar Hjälmstad Martin Duneld Maria Skeppstedt

A common problem when combining two bilingual dictionaries to make a third, using one common language as a pivot language, is the emergence of false translations due to lexical ambiguity between words in the languages involved. This paper examines if the translation accuracy improves when using part-of-speech filtering of translation candidates. To examine this, two different Japanese-Swedish l...

2004
Takehito Utsuro Kohei Hino Mitsuhiro Kida Seiichi Nakagawa Satoshi Sato

In the framework of bilingual lexicon acquisition from cross-lingually relevant news articles on the Web, it is relatively harder to reliably estimate bilingual term correspondences for low frequency terms. Considering such a situation, this paper proposes to complementarily use much larger monolingual Web documents collected by search engines, as a resource for reliably re-estimating bilingual...

2012
Longhua Qian Hongling Wang Guodong Zhou Qiaoming Zhu

Bilingual lexicon construction (BLC) from comparable corpora is based on the idea that bilingual similar words tend to occur in similar contexts, usually of words. This, however, introduces noise and leads to low performance. This paper proposes a bilingual dependency mapping model for BLC which encodes a word’s context as a combination of its dependent words and their relationships. This combi...

Journal: :CoRR 2001
Akshar Bharati Dipti Misra Sharma Vineet Chaitanya Amba P. Kulkarni Rajeev Sangal Durgesh Rao

The paper reports on efforts taken to create lexical resources pertaining to Indian languages, using the collaborative model. The lexical resources being developed are: (1) transfer lexicon and grammar from English to several Indian languages, and (2) dependency tree bank of annotated corpora for several Indian languages. The dependency trees are based on the Paninian model. (3) is an attempt t...

Journal: :TAL 2009
Emmanuel Prochasson Emmanuel Morin

Research on bilingual lexicon extraction from comparable corpora leads to promising results using large corpora (hundreds of billions of words) using the direct alignment method. However, when using smaller corpora (hundreds of thousands of words), results obtained are slightly lower. We propose to introduce some anchor points on which we can rely for the alignment process using the direct appr...

1994
Pascale Fung Kenneth Ward Church

Various methods have been proposed for aligning texts in two or more languages such as the Canadian Parliamentary Debates (Hansards). Some of these methods generate a bilingual lexicon as a by-product. We present an alternative alignment strategy which we call K-vec, that starts by estimating the lexicon. For example, it discovers that the English word fisheries is similar to the French pêches ...

Journal: :Computational Linguistics 1997
Sue J. Ker Jason S. Chang

This paper presents an algorithm capable of identifying the translation for each word in a bilingual corpus. Previously proposed methods rely heavily on word-based statistics. Under a word-based approach, frequent words with a consistent translation can be aligned at a high rate of precision. However, words that are less frequent or exhibit diverse translations generally do not have statistical...

2013
Amir Hazem Emmanuel Morin

Methods dealing with bilingual lexicon extraction from comparable corpora are often based on word co-occurrence observation and are by essence more effective when using large corpora. In most cases, specialized comparable corpora are of small size, and this particularity has a direct impact on bilingual terminology extraction results. In order to overcome insufficient data coverage and to make ...

Journal: :TACL 2014
Jason Utt Sebastian Padó

Syntax-based distributional models of lexical semantics provide a flexible and linguistically adequate representation of co-occurrence information. However, their construction requires large, accurately parsed corpora, which are unavailable for most languages. In this paper, we develop a number of methods to overcome this obstacle. We describe (a) a crosslingual approach that constructs a synta...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید