Search results for: bilingual mental lexicon

Number of results: 278779

2012
Longhua Qian Hongling Wang Guodong Zhou Qiaoming Zhu

Bilingual lexicon construction (BLC) from comparable corpora is based on the idea that bilingual similar words tend to occur in similar contexts, usually of words. This, however, introduces noise and leads to low performance. This paper proposes a bilingual dependency mapping model for BLC which encodes a word’s context as a combination of its dependent words and their relationships. This combi...

Journal: CoRR 2001
Akshar Bharati Dipti Misra Sharma Vineet Chaitanya Amba P. Kulkarni Rajeev Sangal Durgesh Rao

The paper reports on efforts taken to create lexical resources pertaining to Indian languages, using the collaborative model. The lexical resources being developed are: (1) transfer lexicon and grammar from English to several Indian languages, and (2) dependency tree bank of annotated corpora for several Indian languages. The dependency trees are based on the Paninian model. (3) is an attempt t...

Journal: TAL 2009
Emmanuel Prochasson Emmanuel Morin

Research on bilingual lexicon extraction from comparable corpora leads to promising results using large corpora (hundreds of billions of words) using the direct alignment method. However, when using smaller corpora (hundreds of thousands of words), results obtained are slightly lower. We propose to introduce some anchor points on which we can rely for the alignment process using the direct appr...

1994
Pascale Fung Kenneth Ward Church

Various methods have been proposed for aligning texts in two or more languages such as the Canadian Parliamentary Debates (Hansards). Some of these methods generate a bilingual lexicon as a by-product. We present an alternative alignment strategy which we call K-vec, that starts by estimating the lexicon. For example, it discovers that the English word fisheries is similar to the French pêches ...
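The idea of estimating a lexicon before aligning can be illustrated with a minimal sketch. It assumes a simple scheme: cut each text into K equal pieces, record the pieces in which each word occurs, and pair words whose occurrence patterns overlap most. The function names, the toy token lists, and the Dice coefficient used for scoring are assumptions made for illustration; the truncated abstract does not state how K-vec scores word pairs.

```python
def piece_vectors(tokens, k):
    """Map each word to the set of piece indices (0..k-1) in which it occurs."""
    n = len(tokens)
    vectors = {}
    for i, tok in enumerate(tokens):
        piece = i * k // n  # index of the piece that position i falls into
        vectors.setdefault(tok, set()).add(piece)
    return vectors


def dice(a, b):
    """Dice overlap between two sets of piece indices (illustrative choice)."""
    return 2 * len(a & b) / (len(a) + len(b))


def best_translations(src_tokens, tgt_tokens, k):
    """For each source word, pick the target word with the most similar pieces."""
    src = piece_vectors(src_tokens, k)
    tgt = piece_vectors(tgt_tokens, k)
    return {s: max(tgt, key=lambda t: dice(sv, tgt[t])) for s, sv in src.items()}


# Hypothetical toy "parallel" texts, already tokenized.
english = ["fisheries", "minister", "house", "debate",
           "fisheries", "budget", "house", "vote"]
french = ["pêches", "ministre", "chambre", "débat",
          "pêches", "budget", "chambre", "vote"]

lexicon = best_translations(english, french, k=4)
print(lexicon["fisheries"])
```

On real parallel text the pieces would be far larger and the pairing far noisier, so a frequency threshold and a stricter association score would likely be needed.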

Journal: Computational Linguistics 1997
Sue J. Ker Jason S. Chang

This paper presents an algorithm capable of identifying the translation for each word in a bilingual corpus. Previously proposed methods rely heavily on word-based statistics. Under a word-based approach, frequent words with a consistent translation can be aligned at a high rate of precision. However, words that are less frequent or exhibit diverse translations generally do not have statistical...

2013
Amir Hazem Emmanuel Morin

Methods dealing with bilingual lexicon extraction from comparable corpora are often based on word co-occurrence observation and are by essence more effective when using large corpora. In most cases, specialized comparable corpora are of small size, and this particularity has a direct impact on bilingual terminology extraction results. In order to overcome insufficient data coverage and to make ...

Journal: TACL 2014
Jason Utt Sebastian Padó

Syntax-based distributional models of lexical semantics provide a flexible and linguistically adequate representation of co-occurrence information. However, their construction requires large, accurately parsed corpora, which are unavailable for most languages. In this paper, we develop a number of methods to overcome this obstacle. We describe (a) a crosslingual approach that constructs a synta...

2011
Darja Fiser Nikola Ljubesic Spela Vintar Senja Pollak

This paper presents a series of experiments aimed at inducing and evaluating domain-specific bilingual lexica from comparable corpora. First, a small English-Slovene comparable corpus from health magazines was manually constructed and then used to compile a large comparable corpus on health-related topics from web corpora. Next, a bilingual lexicon for the domain was extracted from the corpus by...

2013
Dhouha Bouamor Nasredine Semmar Pierre Zweigenbaum

This paper presents a new method that aims to improve the results of the standard approach used for bilingual lexicon extraction from specialized comparable corpora. We attempt to solve the problem of context vector word polysemy. Instead of using all the entries of the dictionary to translate a context vector, we only use the words of the lexicon that are more likely to give the best character...
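For orientation, the baseline this abstract refers to (translating a context vector with every dictionary entry) might look like the sketch below. It is a rough illustration under stated assumptions, not the authors' implementation: the window size, raw co-occurrence counts, cosine similarity, and all names and toy data are hypothetical, and the paper's proposed selection of dictionary entries is not shown.

```python
import math
from collections import Counter


def context_vector(tokens, word, window=2):
    """Count words co-occurring with `word` within +/- `window` positions."""
    vec = Counter()
    for i, tok in enumerate(tokens):
        if tok == word:
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            vec.update(t for j, t in enumerate(tokens[lo:hi], lo) if j != i)
    return vec


def translate_vector(vec, dictionary):
    """Baseline: project counts through ALL dictionary translations of each word."""
    out = Counter()
    for w, count in vec.items():
        for t in dictionary.get(w, []):
            out[t] += count
    return out


def cosine(u, v):
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm = (math.sqrt(sum(x * x for x in u.values()))
            * math.sqrt(sum(x * x for x in v.values())))
    return dot / norm if norm else 0.0


def rank_candidates(src_tokens, tgt_tokens, word, candidates, dictionary):
    """Rank target candidates by similarity to the translated source vector."""
    src_vec = translate_vector(context_vector(src_tokens, word), dictionary)
    scores = {c: cosine(src_vec, context_vector(tgt_tokens, c)) for c in candidates}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy comparable corpora and seed dictionary.
src = ["patient", "receives", "insulin", "injection", "daily"]
tgt = ["patient", "reçoit", "insuline", "injection", "quotidien",
       "médecin", "prescrit", "régime", "strict"]
seed = {"patient": ["patient"], "receives": ["reçoit"],
        "injection": ["injection", "piqûre"], "daily": ["quotidien"]}

ranked = rank_candidates(src, tgt, "insulin", ["insuline", "régime"], seed)
print(ranked)
```

Here "injection" contributes both of its translations to the projected vector, which is the kind of polysemy-induced noise the abstract says the new method tries to reduce.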
