multi word lexical units

نتایج جستجو برای: multi word lexical units

تعداد نتایج: 741999 فیلتر نتایج به سال:

Rule-based Automatic Multi-word Term Extraction and Lemmatization

2016

Ranka Stankovic Cvetana Krstev Ivan Obradovic Biljana Lazic Aleksandra Trtovac

In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass ...

متن کامل

First approach to the selection of lexical units for continuous speech recognition of Basque

2000

Karmele López de Ipiña M. Inés Torres Lourdes Oñederra Amparo Varona Nerea Ezeiza Mikel Peñagarikano M. Hernández Luis Javier Rodríguez-Fuentes

The selection of appropriated Lexical Units is an important issue in the Language Model (LM) generation. Word has been used classically as unit in most of the Continuous Speech Recognition systems. However, during the last years proposals of non-word units have begun to appear. Since Basque is an agglutinative language with a certain structure inside the word, the nonword units could be an adeq...

متن کامل

Mixed up with Machine Translation: Multi-word Units Disambiguation Challenge

2010

Anabela BARREIRO Annibale ELIA Johanna MONTI Mario MONTELEONE Anabela Barreiro

With the rapid evolution of the Internet, translation has become part of the daily life of ordinary users, not only of professional translators. Machine translation has evolved along with different types of computer-assisted translation tools. Qualitative progress has been made in the field of machine translation, but not all problems have been solved. One problem in particular, namely the poor...

متن کامل

Phonemic similarity effects and prelexical phonology.

Journal: :Memory & cognition 1990

G Lukatela M T Turvey

Ten experiments were conducted on visually presented Serbo-Croatian words and pseudowords, comprising phonemically similar and dissimilar context-target sequences. There were five main results. First, phonemic similarity effects in both lexical decision and naming are independent of graphemic similarity. Second, phonemic similarity need not facilitate lexical decision; the direction of its effe...

متن کامل

Towards Full Automation Of Lexicon Construction

2004

Richard Rohwer Dayne Freitag

We describe work in progress aimed at developing methods for automatically constructing a lexicon using only statistical data derived from analysis of corpora, a problem we call lexical optimization. Specifically, we use statistical methods alone to obtain information equivalent to syntactic categories, and to discover the semantically meaningful units of text, which may be multi-word units or ...

متن کامل

Selection of Lexical Units for Continuous Speech Recognition of Basque

2003

Karmele López de Ipiña Manuel Graña Nerea Ezeiza M. Hernández Ekaitz Zulueta Aitzol Ezeiza C. Tovar

The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continuous Speech Recognition (CSR) systems. Words have been used classically as the recognition unit in most of them. However, proposals of nonword units are beginning to arise. Basque is an agglutinative language with some structure inside words, for which non-word morpheme like units could be an appr...

متن کامل

Obtaining Japanese Lexical Units for Semantic Frames from Berkeley FrameNet Using a Bilingual Corpus

2005

Toshiyuki Kanamaru Masaki Murata Kow Kuroda Hitoshi Isahara

An attempt was made to semi-automatically obtain “lexical units” (LUs) for Japanese from the English LUs defined in the semantic frame database provided by Berkeley FrameNet (BFN) using an English-Japanese bilingual corpus. This task was a prerequisite to building a complete database of semantic frames for Japanese. In the task, a Japanese word is first translated into an English word or phrase...

متن کامل

Cross-word sub-word units for low-resource keyword spotting

2014

William Hartmann Lori Lamel Jean-Luc Gauvain

We investigate the use of sub-word lexical units for the detection of out-of-vocabulary (OOV) keywords in the keyword spotting task. Sub-word units based on morphological decomposition and character ngrams are compared. In particular, we examine the benefit of sub-word units that cross word boundaries. Experiments are performed on the IARPA Babel Turkish dataset. Our results demonstrate that cr...

متن کامل

A Lexical Database of Portuguese Multiword Expressions

2006

Sandra Antunes Maria Fernanda Bacelar do Nascimento João Miguel Casteleiro Amália Mendes Luísa Pereira Tiago Sá

This presentation focuses on an ongoing project which aims at the creation of a large lexical database of Portuguese multiword (MW) units, automatically extracted through the analysis of a balanced 50 million word corpus, statistically interpreted with lexical association measures and validated by hand. This database covers different types of MW units, like named entities, and lexical associati...

متن کامل

Eye-tracking multi-word units: some methodological questions

Journal: :Journal of Eye Movement Research 2015

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید