نتایج جستجو برای: multi word lexical units
تعداد نتایج: 741999 فیلتر نتایج به سال:
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass ...
The selection of appropriated Lexical Units is an important issue in the Language Model (LM) generation. Word has been used classically as unit in most of the Continuous Speech Recognition systems. However, during the last years proposals of non-word units have begun to appear. Since Basque is an agglutinative language with a certain structure inside the word, the nonword units could be an adeq...
With the rapid evolution of the Internet, translation has become part of the daily life of ordinary users, not only of professional translators. Machine translation has evolved along with different types of computer-assisted translation tools. Qualitative progress has been made in the field of machine translation, but not all problems have been solved. One problem in particular, namely the poor...
Ten experiments were conducted on visually presented Serbo-Croatian words and pseudowords, comprising phonemically similar and dissimilar context-target sequences. There were five main results. First, phonemic similarity effects in both lexical decision and naming are independent of graphemic similarity. Second, phonemic similarity need not facilitate lexical decision; the direction of its effe...
We describe work in progress aimed at developing methods for automatically constructing a lexicon using only statistical data derived from analysis of corpora, a problem we call lexical optimization. Specifically, we use statistical methods alone to obtain information equivalent to syntactic categories, and to discover the semantically meaningful units of text, which may be multi-word units or ...
The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continuous Speech Recognition (CSR) systems. Words have been used classically as the recognition unit in most of them. However, proposals of nonword units are beginning to arise. Basque is an agglutinative language with some structure inside words, for which non-word morpheme like units could be an appr...
Obtaining Japanese Lexical Units for Semantic Frames from Berkeley FrameNet Using a Bilingual Corpus
An attempt was made to semi-automatically obtain “lexical units” (LUs) for Japanese from the English LUs defined in the semantic frame database provided by Berkeley FrameNet (BFN) using an English-Japanese bilingual corpus. This task was a prerequisite to building a complete database of semantic frames for Japanese. In the task, a Japanese word is first translated into an English word or phrase...
We investigate the use of sub-word lexical units for the detection of out-of-vocabulary (OOV) keywords in the keyword spotting task. Sub-word units based on morphological decomposition and character ngrams are compared. In particular, we examine the benefit of sub-word units that cross word boundaries. Experiments are performed on the IARPA Babel Turkish dataset. Our results demonstrate that cr...
This presentation focuses on an ongoing project which aims at the creation of a large lexical database of Portuguese multiword (MW) units, automatically extracted through the analysis of a balanced 50 million word corpus, statistically interpreted with lexical association measures and validated by hand. This database covers different types of MW units, like named entities, and lexical associati...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید