Lexicographic study in sketch engine
نویسندگان
چکیده
منابع مشابه
The Sketch Engine
Word sketches are one-page automatic, corpus-based summaries of a word’s grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary and were presented at Euralex 2002. At that point, they only existed for English. Now, we have developed the Sketch Engine, a corpus tool which takes as input a corpus of any language and a corresponding gram...
متن کاملExpanding VerbNet with Sketch Engine
This research describes efforts to expand the lexical resource VerbNet with additional class members and completely new verb classes. Several approaches to this in the past have involved automatic methods for expansion, but this research focuses on the addition of frequent, yet particularly challenging verbs that require manual additions after a survey of each verb’s syntactic behaviors and sem...
متن کاملThe Preposition Corpus in Sketch Engine
Corpora from the Pattern Dictionary of English Prepositions (PDEP) provide the basis for examining the behavior of 304 English prepositions, with 1040 senses (patterns) describing in 20 fields. The PDEP corpora comprise dependency parses for 81,509 sentences in CoNLL-X format, previously used in several studies, particularly used for disambiguation modeling. We have now put these parse data int...
متن کاملEuropean Union Language Resources in Sketch Engine
Several parallel corpora built from European Union language resources are presented here. They were processed by state-of-the-art tools and made available for researchers in the Sketch Engine corpus management system. A completely new resource is introduced: EUR-Lex corpus, being one of the largest parallel corpus available at the moment, containing 840 million tokens of English and having the ...
متن کاملBilingual Terminology Extraction in Sketch Engine
We present a method of bilingual terminology extraction from parallel corpora and a few heuristics and experiments with improving the performance of the basic variant of the method. An evaluation is given using a small gold standard manually prepared for EnglishCzech language pair from DGT translation memory [1]. The bilingual terminology extraction (ABTE3) is available for several languages in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computer Science and Cybernetics
سال: 2012
ISSN: 1813-9663,1813-9663
DOI: 10.15625/1813-9663/27/3/481