Lexical knowledge and human disagreement on a WSD task

نویسندگان

  • G. Craig Murray
  • Rebecca Green
چکیده

This paper explores factors correlating with lack of inter-annotator agreement on a word sense disambiguation (WSD) task taken from SENSEVAL-2. Twenty-seven subjects were given a series of tasks requiring word sense judgments. Subjects were asked to judge the applicability of word senses to polysemous words used in context. Metrics of lexical ability were evaluated as predictors of agreement between judges. A strong interaction effect was found for lexical ability, in which differences between levels of lexical knowledge predict disagreement. Individual levels of lexical knowledge, however, were not independently predictive of disagreement. The finding runs counter to previous assumptions regarding expert agreement on WSD annotation tasks, which in turn impacts notions of a meaningful ‘‘gold standard’’ for systems evaluation. 2004 Elsevier Ltd. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

USYD: WSD and Lexical Substitution using the Web1T corpus

This paper describes the University of Sydney’s WSD and Lexical Substitution systems for SemEval-2007. These systems are principally based on evaluating the substitutability of potential synonyms in the context of the target word. Substitutability is measured using Pointwise Mutual Information as obtained from the Web1T corpus. The WSD systems are supervised, while the Lexical Substitution syst...

متن کامل

An investigation and comparison of lexical knowledge of deaf and hearing children

Abstract Objectives: The present study examines the lexical knowledge of deaf children in two age groups of 9-10 and 10-11 years old with two groups of normal hearing children of 9-10 and 10-11 years old. Method: This study is a casual-comparative study. The achievement of 16 deaf children (ages 9-10 and 10-11 years old) and 16 hearing children (ages 9-10 and 10-11 years old) were examined on...

متن کامل

Pre-emption with or without Pre-task Planning: A Probe into L2 Lexical Diversity

The current study, setting a two-fold goal, attempted to see whether the preemptive focus on form (FonF) under either planned or unplanned conditions could contribute to increasing lexical diversity in written narratives and, second, to find whether there was a trade-off between the lexical diversity and accuracy. To this end, 32 beginner learners were selected following a Quick Oxford Placemen...

متن کامل

HIT-WSD: Using Search Engine for Multilingual Chinese-English Lexical Sample Task

We have participated in the Multilingual Chinese-English Lexical Sample Task of SemEval-2007. Our system disambiguates senses of Chinese words and finds the correct translation in English by using the web as WSD knowledge source. Since all the statistic data is obtained from search engine, the method is considered to be unsupervised and does not require any sense-tagged corpus.

متن کامل

COLEPL and COLSLM: An Unsupervised WSD Approach to Multilingual Lexical Substitution, Tasks 2 and 3 SemEval 2010

In this paper, we present a word sense disambiguation (WSD) based system for multilingual lexical substitution. Our method depends on having a WSD system for English and an automatic word alignment method. Crucially the approach relies on having parallel corpora. For Task 2 (Sinha et al., 2009) we apply a supervised WSD system to derive the English word senses. For Task 3 (Lefever & Hoste, 2009...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Speech & Language

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2004