From Social Networks To Distributional Properties: A Comparative Study On Computing Semantic Relatedness

نویسندگان

  • Ulli Waltinger
  • Irene Cramer
چکیده

In recent years a variety of approaches in computing semantic relatedness have been proposed. However, the algorithms and resources employed differ strongly, as well as the results obtained under different experimental conditions. This article investigates the quality of various semantic relatedness measures in a comparative study. We conducted an extensive experiment using a broad variety of measures operating on social networks, lexical-semantic nets and co-occurrence in text corpora. For two sample data sets we obtained human relatedness judgements which were compared to the estimates of the automated measures. We also analyzed the algorithms implemented and resources employed from a theoretical point of view, and we examined several practical issues, such as run time and coverage. While the performance of all measures is still mediocre, we could observe that in terms of of coverage and correlation distributional measures operating on controlled corpora perform best.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building Semantic Networks from Plain Text and Wikipedia with Application to Semantic Relatedness and Noun Compound Paraphrasing

The construction of suitable and scalable representations of semantic knowledge is a core challenge in Semantic Computing. Manually created resources such as WordNet have been shown to be useful for many AI and NLP tasks, but they are inherently restricted in their coverage and scalability. In addition, they have been challenged by simple distributional models on very large corpora, questioning...

متن کامل

Distributional Semantics for Entity Relatedness

Wikipedia provides an enormous amount of background knowledge to reason about the semantic relatedness between two entities. In this work, we present a distributional semantics based approach for computing entity relatedness, and a focused related entities explorer based on this approach.

متن کامل

tESA: a distributional measure for calculating semantic relatedness

BACKGROUND Semantic relatedness is a measure that quantifies the strength of a semantic link between two concepts. Often, it can be efficiently approximated with methods that operate on words, which represent these concepts. Approximating semantic relatedness between texts and concepts represented by these texts is an important part of many text and knowledge processing tasks of crucial importa...

متن کامل

Semantic Relatedness for All (Languages): A Comparative Analysis of Multilingual Semantic Relatedness Using Machine Translation

This paper provides a comparative analysis of the performance of four state-of-the-art distributional semantic models (DSMs) over 11 languages, contrasting the native language-specific models with the use of machine translation over English-based DSMs. The experimental results show that there is a significant improvement (average of 16.7% for the Spearman correlation) by using state-of-the-art ...

متن کامل

Human and Machine Judgements for Russian Semantic Relatedness

Semantic relatedness of terms represents similarity of meaning by a numerical score. On the one hand, humans easily make judgements about semantic relatedness. On the other hand, this kind of information is useful in language processing systems. While semantic relatedness has been extensively studied for English using numerous language resources, such as associative norms, human judgements and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009