Clustered Hierarchical Concept Based Semantic Closeness Between Two Concepts Using WordNet

نویسندگان

  • Boddu Bhaskara Rao
  • Vatsavayi Valli Kumari
چکیده

The search engine needs relatedness to measure closeness between two concepts for determining optimal results in major applications like information retrieval, information integration and of many more in natural language processing tasks i.e. text classification, word sense disambiguation, matching problems in artificial intelligence etc,. The clustered hierarchical concept network helps to overcome the fuzzy variations in different levels of granularity in measures of closeness based on weights, frequency or distances but these measures are not considered since no method takes the actual context of the user intention, user query or context domain subject fields. Clustered hierarchical concept network has three steps: Elicitation: extract the concepts of user query using concept extraction algorithm and name the output as context domain. Construction: building hierarchical clusters based on context or concept domain with related concepts as nodes and relations as edges. Matching: determine the matching concepts like Least Common General Concept (LCGC) and Least Common Specific Concept (LCSC). Clustered hierarchical concept based semantic closeness has three features i.e., context domain, concept net and common concepts. These features are used to calculate the relatedness. The primary goal of hierarchical concept network is to include the semantic of the concept by including its three features. The extraction of concepts are not only related to individual concepts, but it is also an organizational structure of the concepts that are combined in the ontology i.e. WorNet. In this paper, we propose a method for computing semantic closeness of two concepts in which the holonyms, meronyms, instances of concepts are considered synthetically. By calculating test data, the experiment results show that the method can compute concepts closeness effectively. The human judgments on a set of concept pairs led our approach to be more effective and have shown one of the best performance than the measures based on concept vector.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

Learning Hierarchical, Semantic Priors for Image Boundary Detection

Image segmentation is typically performed without knowledge of visual categories, higher-level context or semantics. Recently, learning-based methods have shown that segmentation can be improved by training on human-generated segmentations [5, 4]. However, incorporating the high-level knowledge that humans appear to use in segmentation is not usually considered. In our previous work [3], we inv...

متن کامل

A Korean-Japanese-Chinese Aligned Wordnet with Shared Semantic Hierarchy

A Korean-Japanese-Chinese aligned wordnet, “CoreNet” is introduced. For the purpose of this paper, the term “wordnet” refers to a network of words. It is constructed based on a shared semantic hierarchy that is originated from NTT Goidaikei (Lexical Hierarchical System). Korean wordnet was constructed through the semantic category assignment to every meaning of Korean words in a dictionary. Ver...

متن کامل

Using an Ontology for Improved Automated Content Scoring of Spontaneous Non-Native Speech

This paper presents an exploration into automated content scoring of non-native spontaneous speech using ontology-based information to enhance a vector space approach. We use content vector analysis as a baseline and evaluate the correlations between human rater proficiency scores and two cosine-similarity-based features, previously used in the context of automated essay scoring. We use two ont...

متن کامل

Semantic Similarity Measure Using Information Content Approach With Depth For Similarity Calculation

Similarity is criteria of measuring nearness or proximity between two concepts. Several algorithmic approaches for computing similarity have been proposed. Among the existing Similarity measure, majority of them utilize WordNet as an underlying ontology for calculating semantic similarity. WordNet is a lexical database for English Language which was created and maintained by Congnitive Science ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014