Semantic Similarity from Syntactic Relations

Authors

  • Caroline Varaschin Gasperin
  • Vera Lúcia Strube
Abstract

This work presents the results of applying a technique for the automatic extraction of semantic relations between words from a corpus. The technique is the one proposed by Grefenstette in [8]. We propose extensions to the notion of syntactic context adopted in [8], aiming to improve the identification of semantically related words. These extensions add new information to the contexts, beyond that used by Grefenstette, and introduce a new kind of context. We carried out experiments to investigate and validate Grefenstette's technique and the proposed extensions for the Portuguese language. The analysis of the experimental results is detailed, and the lists of similar words generated by Grefenstette's original technique are compared with those generated by the proposed modified technique.
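The general idea of comparing words by their syntactic contexts can be illustrated with a minimal sketch. The triples below are invented toy data, the function names are hypothetical, and the plain Jaccard coefficient is a simplifying assumption; it is not necessarily the measure used in [8] or in the modified technique.

```python
from collections import defaultdict

# Toy (word, relation, other_word) triples, as a parser might emit them.
# All data here is invented for illustration only.
TRIPLES = [
    ("wine", "dobj_of", "drink"),
    ("wine", "mod", "red"),
    ("beer", "dobj_of", "drink"),
    ("beer", "mod", "cold"),
    ("water", "dobj_of", "drink"),
    ("water", "mod", "cold"),
    ("book", "dobj_of", "read"),
    ("book", "mod", "red"),
]


def build_contexts(triples):
    """Map each word to the set of (relation, other_word) contexts it occurs in."""
    contexts = defaultdict(set)
    for word, relation, other in triples:
        contexts[word].add((relation, other))
    return dict(contexts)


def jaccard(a, b):
    """Unweighted Jaccard coefficient between two context sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def most_similar(word, contexts):
    """Rank all other words by context overlap with `word`, best first."""
    target = contexts[word]
    scores = [
        (other, jaccard(target, ctx))
        for other, ctx in contexts.items()
        if other != word
    ]
    # Sort by descending score, then alphabetically to break ties.
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))


if __name__ == "__main__":
    for other, score in most_similar("beer", build_contexts(TRIPLES)):
        print(f"{other}\t{score:.2f}")
```

In this toy data, "water" shares both of its contexts with "beer" and so ranks first, while "book" shares none. Adding richer information to each context, as the abstract proposes, would correspond to changing what goes into the `(relation, other_word)` tuples.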


Similar resources

Semantic Role Labeling of Persian Sentences Using a Memory-Based Learning Approach

Extracting semantic roles is one of the major steps in representing text meaning. It refers to finding the semantic relations between a predicate and syntactic constituents in a sentence. In this paper we present a semantic role labeling system for Persian, using a memory-based learning model and standard features. Our proposed system implements a two-phase architecture to first identify...


Syntactic Contexts for Finding Semantically Related Words

Finding semantically related words is a first step in the direction of automatic ontology building. Guided by the view that similar words occur in similar contexts, we looked at the syntactic context of words to measure their semantic similarity. Words that occur in a direct object relation with the verb drink, for instance, have something in common (liquidity, ...). Co-occurrence data for comm...


Syntactic-Based Methods for Measuring Word Similarity

This paper explores different strategies for extracting similarity relations between words from partially parsed text corpora. The strategies we have analysed require neither supervised training nor semantic information available from general lexical resources. They differ in the amount and the quality of the syntactic contexts against which words are compared. The paper presents in detail the ...


Structural Priming as Structure-Mapping: Children Use Analogies From Previous Utterances to Guide Sentence Production

What mechanisms underlie children's language production? Structural priming--the repetition of sentence structure across utterances--is an important measure of the developing production system. We propose its mechanism in children is the same as may underlie analogical reasoning: structure-mapping. Under this view, structural priming is the result of making an analogy between utterances, such t...


JU_CSE_NLP: Multi-grade Classification of Semantic Similarity between Text Pairs

This article presents the experiments carried out at Jadavpur University as part of its participation in Semantic Textual Similarity (STS), Task 6 of the Semantic Evaluation Exercises (SemEval-2012). Task 6 of SemEval-2012 focused on semantic relations between text pairs. Task 6 provides five different text pair files to compare different semantic relations and judge these relations through a similarity ...




Publication date: 2002