Ontological Smoothing for Relation Extraction

نویسندگان

  • Congle Zhang
  • Raphael Hoffmann
  • Daniel S. Weld
چکیده

There is increasing interest in relation extraction, methods that convert natural language text into structured knowledge. The most successful techniques use supervised machine learning to generate extractors from sentences which have been labeled with the arguments of the relations of interest. Unfortunately, these methods require hundreds or thousands of training examples, which are expensive and time-consuming to produce. This paper presents ontological smoothing, a semi-supervised technique that exploits knowledge of a background ontology in order to learn extractors for a set of minimally-labeled relations. Ontological smoothing has three phases. The first step generates a mapping between the target relations and the background ontology. The second step uses knowledge-based weak supervision to heuristically generate new training examples for the target relations. The third step learns an extractor from a combination of the original and newly generated examples. Experiments on 61 relations across two target domains with Freebase as the background ontology show ontological smoothing can dramatically improve precision and recall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relation Extraction: from Ontological Smoothing to Temporal Correspondence

Relation Extraction: from Ontological Smoothing to Temporal Correspondence

متن کامل

Ontological Smoothing for Relation Extraction with Minimal Supervision

Relation extraction, the process of converting natural language text into structured knowledge, is increasingly important. Most successful techniques use supervised machine learning to generate extractors from sentences that have been manually labeled with the relations’ arguments. Unfortunately, these methods require numerous training examples, which are expensive and time-consuming to produce...

متن کامل

Concept and Relation Extraction in the Finance Domain

In this paper, we describe the state of our work on the possible derivation of ontological structures from textual analysis. We propose an approach to semi-automatic generation of domain ontologies from scratch, on the basis of heuristic rules applied to the result of a multi-layered processing of textual documents.

متن کامل

On Learning Subtypes of the Part-Whole Relation: Do Not Mix Your Seeds

An important relation in information extraction is the part-whole relation. Ontological studies mention several types of this relation. In this paper, we show that the traditional practice of initializing minimally-supervised algorithms with a single set that mixes seeds of different types fails to capture the wide variety of part-whole patterns and tuples. The results obtained with mixed seeds...

متن کامل

Automatic Extraction of Hierarchical Relations from Text

Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we propose an SVM based approach to hierarchical relation extraction, using features derived automatically from a number of GATE-based open-source language processing tools. In comparison to the previous works, we use several new fea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011