Acquisition of Linguistic Patterns for Knowledge-based Information Extraction

نویسندگان

  • Sanda M. Harabagiu
  • Steven J. Maiorano
چکیده

In this paper we present a new method of automatic acquisition of linguistic patterns for Information Extraction, as implemented in the CICERO system. Our approach combines lexico-semantic information available from the WordNet database with collocating data extracted from training corpora. Due to the open-domain nature of the WordNet information and the immediate availability of large collections of texts, our method can be easily ported to open-domain Information Extraction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge-Driven Event Extraction in Russian: Corpus-Based Linguistic Resources

Automatic event extraction form text is an important step in knowledge acquisition and knowledge base population. Manual work in development of extraction system is indispensable either in corpus annotation or in vocabularies and pattern creation for a knowledge-based system. Recent works have been focused on adaptation of existing system (for extraction from English texts) to new domains. Even...

متن کامل

Automatic Discovery of Linguistic Patterns for Information Extraction

Information Extraction (IE) systems typically rely on extraction patterns encoding domain-specific knowledge. When matched against natural language texts, these patterns recognize with high accuracy information relevant to the extraction task. Adapting an IE system to a new extraction scenario entails devising a new collection of extraction patterns a time-consuming and expensive process. To ov...

متن کامل

Development of Linguistic Rules Diagnosis of Failure in Centrifugal Pump for Use in Expert System

Operational failures in centrifuge pumps could be hydraulic or mechanical. However, most of these mechanical and hydraulic failures are connected cause of their operational nature and finding the right cause is due to considering numerous mechanical and hydraulic signs and parameters in pumps. On the other hand, due to non-linear and fluctuant behavior of pumps in the matter of time and not pre...

متن کامل

Infrastructure for Open-Domain Information Extraction

The problem of performing open-domain Information Extraction (IE) was historically tied to the problem of ad-hoc acquisition of extraction patterns. In this paper we show that this requirement is not sufficient and that we also need to build new IE architectures that combine the role of linguistic patterns with coreference knowledge and ambiguous syntactic and semantic information. We present t...

متن کامل

Development of Linguistic Rules Diagnosis of Failure in Centrifugal Pump for Use in Expert System

Operational failures in centrifuge pumps could be hydraulic or mechanical. However, most of these mechanical and hydraulic failures are connected cause of their operational nature and finding the right cause is due to considering numerous mechanical and hydraulic signs and parameters in pumps. On the other hand, due to non-linear and fluctuant behavior of pumps in the matter of time and not pre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Knowl. Data Eng.

دوره 7  شماره 

صفحات  -

تاریخ انتشار 1995