The PIA Project: Learning to Semantically Annotate Texts from an Ontology and XML-Instance Data

Authors

Nigel Collier

Koichi Takeuchi

Keita Tsuji

Abstract

The development of the XML and RDF(S) standards offers a positive environment for machine learning to enable the automatic XML annotation of texts, which can in turn encourage the extension of Semantic Web applications. After reviewing the current limitations of information extraction technology, specifically its lack of portability to new domains, we introduce the PIA project for automatically XML-annotating domain-based texts, using example XML texts and an ontology for supervised training.

Similar resources

Ontology-based Semantic Metadata Validation

Much of the Semantic Web content is generated from databases, especially the instance data based on the ontology classes used in applications. A recurring problem is that the instance data does not always semantically conform to the ontology used. It may be ambiguous, incomplete, or partly erroneous. Validating the data is necessary when it is transformed to a more semantic format. This may be ...

A Method for Mapping Sensor Data to SSN Ontology

With the continuous development of sensor network technology, sensors all over the world are constantly producing data. However, sensor data from different sources is hard to combine because it lacks semantics. Fortunately, the SSN ontology provides a way to represent sensor data semantically, but how to transform sensor data into instances of the SSN ontology conveniently is...

Using WEESA to Semantically Annotate Cocoon

The Semantic Web is based on the idea that Web applications provide semantically annotated Web pages. This metadata is typically added in the semantic annotation process, which is currently not part of the Web engineering process. Web engineering, however, proposes methodologies to design, implement and maintain Web applications but lacks semantic annotation. In this paper we show how WEESA, a map...

Coupling Information Extraction and Data Mining for Ontology Learning in PARMENIDES

Strategic decision making, especially in the areas of business intelligence and competitive intelligence, requires the acquisition of decision-relevant information pieces like market trends, fusions and company values. This information is extracted by pre-processing and querying multiple sources, combining and condensing the findings. It is characteristic that the extraction process is resource...

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

Publication date: 2001
