The PIA Project: Learning to Semantically Annotate Texts from an Ontology and XML-Instance Data
نویسندگان
چکیده
The development of the XML and RDF(S) standards offer a positive environment for machine learning to enable the automatic XML-annotation of texts that can encourage the extension of Semantic Web applications. After reviewing the current limitations of information extraction technology, specifically its lack of portability to new domains, we introduce the PIA project for automatically XML-annotating domain-based texts using example XML texts and an ontology for supervised training.
منابع مشابه
Ontology-based Semantic Metadata Validation
Much of the Semantic Web content is generated from databases, especially the instance data based on the ontology classes used in applications. A recurring problem is that the instance data does not always semantically conform to the ontology used. It may be ambiguous, incomplete, or partly erroneous. Validating the data is necessary when it is transformed to a more semantic format. This may be ...
متن کاملA Method for Mapping Sensor Data to SSN Ontology
Along with the continuous development of the sensor network technology, sensors from all over the world are constantly producing sensor data. However, the sensor data from different source is hard to work together for lack of semantic. Fortunately, SSN ontology provide a way to represent sensor data semantically, but how to transform sensor data into the instance of SSN ontology conveniently is...
متن کاملUsing WEESA to Semantically Annotate Cocoon
The Semantic Web is based on the idea that Web applications provide semantically annotatedWeb pages. This metadata is typically added in the semantic annotation process which is currently not part of the Web engineering process. Web engineering, however, proposes methodologies to design, implement and maintain Web applications but lack semantic annotation. In this paper we show how WEESA, a map...
متن کاملCoupling Information Extraction and Data Mining for Ontology Learning in PARMENIDES
Strategic decision making, especially in the areas of business intelligence and competitive intelligence, requires the acquisition of decision-relevant information pieces like market trends, fusions and company values. This information is extracted by pre-processing and querying multiple sources, combining and condensing the findings. It is characteristic that the extraction process is resource...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001