Knowledge Representation Issues in Information Extraction
نویسندگان
چکیده
The advent of computing has exacerbated the problem of overwhelming information. Advanced information management strategies such as Information Extraction, Information Filtering, Information Retrieval, and Text Categorization are becoming important to manage the deluge of information. Information Extraction (IE) systems can be used to automatically extract relevant information from free-form text for update to databases or for report generation. This paper describes the major challenge of knowledge representation issues in an information extraction task – representing the meaning of the input text, the knowledge of the field of application (or domain application) and the knowledge about the target information to be extracted. In this research, we have chosen a directed graph structure to represent the input text meaning, a domain ontology to represent the domain application and a frame representation to capture the target information to be extracted. We discuss in this paper how these knowledge structures interplay to perform the task of information extraction.
منابع مشابه
A VICORE Architecture for Intelligent Knowledge Management
We consider the functionality, architecture, design and implementation issues related to the development of intelligent systems for knowledge management that assist people in finding relevant literature and in discovering new knowledge. We describe a visualized concept representation (VICORE) framework for knowledge management. Special attention is given to the use of concept association matric...
متن کاملKnowledge Acquisition from Multimedia Content using an Evolution Framework
We propose an approach to knowledge acquisition, which uses multimedia ontologies for fused extraction of semantics from multiple modalities, and feeds back the extracted information, aiming to evolve knowledge representation. This paper presents the basic components of the proposed approach and discusses the open research issues focusing on the fused information extraction that will enable the...
متن کاملSome empirical findings on dialogue management and domain ontologies in dialogue systems - Implications from an evaluation of BirdQuest
In this paper we present implications for development of dialogue systems, based on an evaluation of the system BIRDQUEST which combine dialogue interaction with information extraction. A number of issues detected during the evaluation concerning primarily dialogue management, and domain knowledge representation and use are presented and discussed.
متن کاملHyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations
The ability of recording the high resolution spectral signature of earth surface would be the most important feature of hyperspectral sensors. On the other hand, classification of hyperspectral imagery is known as one of the methods to extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images in the information content point of view, there...
متن کاملAeroDAML: Applying Information Extraction to Generate DAML Annotations from Web Pages
The DARPA Agent Markup Language (DAML) is an emerging knowledge representation for the Semantic Web. DAML can encode the semantics of a document for use by agents on the web. However, DAML annotation of documents and web pages is a tedious and time consuming task. AeroDAML is a knowledge markup tool that applies natural language information extraction techniques to automatically generate DAML a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998