Knowledge Acquisition using Documents, Conceptual Graphs and a Semantically Structured Dictionary
نویسنده
چکیده
In this paper, we first show how in CGKAT, our knowledge acquisition tool, any document element and its semantics may be represented using the Conceptual Graphs formalism (Sowa, 1984) and a structured document editor. Then, we study the kinds of hypertext links that may be set between documents elements and concepts or relations of the knowledge base (such links enables the use of search techniques on the KB for finding information within the documents). In a second part, we detail the top-level ontologies (for concepts and relations) proposed by CGKAT and its exploitation of a semantically structured dictionary for guiding knowledge representation and easing its later reuse.
منابع مشابه
Working on a botanic corpus
Extracting information from an encyclopedic corpus of botanic may be done by hand but it is a long and tedious work. More and more, it becomes interesting and possible to speed-up the process by automatizing it but still keeping an human expert for validation. Among the different kind of information that may be extracted from a botanic corpus, we can cite terminology, conceptual information to ...
متن کاملConcept clustering and knowledge integration from a children's dictionary
Knowledge structures called Concept Clustering Knowledge Graphs (CCKGs) are introduced along with a process for their construction from a machine readable dictionary. CCKGs contain multiple concepts interrelated through multiple semantic relations together forming a semantic cluster represented by a conceptual graph. The knowledge acquisition is performed on a children’s first dictionary. The c...
متن کاملFinding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora
Currently there is much interest in the automatic acquisition of entities, with the goal of Named Entity Recognition (NER). However previous work has focused primarily on major languages, with the large, structured, and semantically rich knowledge bases and using the large corpus with annotated NER tags. In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition ...
متن کاملBuilding Automatically a Business Registration Ontology
We discuss a domain-independent, corpus based method for dictionary-less automatic extraction of ontological knowledge from domain-specific unannotated documents. We present the architecture, algorithms, and results for ONTOSTRUCT—a new system that uses machine learning and statistical techniques to analyze text sources, discover terms, link equivalent terms into concepts, learn both hierarchic...
متن کاملUsing Abductive Inference and Dynamic Indexing to Retrieve Multimedia SGML Documents
The retrieval of complex multimedia items such as SGML-structured texts can be facilitated by means of a formal representation of knowledge about these data. These information sources must be aggregated dynamically at the time of query processing. In this paper, an interactive, probabilistic retrieval system is proposed, comprising an extended Bayesian network, a multimedia indexing component a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995