Knowledge Acquisition using Documents, Conceptual Graphs and a Semantically Structured Dictionary

نویسنده

  • Philippe MARTIN
چکیده

In this paper, we first show how in CGKAT, our knowledge acquisition tool, any document element and its semantics may be represented using the Conceptual Graphs formalism (Sowa, 1984) and a structured document editor. Then, we study the kinds of hypertext links that may be set between documents elements and concepts or relations of the knowledge base (such links enables the use of search techniques on the KB for finding information within the documents). In a second part, we detail the top-level ontologies (for concepts and relations) proposed by CGKAT and its exploitation of a semantically structured dictionary for guiding knowledge representation and easing its later reuse.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Working on a botanic corpus

Extracting information from an encyclopedic corpus of botanic may be done by hand but it is a long and tedious work. More and more, it becomes interesting and possible to speed-up the process by automatizing it but still keeping an human expert for validation. Among the different kind of information that may be extracted from a botanic corpus, we can cite terminology, conceptual information to ...

متن کامل

Concept clustering and knowledge integration from a children's dictionary

Knowledge structures called Concept Clustering Knowledge Graphs (CCKGs) are introduced along with a process for their construction from a machine readable dictionary. CCKGs contain multiple concepts interrelated through multiple semantic relations together forming a semantic cluster represented by a conceptual graph. The knowledge acquisition is performed on a children’s first dictionary. The c...

متن کامل

Finding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora

Currently there is much interest in the automatic acquisition of entities, with the goal of Named Entity Recognition (NER). However previous work has focused primarily on major languages, with the large, structured, and semantically rich knowledge bases and using the large corpus with annotated NER tags. In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition ...

متن کامل

Building Automatically a Business Registration Ontology

We discuss a domain-independent, corpus based method for dictionary-less automatic extraction of ontological knowledge from domain-specific unannotated documents. We present the architecture, algorithms, and results for ONTOSTRUCT—a new system that uses machine learning and statistical techniques to analyze text sources, discover terms, link equivalent terms into concepts, learn both hierarchic...

متن کامل

Using Abductive Inference and Dynamic Indexing to Retrieve Multimedia SGML Documents

The retrieval of complex multimedia items such as SGML-structured texts can be facilitated by means of a formal representation of knowledge about these data. These information sources must be aggregated dynamically at the time of query processing. In this paper, an interactive, probabilistic retrieval system is proposed, comprising an extended Bayesian network, a multimedia indexing component a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995