xGENIA: A comprehensive OWL ontology based on the GENIA corpus

نویسندگان

  • Rafal Rak
  • Lukasz Kurgan
  • Marek Reformat
چکیده

UNLABELLED The GENIA ontology is a taxonomy that was developed as a result of manual annotation of a subset of MEDLINE, the GENIA corpus. Both the ontology and corpus have been used as a benchmark to test and develop biological information extraction tools. Recent work shows, however, that there is a demand for a more comprehensive ontology that would go along with the corpus. We propose a complete OWL ontology built on top of the GENIA ontology utilizing the GENIA corpus. The proposed ontology includes elements such as the original taxonomy of categories, biological entities as individuals, relations between individuals using verbs and verb nominalizations as object properties, and links to the UMLS Metathesaurus concepts. AVAILABILITY http://www.ece.ualberta.ca/~rrak/ontology/xGENIA/

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Use of OWL 2 to Facilitate a Biomedical Knowledge Base Extracted from the GENIA Corpus

The annotation of the GENIA corpus, a set of biomedical articles, targets the classification of biological entities based on their association with a domain-tailored taxonomy of categories. By incorporating information extraction process on the corpus we have developed a knowledge base (KB) that includes a more comprehensive taxonomy of categories, relationships between biological entities, and...

متن کامل

Applying ontology design patterns to the implementation of relations in GENIA

Motivation: Annotated reference corpora such as the GENIA corpus play an important role in biomedical information extraction. A semantic annotation of the natural language texts in these reference corpora using formal ontologies and logic is challenging due to the ambiguous use of natural language and natural language semantics. Providing formal definitions and axioms for these relations would ...

متن کامل

Ontology design patterns to disambiguate relations between genes and gene products in GENIA

MOTIVATION Annotated reference corpora play an important role in biomedical information extraction. A semantic annotation of the natural language texts in these reference corpora using formal ontologies is challenging due to the inherent ambiguity of natural language. The provision of formal definitions and axioms for semantic annotations offers the means for ensuring consistency as well as ena...

متن کامل

From GENIA to BIOTOP - Towards a Top-Level Ontology for Biology

The increasing need for advanced ontology-based knowledge management in the life sciences is generally being acknowledged but, up until now, the development of biological ontologies lacks adherence to foundational principles of ontology design. This is particularly true of so-called upper-level ontologies such as the GENIA ontology which covers biological continuants and has mainly been devised...

متن کامل

An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)

Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformation

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2007