Entity-Supported Summarization of Biomedical Abstracts
Abstract
The increasing amount of biomedical information available to researchers and clinicians makes it harder to quickly find the right information. Automatic summarization of multiple texts can provide summaries specific to the user's information needs. In this paper we look into the use of named-entity recognition for graph-based summarization. We extend the LexRank algorithm with information about named entities and present EntityRank, a multi-document graph-based summarization algorithm that is based solely on named entities. We evaluate our system on a dataset of 1009 human-written summaries provided by BioASQ and on 1974 gene summaries fetched from the Entrez Gene database. The results show that adding named-entity information increases the performance of graph-based summarizers and that EntityRank significantly outperforms the other methods with regard to the ROUGE measures.
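To make the idea of a graph-based summarizer driven only by named entities more concrete, here is a minimal sketch. It is not the authors' EntityRank implementation, which the abstract does not specify. It assumes each sentence already comes with a set of recognized entities, uses Jaccard overlap of those sets as the edge weight, and ranks sentences with a plain PageRank power iteration. The function names `entity_rank_sketch` and `summarize`, the damping factor, and the iteration count are all illustrative assumptions.

```python
from itertools import combinations


def entity_rank_sketch(entity_sets, damping=0.85, iterations=50):
    """Score sentences with PageRank over an entity-overlap graph.

    entity_sets: one set of entity strings per sentence. This input is an
    assumption of the sketch; a real system would obtain the sets from a
    named-entity recognizer.
    """
    n = len(entity_sets)
    if n == 0:
        return []

    # Edge weight = Jaccard overlap between the entity sets of two sentences.
    weights = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        union = entity_sets[i] | entity_sets[j]
        if union:
            w = len(entity_sets[i] & entity_sets[j]) / len(union)
            weights[i][j] = weights[j][i] = w

    out_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        # Sentences with no neighbours spread their score uniformly.
        dangling = sum(scores[i] for i in range(n) if out_sums[i] == 0.0)
        new_scores = []
        for j in range(n):
            incoming = sum(
                scores[i] * weights[i][j] / out_sums[i]
                for i in range(n)
                if out_sums[i] > 0.0
            )
            new_scores.append(
                (1.0 - damping) / n + damping * (incoming + dangling / n)
            )
        scores = new_scores
    return scores


def summarize(sentences, entity_sets, k=2):
    """Return the k highest-scoring sentences in their original order."""
    scores = entity_rank_sketch(entity_sets)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

In this sketch a sentence that shares entities with many other sentences accumulates score, while a sentence whose entities appear nowhere else stays near the baseline. Extending LexRank with entity information, as the abstract describes, would presumably combine such an entity-based weight with a lexical similarity weight, but the exact combination is not given here.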
Similar resources
BioChain: Using Lexical Chaining Methods for Biomedical Text Summarization
Lexical chaining is a technique for identifying semantically-related terms in a text. It is useful in document summarization in order to identify the top sentences most likely to contain the main ideas of a document or document set. These top sentences are then extracted and combined in order to produce a summary of the document(s). To date, summarization work using lexical chains ha...
Towards Automatic Generation of Gene Summary
In this paper we present an extractive system that automatically generates gene summaries from the biomedical literature. The proposed text summarization system selects and ranks sentences from multiple MEDLINE abstracts by exploiting gene-specific information and similarity relationships between sentences. We evaluate our system on a large dataset of 7,294 human genes and 187,628 MEDLINE abstr...
COMPENDIUM: A Text Summarization System for Generating Abstracts of Research Papers
This article analyzes the appropriateness of a text summarization system, COMPENDIUM, for generating abstracts of biomedical papers. Two approaches are suggested: an extractive (COMPENDIUME), which only selects and extracts the most relevant sentences of the documents, and an abstractive-oriented one (COMPENDIUME–A), thus facing also the challenge of abstractive ...
A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature
Readers usually rely on abstracts to identify relevant medical information from scientific articles. Abstracts are also essential to advanced information retrieval methods. More than 50 thousand scientific publications in PubMed Central lack author-generated abstracts, and the relevancy judgements for these papers have to be based on their titles alone. In this paper, we propose a hybrid summar...
Multi-document Summarization of Dissertation Abstracts Using a Variable-Based Framework
This paper reports initial work on developing a method for automatic construction of multidocument summaries of sets of domain-specific dissertation abstracts. A variable-based framework for multi-document summarization of dissertation abstracts in the field of sociology and psychology that makes use of the macro-level and micro-level discourse structure of dissertation abstracts as well as cro...
Publication date: 2016