NUS at DUC 2007: Using Evolutionary Models of Text
Abstract
This paper presents our new query-based multi-document summarization system used in DUC 2007. Current graph-based approaches to text summarization, such as TextRank and LexRank, assume a static graph model that does not capture how the input text emerges. A suitable evolutionary graph model that reflects the human writing and reading processes may impart a better understanding of the text and improve the subsequent summarization process. We propose a timestamped graph (TSG) model motivated by human writing and reading processes, and show how input text emerges during the construction phase of the TSG. We applied TSG to both the main task and the update summary task of the Document Understanding Conference (DUC) 2007 and achieved satisfactory results. We also propose a modified MMR re-ranker for the update task.
Related resources
Timestamped Graphs: Evolutionary Models of Text for Multi-Document Summarization
Current graph-based approaches to automatic text summarization, such as LexRank and TextRank, assume a static graph which does not model how the input texts emerge. A suitable evolutionary text graph model may impart a better understanding of the texts and improve the summarization process. We propose a timestamped graph (TSG) model that is motivated by human writing and reading processes, and ...
NUS at DUC 2005: Understanding Documents via Concept Links
The primary goal of our participation in DUC 2005 is two-fold. One is to benchmark the performance of a method of computing sentence semantic similarity. The other is to test the effectiveness of a new redundancy minimization formula inspired by Maximal Marginal Relevance (MMR). By using only these two features and eschewing other heuristics, our system performed competitively, achieving the top...
Optimization of sediment rating curve coefficients using evolutionary algorithms and unsupervised artificial neural network
The sediment rating curve (SRC) is a conventional and widely used regression model for estimating suspended sediment load (SSL) from flow discharge. However, in most cases the log-transformation of the data in SRC models causes a bias that leads to underestimated SSL predictions. In this study, using the daily stream flow and suspended sediment load data from Shalman hydrometric station on Shalmanroud River, Guilan Pro...
A Term Frequency Distribution Approach for the DUC-2007 Update Task
We present our system used in the DUC 2007 update task, which is our first entry in any of the DUC evaluations. We make use of ideas within our existing FreqDistSumm text summarizer, which has been shown to perform well in biomedical text summarization. Our system submitted to the DUC Update Task, called FreqDistUpdate, uses a context-sensitive approach to scoring sentences based on a frequency...
NUS at DUC 2006: Document Concept Lattice for Summarization
Concepts composed of open-class terms after semantic equivalence discovery can be considered as simplified basic elements. We utilize frequent concept sets to construct a Document Concept Lattice, which contains hierarchical summary information of a document cluster. Based on this lattice, we further extract a set of sentences with maximal representative power and minimal redundancy for summari...
Publication date: 2007