A summarization system for Chinese news from multiple sources
نویسندگان
چکیده
This paper will propose a personal news secretariat that helps on-line readers absorb news information from multiple sources. Such a news secretariat eliminates the redundant information in the news, and reorganizes the news for readers. This multiple document summarization employs named entities and other signatures to cluster news stream; employs punctuation marks, linking elements, and topic chains to identify the meaningful units (MUs); employs nouns and verbs to find the similarity of MUs; and finally employs focusing and browsing models to display the summarization results. On the average, the document reduction rates are 70.77% and 42.26% for focusing and browsing summarization, respectively. The reading-time reduction rate is 30.86%, and the correct rate of question-and-answering task is 88.46% for browsing summarization.
منابع مشابه
Generating Natural Language Summaries from Multiple On-Line Sources
We present a methodology for summarization of news on current events. Our approach is included in a system, called SUMMONS which presents news summaries to the user in a natural language form along with appropriate background (historical) information from both textual (newswire) and structured (database) knowledge sources. The system presents novel approaches to several problems: summarization ...
متن کاملQuery-focused Summarization Using Text-to-Text Generation: When Information Comes from Multilingual Sources
The past five years have seen the emergence of robust, scalable natural language processing systems that can summarize and answer questions about online material. One key to the success of such systems is that they re-use text that appeared in the documents rather than generating new sentences from scratch. Re-using text is absolutely essential for the development of robust systems; full semant...
متن کاملA Platform for Multilingual News Summarization
We have developed a multilingual version of Columbia Newsblaster as a testbed for multilingual multi-document summarization. The system collects, clusters, and summarizes news documents from sources all over the world daily. It crawls news sites in many different countries, written in different languages, extracts the news text from the HTML pages, uses a variety of methods to translate the doc...
متن کاملA Chinese Automatic Text Summarization system for mobile devices
A large amount of on-line information and lengthiness information can’t fit for the mobile devices. In order to save this problem, we propose a method which collects original news text from on-line information and extracts summary sentences from them automatically. On this basis, we adopt WML(Wireless Markup Language) to build a news website for mobile devices browsing through the news summary....
متن کاملFirst International Workshop on Recent Trends in News Information Retrieval (NewsIR'16)
The news industry has gone through seismic shifts in the past decade with digital content and social media completely redefining how people consume news. Readers check for accurate fresh news from multiple sources throughout the day using dedicated apps or social media on their smartphones and tablets. At the same time, news publishers rely more and more on social networks and citizen journalis...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JASIST
دوره 54 شماره
صفحات -
تاریخ انتشار 2003