Text Summarization Extraction System (TSES) Using Extracted Keywords
Abstract
This paper investigates a new technique for producing a summary of an original text. The system combines several approaches to this problem, which gave high-quality results. The model consists of four stages. The preprocessing stage converts the unstructured text into a structured form. In the first stage, the system removes stop words, parses the text, assigns a part-of-speech (POS) tag to each word, and stores the result in a table. The second stage extracts the important keyphrases from the text using a new algorithm that ranks the candidate words. The system then uses the extracted keywords/keyphrases to select the important sentences. Each sentence is ranked according to several features, such as the presence of keywords/keyphrases in it and the relation between the sentence and the title, computed with a similarity measure. The third stage extracts the sentences with the highest rank. The fourth stage is the filtering stage, which reduces the number of candidate sentences in the summary using the KFIDF measure in order to produce a high-quality summary.
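The stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several simplifications are assumptions: plain word frequency stands in for the paper's keyword-ranking algorithm, Jaccard overlap stands in for the unspecified sentence-title similarity measure, a Jaccard-based redundancy check stands in for the KFIDF filter, and POS tagging is omitted. All function names, the stop-word list, and the parameters are hypothetical.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (assumption; the paper's list is not given).
STOP_WORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "are",
              "for", "on", "by", "with", "that", "this", "it", "as"}


def tokenize(text):
    """Stage 1 (simplified): lowercase, split into words, drop stop words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def split_sentences(text):
    """Split on sentence-ending punctuation followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_keywords(text, top_n=5):
    """Stage 2 (stand-in): rank candidate words by raw frequency."""
    counts = Counter(tokenize(text))
    return [word for word, _ in counts.most_common(top_n)]


def jaccard(a, b):
    """Set overlap between two token lists, in [0, 1]."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def summarize(title, text, top_n_keywords=5, max_sentences=2,
              redundancy_threshold=0.5):
    sentences = split_sentences(text)
    keywords = set(extract_keywords(text, top_n_keywords))
    title_tokens = tokenize(title)

    # Score each sentence by keyword hits plus similarity to the title.
    scored = []
    for idx, sentence in enumerate(sentences):
        tokens = tokenize(sentence)
        keyword_hits = sum(1 for t in tokens if t in keywords)
        score = keyword_hits + jaccard(tokens, title_tokens)
        scored.append((score, idx, sentence, tokens))

    # Stage 3: take sentences in order of decreasing score.
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))

    # Stage 4 (stand-in for KFIDF filtering): skip near-duplicate sentences.
    selected = []
    for _, idx, sentence, tokens in ranked:
        if len(selected) >= max_sentences:
            break
        if all(jaccard(tokens, kept) < redundancy_threshold
               for _, _, kept in selected):
            selected.append((idx, sentence, tokens))

    # Return the chosen sentences in their original document order.
    return [s for _, s, _ in sorted(selected, key=lambda item: item[0])]
```

Calling `summarize(title, text)` returns a list of sentences; raising `max_sentences` or lowering `redundancy_threshold` trades summary length against how aggressively similar sentences are filtered.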
Similar resources
EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatic text summarization has become an essential tool for web users. In this paper we present a novel approach for creating text summaries. Using fuzzy logic and word-net, our model extracts the most relevant sentences from an original document. The approach utilizes fuzzy measures and inference on the extracted textual information from the docu...
Improving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm
Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...
Applying Formal Concept Analysis to Teaching Material Extraction
A text summarization system can save users time when reading a large number of documents. The summary produced by such a system is usually composed of meaningful sentences that represent the content of the text. The relations between keywords usually come from their co-occurrences in the document. This study uses a hierarchical clustering method to cluster sentences and applies formal concept analysis to find o...
A Novel Approach for Mining Chosen Keywords Using Text Summarization Extraction System
The objective of text summarization is to reduce the amount of content while preserving its important information and overall meaning. The ever-increasing user-generated digital data available through the Internet has become an important source of information for individuals, organizations and government agencies. And yet, for users to completely find and use that data remains a complex ...
Audio enabled information extraction system for cricket and hockey domains
The proposed system aims to retrieve summarized information from documents collected from a web-based search engine according to a user query related to the cricket and hockey domains. The system is designed to take voice commands as keywords for search. The parts of speech in the query are extracted using the natural language extractor for English. Based on the key...
Journal: Int. Arab J. e-Technol.
Volume: 1
Publication date: 2010