A Comparison of Summarisation Methods Based on Term Specificity Estimation

Authors

  • Constantin Orasan
  • Viktor Pekar
  • Laura Hasler
Abstract

In automatic summarisation, it has been noticed that knowledge-poor methods do not necessarily perform worse than those which employ several knowledge sources to produce a summary. This paper presents a comprehensive comparison of several summarisation methods based on term specificity estimation in order to find out which one performs best. The comparison considers parameters such as the quality of the produced summary and the required resources, in order to determine which of these methods is most appropriate for a real-world application. Intrinsic and extrinsic evaluations indicate that TF*RIDF, a variant of the commonly used TF*IDF, is the best-performing method.
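To make the idea of term-based sentence scoring concrete, the following is a minimal sketch of extractive summarisation with plain TF*IDF weights. It is not the authors' implementation: the abstract does not give the formulas, so the tokeniser, the logarithmic IDF, the fallback weight for unseen terms, and all function names (`tokenize`, `idf_table`, `score_sentences`, `summarise`) are assumptions made for illustration. TF*RIDF is not shown because the abstract does not define it.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed tokeniser)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def idf_table(background_docs):
    """Return (idf, default_idf) estimated from a background collection.

    idf[t] = log(N / df[t]); terms unseen in the collection get log(N),
    i.e. they are treated as if they occurred in one document (an assumption).
    """
    n = len(background_docs)
    df = Counter()
    for doc in background_docs:
        df.update(set(tokenize(doc)))
    return {t: math.log(n / d) for t, d in df.items()}, math.log(n)


def score_sentences(sentences, idf, default_idf):
    """Score each sentence as the sum of TF*IDF weights of its distinct terms.

    TF is the term's frequency in the whole document being summarised.
    """
    tf = Counter(t for s in sentences for t in tokenize(s))
    return [
        sum(tf[t] * idf.get(t, default_idf) for t in set(tokenize(s)))
        for s in sentences
    ]


def summarise(sentences, idf, default_idf, k=1):
    """Return the k highest-scoring sentences in their original order."""
    scores = score_sentences(sentences, idf, default_idf)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


if __name__ == "__main__":
    background = ["the cat sat", "the dog ran", "the bird flew"]
    document = ["the cat sat on the mat", "the the the"]
    idf, default_idf = idf_table(background)
    print(summarise(document, idf, default_idf, k=1))
```

In this sketch a term that appears in every background document, such as "the", receives an IDF of zero, so sentences made up of such terms score lowest. A variant weighting would only need to replace `idf_table`.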

Similar resources

Concept-centred summarisation: producing glossary entries for terms using summarisation methods

This paper describes a novel application of automatic summarisation methods for producing glossary entries. The proposed methodology is motivated by two observations: 1) glossary entries are increasingly used, especially on the Internet; and 2) information contained in a glossary entry is, in fact, a summary of information about the concept. From these two observations, we develop a method to a...

Pronominal anaphora resolution for text summarisation

The assumption of term-based summarisation methods is that the importance of a sentence can be determined by the importance of the words it contains. One drawback of these methods is that they usually consider the words in isolation, ignoring relations such as anaphoric links between them. This paper investigates to what extent the integration of pronominal anaphora resolution into the summarisa...

Factors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis

Background: Due to the importance of medical studies, researchers in this field should be familiar with various types of statistical analyses so that they can select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...

Comparative Evaluation of Modular Automatic Summarisation Systems Using Cast

The information overload faced by today’s society poses great challenges to researchers who want to find a relevant piece of information. Automatic summarisation is a field of computational linguistics which can help humans to deal with this information overload by automatically extracting the gist of documents. This thesis attempts to gain insights into the automatic summarisation field from s...

Estimation of Ejection Fraction with Echocardiographic and Cardioangiographic Methods in 50 Cardiac Patients

Ejection fraction is the most important indicator of heart failure. Angiography is a standard method to demonstrate ejection fraction, but it is invasive. Therefore, determining the sensitivity and specificity of echocardiography versus angiography provides good guidance for physicians. A total of 50 patients were enrolled in a prospective study. Two-dimensional and M-mode echocardiography...

Publication date: 2004