Hierarchical Orderings of Textual Units

Author

  • Alexander Mehler
Abstract

Text representation is a central task for any approach to automatic learning from texts. It requires a format that allows texts to be interrelated even if they do not share content words but deal with similar topics. Furthermore, measuring text similarities raises the question of how to organize the resulting clusters. This paper presents cohesion trees (CTs) as a data structure for the perspective, hierarchical organization of text corpora. CTs operate on alternative text representation models that take lexical organization, quantitative text characteristics, and text structure into account. It is shown that CTs realize text linkages which are lexically more homogeneous than those produced by minimal spanning trees.
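To make the baseline in the last sentence concrete, the following is a minimal sketch of a minimal spanning tree built over pairwise text distances. It does not implement cohesion trees, whose construction the abstract does not specify. The bag-of-words vectors, the cosine distance, and the names `cosine_distance` and `minimal_spanning_tree` are assumptions chosen for illustration; a plain bag-of-words model in particular cannot relate texts that share no content words, which is the limitation the abstract raises.

```python
import math
from collections import Counter


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty text as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def minimal_spanning_tree(texts):
    """Prim's algorithm; returns (from_index, to_index, distance) edges."""
    # Hypothetical representation: whitespace-tokenized term frequencies.
    vectors = [Counter(t.lower().split()) for t in texts]
    n = len(vectors)
    if n == 0:
        return []
    in_tree = {0}
    edges = []
    while len(in_tree) < n:
        # Pick the cheapest edge leaving the current tree.
        best = min(
            ((i, j, cosine_distance(vectors[i], vectors[j]))
             for i in in_tree for j in range(n) if j not in in_tree),
            key=lambda e: e[2],
        )
        edges.append(best)
        in_tree.add(best[1])
    return edges


texts = [
    "cohesion trees organize text corpora",
    "trees organize corpora hierarchically",
    "stochastic orderings of series systems",
    "stochastic orderings of parallel systems",
]
mst_edges = minimal_spanning_tree(texts)
```

In this toy corpus the two lexically similar pairs end up directly linked, and a single maximal-distance edge bridges the two groups.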

Similar resources

Distinguishing between Coherent and Incoherent Texts

In this paper, I show that current discourse theories are not able to explain why different orderings of the same textual segments exhibit different properties with respect to coherence. I then propose a criterion of coherence that exploits both the strong tendency of textual units that are associated with certain rhetorical relations to obey a canonical ordering and the inclination of semantic...

Textuality: The ‘form’ to Be Focused on in SLA

Due to the special (procedural) nature of the language (verbal communication) ‘knowledge’, the dominant trends in applied linguistics research in the last few decades have been advocating ‘acquisition’ rather than ‘learning’ activities where the main focus in SL & FL education should be on ‘meaning’ while some ‘focus-on-form’ being justified. But the ‘form’ to be ‘focused-on’ is mostly misconce...

Preservation of Stochastic Orderings of Interdependent Series and Parallel Systems by Componentwise Switching to Exponentiated Models

This paper discusses the preservation of some stochastic orders between two interdependent series and parallel systems when the survival and distribution functions of all components switch to the exponentiated model. For the series systems, the likelihood ratio, hazard rate, usual, aging faster, aging intensity, convex transform, star, superadditive and dispersive orderings, and for the paralle...

Some New Results on Stochastic Orderings between Generalized Order Statistics

In this paper we specify the conditions on the parameters of pairs of gOS’s under which the corresponding generalized order statistics are ordered according to usual stochastic ordering, hazard rate ordering, likelihood ratio ordering and dispersive ordering. We consider this problem in one-sample as well as two-sample problems. We show that some of the results obtained by Franco et al. ...

A Study on Preference Orderings of Mathematical expectation, Expected Utility and Distorted Expectation

One of the challenges for decision-makers in insurance and finance is choosing the appropriate criteria for making decisions. Mathematical expectation, expected utility, and distorted expectation are the three most common measures in this area. In this article, we study these three criteria, and by providing some examples, we review and compare the decisions made by each measure.

Publication date: 2002