Text characterization based on recurrence networks

Abstract

Several complex systems are characterized by intricate features taking place at several scales of time and space. These multiscale characterizations are used in various applications, including better understanding diseases, characterizing transportation systems, and comparing cities, among others. In particular, texts also have a hierarchical structure that can be approached using multiscale concepts and methods. These properties of texts constitute a subject worth further investigation. In addition, more effective approaches to text characterization and analysis can be obtained by emphasizing words with potentially informational content. The present work aims at developing these possibilities while focusing on mesoscopic representations of networks. More specifically, we adopt an extension of the mesoscopic approach to represent text narratives, in which only the recurrent relationships among tagged parts of speech (subject, verb, and direct object) are considered to establish connections among sequential pieces of text (e.g., paragraphs). The characterization of the texts was then achieved by considering scale-dependent complementary methods: accessibility, symmetry, and recurrence signatures. In order to evaluate the potential of these methods, we approached the problem of distinguishing between literary genres (fiction and non-fiction). A set of 300 books organized into the two genres was compared using the aforementioned approaches. All the methods were capable of differentiating, to some extent, between the genres. The accessibility and symmetry reflected the narrative asymmetries, while the recurrence signature provided an indication about the non-sequential semantic connections along the narrative.
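As a rough illustration of the network construction described in the abstract, the sketch below links paragraphs that share at least one tagged term. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `build_recurrence_network`, the `(role, lemma)` tuple format, and the hand-tagged toy paragraphs are all hypothetical, and real input would come from a part-of-speech or dependency tagger that is not shown here. The accessibility, symmetry, and recurrence-signature measurements are not reproduced.

```python
from itertools import combinations


def build_recurrence_network(paragraph_terms):
    """Link paragraphs that share at least one tagged term.

    paragraph_terms: list of sets of (role, lemma) tuples, one set per
    paragraph, in narrative order (hypothetical input format).
    Returns a dict mapping (i, j) with i < j to the number of shared terms.
    """
    edges = {}
    for (i, a), (j, b) in combinations(enumerate(paragraph_terms), 2):
        shared = a & b
        if shared:
            edges[(i, j)] = len(shared)
    return edges


# Hypothetical, hand-tagged toy narrative (assumption: tagging done elsewhere).
paragraphs = [
    {("subject", "captain"), ("verb", "sail"), ("object", "ship")},
    {("subject", "storm"), ("verb", "hit"), ("object", "ship")},
    {("subject", "crew"), ("verb", "sleep")},
    {("subject", "captain"), ("verb", "repair"), ("object", "ship")},
]

edges = build_recurrence_network(paragraphs)

# Edges between paragraphs that are not adjacent in the text; a simple
# stand-in for "non-sequential" connections, not the paper's signature.
non_sequential = {pair for pair in edges if pair[1] - pair[0] > 1}

print(edges)
print(sorted(non_sequential))
```

Here the edge weight is simply the count of shared tagged terms; whether the original work weights edges this way is not stated in the abstract.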

Similar resources

Recurrence Relations for Quotient Moment of Generalized Pareto Distribution Based on Generalized Order Statistics and Characterization

The generalized Pareto distribution plays an important role in reliability, extreme value theory, and other branches of applied probability and statistics. This family of distributions includes the exponential distribution, the Pareto distribution, and the power distribution. In this paper, we established exact expressions and recurrence relations satisfied by the quotient moments of generalized order statistic...

Statistical Techniques for Text Classification Based on Word Recurrence Intervals

The decision as to whether two texts were written by the same author is usually a difficult one. Can an analysis of how the words in a text statistically cluster shed some light on authorship? In this paper we examine both English texts and the Greek source texts of the New Testament. The mathematical techniques developed by Shannon [1,2] and Markov have been used for a number of years to analys...

Prediction of user's trustworthiness in web-based social networks via text mining

In social networks, users need a proper estimation of trust in others to be able to initialize reliable relationships. Some trust evaluation mechanisms have been offered, which use direct ratings to calculate or propagate trust values. However, in some web-based social networks where users only have binary relationships, there is no direct rating available. Therefore, a new method is required t...

Multiscale characterization of recurrence-based phase space networks constructed from time series.

Recently, a framework for analyzing time series by constructing an associated complex network has attracted significant research interest. One of the advantages of the complex network method for studying time series is that complex network theory provides a tool to describe either important nodes, or structures that exist in the networks, at different topological scales. This can then provide di...

Text-independent Speaker Verification Based on Probabilistic Neural Networks

In this paper, a text-independent Probabilistic Neural Network (PNN)-based Speaker Verification system is presented. Modular structure with a distinct PNN for each enrolled speaker is used. A gender-dependent universal background model is built to represent the impostor speakers. A detailed description of the system, as well as the time required for training and processing all the test trials i...

Journal

Journal title: Information Sciences

Year: 2023

ISSN: 0020-0255, 1872-6291

DOI: https://doi.org/10.1016/j.ins.2023.119124