Mathematical and Computational Linguistics Project N.1 Persistent Homology of Syntactic Parameters

نویسنده

  • MATILDE MARCOLLI
چکیده

In recent years, a new approach to data analysis has been developed, based on topological methods. The basic idea is to understand structures in a (large) set of data in a (high-dimensional) space, by associating to it a simplicial topological space and studying its topology. The starting point is a set of data with a proximity parameter (such as a distance function). The simplicial complex (Vietoris-Rips complex) is constructed by taking the set of data as the vertex set and assigning a k-dimensional face (k-simplex) to a k + 1-tuple of data {x0, . . . , xk} iff the distances satisfy d(xi, xj) ≤ for all 0 ≥ i, j ≤ k. Other versions of simplicial complexes associated to sets of data are described in [1]. General introductions to topological data analysis can be found in [1], [2], [3], [6]. Software packages are available at [5].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mathematical and Computational Linguistics Project N.4 Syntactic Phylogenetic Trees

1. Linguistic Phylogenetic Trees The reconstruction of phylogenetic trees of language families is one of the main problems in Historical Linguistics. In recent years, computational methods have been used, mostly borrowed from similar techniques in mathematical biology, see for instance the collection of papers in [1]. Mostly, the computational reconstructions of linguistic phylogenetic trees fo...

متن کامل

Mathematical and Computational Linguistics Project N.3 Syntactic Parameters as a Spin Glass Model

The project consists of running simulations of language evolution as a spin-glass model, on a graph with vertices a group of languages and edges representing the interaction between them. The syntactic parameters are viewed as spin variables at the vertices. The current data of syntactic parameters of various languages provide the initial configuration. An extensive database of syntactic parame...

متن کامل

Neuron detection in stack images: a persistent homology interpretation

Automation and reliability are the two main requirements when computers are applied in Life Sciences. In this paper we report on an application to neuron recognition, an important step in our long-term project of providing software systems to the study of neural morphology and functionality from biomedical images. Our algorithms have been implemented in an ImageJ plugin called NeuronPersistentJ...

متن کامل

Syntactic Structures and Rhetorical Functions of Electrical Engineering, Psychiatry, and Linguistics Research Article Titles in English and Persian: A Cross-linguistic and Cross-disciplinary Study

A research article (RA) title is the first and foremost feature that attracts the reader's attention, the feature from which she/he may decide whether the whole article is worth reading. The present study attempted to investigate syntactic structures and rhetorical functions of RA titles written in English and Persian and published in journals in three disciplines of Electrical Engineering, Psy...

متن کامل

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015