Large deviation principles for the Ewens-Pitman sampling model
نویسندگان
چکیده
منابع مشابه
Generalized Ewens–Pitman model for Bayesian clustering
We propose a Bayesian method for clustering from discrete data structures that commonly arise in genetics and other applications. This method is equivariant with respect to relabelling units; unsampled units do not interfere with sampled data; and missing data do not hinder inference. Cluster inference using the posterior mode performs well on simulated and real datasets, and the posterior pred...
متن کاملAsymptotics for the number of blocks in a conditional Ewens-Pitman sampling model
The study of random partitions has been an active research area in probability over the last twenty years. A quantity that has attracted a lot of attention is the number of blocks in the random partition. Depending on the area of applications this quantity could represent the number of species in a sample from a population of individuals or the number of cycles in a random permutation, etc. In ...
متن کاملThe ubiquitous Ewens sampling formula
Ewens’s sampling formula exemplifies the harmony of mathematical theory, statistical application, and scientific discovery. The formula not only contributes to the foundations of evolutionary molecular genetics, the neutral theory of biodiversity, Bayesian nonparametrics, combinatorial stochastic processes, and inductive inference but also emerges from fundamental concepts in probability theory...
متن کاملRejoinder: The Ubiquitous Ewens Sampling Formula
The main article and extended discussion point to Ewens’s sampling formula (ESF) as one of a few essential probability distributions. Arratia, Barbour and Tavaré explain the emergence of ESF by the Feller coupling and also touch on number theoretic considerations; Feng provides deeper background on diffusion processes and nonequilibrium versions of ESF; and McCullagh regales us with a story fro...
متن کاملConvergence Time to the Ewens Sampling Formula
In this paper, we establish the cutoff phenomena for the discrete time infinite alleles Moran model. If M is the population size and μ is the mutation rate, we find a cutoff time of log(Mμ)/μ generations. The stationary distribution for this process in the case of sampling without replacement is the Ewens sampling formula. We show that the bound for the total variation distance from the generat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronic Journal of Probability
سال: 2015
ISSN: 1083-6489
DOI: 10.1214/ejp.v20-3668