Markov Bases for Noncommutative Harmonic Analysis of Partially Ranked Data
نویسندگان
چکیده
Given the result v0 of a survey and a nested collection of summary statistics that could be used to describe that result, it is natural to ask which of these summary statistics best describe v0. In 1998 Diaconis and Sturmfels presented an approach for determining the conditional significance of a higher order statistic, after sampling a space conditioned on the value of a lower order statistic. Their approach involves the computation of a Markov basis, followed by the use of a Markov process with stationary hypergeometric distribution to generate a sample. This technique for data analysis has become an accepted tool of algebraic statistics, particularly for the study of fully ranked data. In this thesis, we explore the extension of this technique for data analysis to the study of partially ranked data, focusing on data from surveys in which participants are asked to identify their top k choices of n items. Before we move on to our own data analysis, though, we present a thorough discussion of the Diaconis–Sturmfels algorithm and its use in data analysis. In this discussion, we attempt to collect together all of the background on Markov bases, Markov processes, Gröbner bases, implicitization theory, and elimination theory, that is necessary for a full understanding of this approach to data analysis.
منابع مشابه
ar X iv : m at h / 04 05 06 0 v 1 [ m at h . A C ] 4 M ay 2 00 4 Markov Bases for Noncommutative Fourier Analysis of Ranked Data
To calibrate Fourier analysis of S5 ranking data by Markov chain Monte-Carlo techniques, a set of moves (Markov basis) is needed. We calculate this Markov basis, and use it to provide a new statistical analysis of two datasets. The calculation involves a large Gröbner basis computation (45825 generators in 120 indeterminates), but reduction to a minimal basis and by natural symmetries leads to ...
متن کاملMarkov bases for noncommutative Fourier analysis of ranked data
To calibrate Fourier analysis of S5 ranking data by Markov chain Monte Carlo techniques, a set of moves (Markov basis) is needed. We calculate this basis, and use it to provide a new statistical analysis of two data sets. The calculation involves a large Gröbner basis computation (45825 generators), but reduction to a minimal basis and reduction by natural symmetries leads to a remarkably small...
متن کامل5 Markov Bases for Noncommutative Fourier Analysis of Ranked Data
To calibrate Fourier analysis of S5 ranking data by Markov chain Monte Carlo techniques, a set of moves (Markov basis) is needed. We calculate this basis, and use it to provide a new statistical analysis of two data sets. The calculation involves a large Gröbner basis computation (45825 generators), but reduction to a minimal basis and reduction by natural symmetries leads to a remarkably small...
متن کامل50 60 v 2 9 M ar 2 00 5 Markov Bases for Noncommutative Fourier Analysis of Ranked Data
To calibrate Fourier analysis of S5 ranking data by Markov chain Monte Carlo techniques, a set of moves (Markov basis) is needed. We calculate this basis, and use it to provide a new statistical analysis of two data sets. The calculation involves a large Gröbner basis computation (45825 generators), but reduction to a minimal basis and reduction by natural symmetries leads to a remarkably small...
متن کاملDynamic Harmonic Analysis of Long Term over Voltages Based on Time Varying Fourier series in Extended Harmonic Domain
Harmonics have become an important issue in modern power systems. The widespread penetration of non-linear loads to emerging power systems has turned power quality analysis into an important operation issue under both steady state and transient conditions. This paper employs an Extended Harmonic Domain (EHD) based framework for dynamic analysis of long term analysis over voltages during the tra...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011