Association Plots: visualizing cluster-specific associations in high-dimensional correspondence analysis biplots

Abstract

In molecular biology, just as in many other fields of science, data often come in the form of matrices or contingency tables with observations (rows) for a set of variables (columns). While projection methods like principal component analysis or correspondence analysis (CA) can be applied for obtaining an overview of such data, in cases where the matrix is very large the associated loss of information upon projection into two or three dimensions may be dramatic. However, when the variables can be grouped into clusters, this opens up a new angle on the data. We focus on the question of which observations are associated to a cluster and distinguish it from other clusters. CA employs a geometry geared towards answering this question. We exploit this feature in order to introduce Association Plots for visualizing cluster-specific observations in complex data. Regardless of the dimensionality of the data, Association Plots are two-dimensional and depict the observations associated to a cluster of variables. We demonstrate our method on small data sets and then use Association Plots to study a challenging genomic data set comprising >10,000 samples. We show that Association Plots can clearly highlight those observations which characterise a cluster of variables.
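
For readers unfamiliar with the geometry the abstract refers to, the following is a minimal sketch of textbook correspondence analysis computed through a singular value decomposition of the standardized residuals. It is not the authors' implementation and it does not construct an Association Plot; the function name `correspondence_analysis` and the toy contingency table are hypothetical and chosen only for illustration.

```python
import numpy as np


def correspondence_analysis(table):
    """Textbook CA sketch: returns row coordinates, column coordinates
    (both principal coordinates) and the singular values."""
    N = np.asarray(table, dtype=float)
    P = N / N.sum()            # correspondence matrix (relative frequencies)
    r = P.sum(axis=1)          # row masses
    c = P.sum(axis=0)          # column masses
    expected = np.outer(r, c)  # frequencies expected under independence
    # Standardized residuals: (p_ij - r_i c_j) / sqrt(r_i c_j)
    S = (P - expected) / np.sqrt(expected)
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    rows = (U * sv) / np.sqrt(r)[:, None]     # row principal coordinates
    cols = (Vt.T * sv) / np.sqrt(c)[:, None]  # column principal coordinates
    return rows, cols, sv


# Hypothetical 3 x 3 contingency table (illustrative values only).
table = [[20, 5, 1],
         [4, 18, 3],
         [2, 6, 25]]
rows, cols, sv = correspondence_analysis(table)
# Share of total inertia carried by each dimension; a 2D biplot keeps
# only the first two entries and discards the rest.
explained = sv ** 2 / (sv ** 2).sum()
```

The squared singular values sum to the total inertia (the chi-squared statistic divided by the table total), so `explained` gives a rough sense of how much information a two- or three-dimensional projection discards, which is the loss the abstract describes for very large matrices.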

Similar resources

RLE plots: Visualizing unwanted variation in high dimensional data

Unwanted variation can be highly problematic and so its detection is often crucial. Relative log expression (RLE) plots are a powerful tool for visualizing such variation in high dimensional data. We provide a detailed examination of these plots, with the aid of examples and simulation, explaining what they are and what they can reveal. RLE plots are particularly useful for assessing whether a ...

Visualizing Independence Using Extended Association Plots

Association plots—a visualization technique for the independence problem in 2-way contingency tables—are extended in three directions: 1. The visualization is enhanced by using colors for the importance of the residuals. 2. The implementation currently available in R is improved using a more modular design and allowing a more flexible specification of plotting parameters. 3. Two methods for the...

Visualizing Independence Using Extended Association and Mosaic Plots

Two visualization techniques for the independence problem in 2-way contingency tables (association and mosaic plots) are extended in two directions: 1. The visualization is enhanced by displaying the significance of an appropriate test for independence and by using improved color schemes. 2. The implementation in the R system is improved using a more modular design and allowing for more flexible s...

Visualizing Multiple Quantile Plots.

Multiple-quantile plots provide a powerful graphical method for comparing the distributions of two or more populations. This article develops a method of visualizing triple-quantile plots and their associated confidence tubes, thus extending the notion of a quantile-quantile (QQ) plot to three dimensions. More specifically, we consider three independent one-dimensional random samples with corre...

Cluster Correspondence Analysis.

A method is proposed that combines dimension reduction and cluster analysis for categorical data by simultaneously assigning individuals to clusters and optimal scaling values to categories in such a way that a single between variance maximization objective is achieved. In a unified framework, a brief review of alternative methods is provided and we show that the proposed method is equivalent t...

Journal

Journal title: Applied statistics

Year: 2023

ISSN: 1467-9876, 0035-9254

DOI: https://doi.org/10.1093/jrsssc/qlad039