Association Plots: visualizing cluster-specific associations in high-dimensional correspondence analysis biplots
نویسندگان
چکیده
Abstract In molecular biology, just as in many other fields of science, data often come the form matrices or contingency tables with observations (rows) for a set variables (columns). While projection methods like principal component analysis correspondence (CA) can be applied obtaining an overview such data, cases where matrix is very large associated loss information upon into two three dimensions may dramatic. However, when grouped clusters, this opens up new angle on data. We focus question which are to cluster and distinguish it from clusters. CA employs geometry geared towards answering question. exploit feature order introduce Association Plots visualizing cluster-specific complex Regardless dimensionality two-dimensional depict variables. demonstrate our method small sets then use study challenging genomic comprising >10,000 samples. show that clearly highlight those characterise
منابع مشابه
RLE plots: Visualizing unwanted variation in high dimensional data
Unwanted variation can be highly problematic and so its detection is often crucial. Relative log expression (RLE) plots are a powerful tool for visualizing such variation in high dimensional data. We provide a detailed examination of these plots, with the aid of examples and simulation, explaining what they are and what they can reveal. RLE plots are particularly useful for assessing whether a ...
متن کاملVisualizing Independence Using Extended Association Plots
Association plots—a visualization technique for the independence problem in 2-way contingency tables—are extended in three directions: 1. The visualization is enhanced by using colors for the importance of the residuals. 2. The implementation currently available in R is improved using a more modular design and allowing a more flexible specification of plotting parameters. 3. Two methods for the...
متن کاملVisualizing Independence Using Extended Association and Mosaic Plots
Two visualization techniques for the independence problem in 2-way contingency tables—assocation and mosaic plots—are extended in two directions: 1. The visualization is enhanced by displaying the significance of an appropriate test for independence and by using improved color schemes. 2. The implementation in the R system is improved using a more modular design and allowing for more flexible s...
متن کاملVisualizing Multiple Quantile Plots.
Multiple-quantile plots provide a powerful graphical method for comparing the distributions of two or more populations. This article develops a method of visualizing triple-quantile plots and their associated confidence tubes, thus extending the notion of a quantile-quantile (QQ) plot to three dimensions. More specifically, we consider three independent one-dimensional random samples with corre...
متن کاملCluster Correspondence Analysis.
A method is proposed that combines dimension reduction and cluster analysis for categorical data by simultaneously assigning individuals to clusters and optimal scaling values to categories in such a way that a single between variance maximization objective is achieved. In a unified framework, a brief review of alternative methods is provided and we show that the proposed method is equivalent t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied statistics
سال: 2023
ISSN: ['1467-9876', '0035-9254']
DOI: https://doi.org/10.1093/jrsssc/qlad039