Abstract To solve key biomedical problems, experimentalists now routinely measure millions or billions of features (dimensions) per sample, with the hope that data science techniques will be able to build accurate data-driven inferences. Because sample sizes are typically orders magnitude smaller than dimensionality these data, valid inferences require finding a low-dimensional representation p...