Sinkhorn Distances: Lightspeed Computation of Optimal Transport

نویسنده

  • Marco Cuturi
چکیده

Optimal transport distances are a fundamental family of distances for probability measures and histograms of features. Despite their appealing theoretical properties, excellent performance in retrieval tasks and intuitive formulation, their computation involves the resolution of a linear program whose cost can quickly become prohibitive whenever the size of the support of these measures or the histograms’ dimension exceeds a few hundred. We propose in this work a new family of optimal transport distances that look at transport problems from a maximumentropy perspective. We smooth the classic optimal transport problem with an entropic regularization term, and show that the resulting optimum is also a distance which can be computed through Sinkhorn’s matrix scaling algorithm at a speed that is several orders of magnitude faster than that of transport solvers. We also show that this regularized distance improves upon classic optimal transport distances on the MNIST classification problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration

Computing optimal transport distances such as the earth mover’s distance is a fundamental problem in machine learning, statistics, and computer vision. Despite the recent introduction of several algorithms with good empirical performance, it is unknown whether general optimal transport distances can be approximated in near-linear time. This paper demonstrates that this ambitious goal is in fact...

متن کامل

Tsallis Regularized Optimal Transport and Ecological Inference

Optimal transport is a powerful framework for computing distances between probability dis-tributions. We unify the two main approaches to optimal transport, namely Monge-Kantorovitchand Sinkhorn-Cuturi, into what we define as Tsallis regularized optimal transport (trot).trot interpolates a rich family of distortions from Wasserstein to Kullback-Leibler, encompass-ing as well...

متن کامل

Supplementary Material for Multilevel Clustering via Wasserstein Means

i,j ∈ Rk×k ′ + is the cost matrix, i.e. matrix of pairwise distances of elements betweenG andG′, and 〈A,B〉 = tr(AB) is the Frobenius dot-product of matrices. The optimal T ∈ Π(G,G′) in optimization problem (1) is called the optimal coupling ofG andG′, representing the optimal transport between these two measures. When k = k′, the complexity of best algorithms for finding the optimal transport i...

متن کامل

Wasserstein Discriminant Analysis

Wasserstein Discriminant Analysis (WDA) is a new supervised method that can improve classification of high-dimensional data by computing a suitable linear map onto a lower dimensional subspace. Following the blueprint of classical Linear Discriminant Analysis (LDA), WDA selects the projection matrix that maximizes the ratio of two quantities: the dispersion of projected points coming from diffe...

متن کامل

Greedy stochastic algorithms for entropy-regularized optimal transport problems

Optimal transport (OT) distances are finding evermore applications in machine learning and computer vision, but their wide spread use in larger-scale problems is impeded by their high computational cost. In this work we develop a family of fast and practical stochastic algorithms for solving the optimal transport problem with an entropic penalization. This work extends the recently developed Gr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013