Distance Functions: Theory, Algorithms and Applications
نویسندگان
چکیده
Distance functions are at the core of numerous computer vision and machine learning tasks. For example, a multiple view geometry application (such as homography estimation), often includes a descriptor matching step. First, small image patches are selected. Then, descriptors (vectors) are computed to describe these patches. To decide if two descriptors (from two different images) are from the same real-world location the descriptors are compared using a distance function. The choice of the distance function determines both the accuracy and the speed of the method. Distance functions are also used for image retrieval and determine both the percent of similar images returned and the time it takes to return them. In machine learning, knearest neighbor classification performance is effected by the chosen distance function. This thesis introduces several new distance functions and investigates their properties. Special emphasis is on practical applicability with theoretical insights. The thesis presents efficient algorithms to compute several distances. The distances are designed to be robust to common image noise such as occlusion, geometrical transforms, light changes and non-rigid deformations. On the other hand, they are designed to be as distinctive as possible. Our proposed methods have been successfully used both by computer vision researchers and by researchers in other fields. The success of the methods in other fields is probably because the noise characteristics in those fields are similar to image noise characteristics. The first and second parts of the thesis are devoted to the Earth Mover’s Distance (EMD) [1]. The EMD is a generalization of the transportation distance (also known as Monge-Kantorovich, Mallows, 1 Wasserstein and match distance) for non-normalized histograms. We introduce a new generalization of the transportation distance for non-normalized histograms called, ÊMD. Unlike the classic EMD, ÊMD is a metric even when the histograms are non-normalized. Additionally, in practice, ÊMD is often better than EMD. We also show that ÊMD is a generalization of the L1 distance. Histograms of oriented gradient descriptors are ubiquitous tools in numerous computer vision tasks. Unfortunately, the distances that are used to match such descriptors do not take into account the special structure of these histograms. One problem with these distances is that they only compare corresponding bins of histograms. Although there are cross-bin distances that take into account the amount of similarity between all bins, they are not adequate to the histograms of oriented gradients that are computed from images. Additionally, most cross-bin distances are too slow. We present a
منابع مشابه
یادگیری نیمه نظارتی کرنل مرکب با استفاده از تکنیکهای یادگیری معیار فاصله
Distance metric has a key role in many machine learning and computer vision algorithms so that choosing an appropriate distance metric has a direct effect on the performance of such algorithms. Recently, distance metric learning using labeled data or other available supervisory information has become a very active research area in machine learning applications. Studies in this area have shown t...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملON THE FUNCTION OF BLOCK ANTI DIAGONAL MATRICES AND ITS APPLICATION
The matrix functions appear in several applications in engineering and sciences. The computation of these functions almost involved complicated theory. Thus, improving the concept theoretically seems unavoidable to obtain some new relations and algorithms for evaluating these functions. The aim of this paper is proposing some new reciprocal for the function of block anti diagonal matrices. More...
متن کاملSample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کاملApplications of TP2 Functions in Theory of Stochastic Orders: A Review of some Useful Results
In the literature on Statistical Reliability Theory and Stochastic Orders, several results based on theory of TP2/RR2 functions have been extensively used in establishing various properties. In this paper, we provide a review of some useful results in this direction and highlight connections between them.
متن کاملStrict L∞ Isotonic Regression
Given a function f and weightsw on the vertices of a directed acyclic graphG, an isotonic regression of (f, w) is an order-preserving real-valued function that minimizes the weighted distance to f among all order-preserving functions. When the distance is given via the supremum norm there may be many isotonic regressions. One of special interest is the strict isotonic regression, which is the l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011