Corrigendum to “Efficient Similarity Search and Classification via Rank Aggregation” by
نویسندگان
چکیده
In this corrigendum, we correct an error in the paper [1]. The error was discovered by Alexandr Andoni, and the corrected theorem is due to the three authors of [1], along with Alexandr Andoni and Mihai Pǎtraşcu. Theorem 4 of [1] states: Let D be a collection of n points in R. Let r1, . . . rm be random unit vectors in R, where m = α −2 logn with α suitably chosen. Let q ∈ R be an arbitrary point, and define, for each i with 1 ≤ i ≤ m, the ranked list Li of the n points in D by sorting them in increasing order of their distances to the projection of q along ri. For each element x of D, let medrank(x) = median(L1(x), . . . , Lm(x)). Let z be a member of D such that medrank(z) is minimized. Then with probability at least 1−1/n, we have ‖z−q‖2 ≤ (1+ )‖x−q‖2 for all x ∈ D. As stated, the above theorem does not hold, but a version of it holds if one replaces the median over ranks by a median over suitably defined scores. Below, we give a counterexample to the original theorem, and then present our modification to the theorem, and the resulting algorithm.
منابع مشابه
On the use of Heronian means in a similarity classifier
This paper introduces new similarity classifiers using the Heronian mean, and the generalized Heronian mean operators. We examine the use of these operators at the aggregation step within the similarity classifier. The similarity classifier was earlier studied with other operators, in particular with an arithmetic mean, generalized mean, OWA operators, and many more. The two classifiers here ar...
متن کاملRank Aggregation for Similar Items
The problem of combining the ranked preferences of many experts is an old and surprisingly deep problem that has gained renewed importance in many machine learning, data mining, and information retrieval applications. Effective rank aggregation becomes difficult in real-world situations in which the rankings are noisy, incomplete, or even disjoint. We address these difficulties by extending sev...
متن کاملA comprehensive experimental comparison of the aggregation techniques for face recognition
In face recognition, one of the most important problems to tackle is a large amount of data and the redundancy of information contained in facial images. There are numerous approaches attempting to reduce this redundancy. One of them is information aggregation based on the results of classifiers built on selected facial areas being the most salient regions from the point of view of classificati...
متن کاملDiscovering Popular Clicks\' Pattern of Teen Users for Query Recommendation
Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008