Near-optimal clustering in the k-machine model
نویسندگان
چکیده
The clustering problem, in its many variants, has numerous applications operations research and computer science (e.g., bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers focused on designing algorithms for problems models computation suited large-scale such as MapReduce, Pregel, streaming models. k-machine model (Klauck et al., (SODA 2015) [8]) is a simple, message-passing distributed graph processing. This paper considers three the most prominent examples problems: uncapacitated facility location p-median p-center problem presents O(1)-factor approximation these running O˜(n/k) rounds model. These are optimal up to polylogarithmic factors because this also shows Ω˜(n/k) lower bounds obtaining polynomial-factor problems. first results We assume that metric provided input only implicitly provided, an edge-weighted nutshell, our main technical contribution show constant-factor all can be obtained by learning small portion metric.
منابع مشابه
SVM Venn Machine with k-Means Clustering
In this paper, we introduce a new method of designing Venn Machine taxonomy based on Support Vector Machines and k-means clustering for both binary and multi-class problems. We compare this algorithm to some other multi-probabilistic predictors including SVM Venn Machine with homogeneous intervals and a recently developed algorithm called Venn-ABERS predictor. These algorithms were tested on a ...
متن کاملNear-Optimal Virtual Machine Packing Based on Resource Requirement of Service Demands Using Pattern Clustering
Upon the expansion of Cloud Computing and the positive outlook of organizations with regard to the movements towards using cloud computing and their expanding utilization of such valuable processing method, as well as the solutions provided by the cloud infrastructure providers with regard to the reduction of the costs of processing resources, the problem of organizing resources in a cloud envi...
متن کاملComparing Model-based Versus K-means Clustering for the Planar Shapes
In some fields, there is an interest in distinguishing different geometrical objects from each other. A field of research that studies the objects from a statistical point of view, provided they are invariant under translation, rotation and scaling effects, is known as the statistical shape analysis. Having some objects that are registered using key points on the outline...
متن کاملAlmost Optimal Solutions to k-Clustering Problems
Pankaj Kumar1 and Piyush Kumar2 1 Purdue University 2 Florida State University (a) American Cities (b) Lyria bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bcbc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bcbc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc...
متن کاملOptimal clustering for detecting near-native conformations in protein docking.
Clustering is one of the most powerful tools in computational biology. The conventional wisdom is that events that occur in clusters are probably not random. In protein docking, the underlying principle is that clustering occurs because long-range electrostatic and/or desolvation forces steer the proteins to a low free-energy attractor at the binding region. Something similar occurs in the dock...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Theoretical Computer Science
سال: 2022
ISSN: ['1879-2294', '0304-3975']
DOI: https://doi.org/10.1016/j.tcs.2021.11.026