Two Medoid-Based Algorithms for Clustering Sets

نویسندگان

چکیده

This paper proposes two algorithms for clustering data, which are variable-sized sets of elementary items. An example such data occurs in the analysis a medical diagnosis, where goal is to detect human subjects who share common diseases possibly predict future illnesses from previous history. The first proposed algorithm based on K-medoids and second extends random swap algorithm, has proven be capable efficient careful clustering; both depend distance function among objects (sets), can use application-sensitive weights or priorities. makes it possible exploit several seeding methods that improve accuracy. A key factor their parallel implementation Java, functional programming using streams lambda expressions. parallelism smooths out O(N2) computational cost behind indexes as Silhouette index allows handling non-trivial datasets. applies benchmark case studies demonstrates how accurate time-efficient solutions achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online fuzzy medoid based clustering algorithms

This paper describes two new online fuzzy clustering algorithms based on medoids. These algorithms have been developed to deal with either very large datasets that do not fit in main memory or data streams in which data are produced continuously. The innovative aspect of our approach is the combination of fuzzy methods, which are well adapted to outliers and overlapping clusters, with medoids a...

متن کامل

K-medoid-style Clustering Algorithms for Supervised Summary Generation

This paper centers on the discussion of k-medoid-style clustering algorithms for supervised summary generation. This task requires clustering techniques that identify class-uniform clusters. This paper investigates such a novel clustering technique we term supervised clustering. Our work focuses on the generalization of k-medoid-style clustering algorithms. We investigate two supervised cluster...

متن کامل

MACOC: A Medoid-Based ACO Clustering Algorithm

The application of ACO-based algorithms in data mining is growing over the last few years and several supervised and unsupervised learning algorithms have been developed using this bio-inspired approach. Most recent works concerning unsupervised learning have been focused on clustering, showing great potential of ACO-based techniques. This work presents an ACO-based clustering algorithm inspire...

متن کامل

Visually Interacting with k-medoid-based Algorithms

Although database management systems supply basic tools to data recovery and analysis through standard transactions in large amounts of data, the analysis of these data in high dimensional spaces is an overwhelming job. On the other hand, human beings have a great capacity to quickly absorb and understand information presented in graphical form. As data clustering is one of the key techniques i...

متن کامل

The Impact of Triangular Inequality Violations on Medoid-Based Clustering

We evaluate the extent to which a dissimilarity space differs from a metric space by introducing the notion of metric point and metricity in a dissimilarity space. The the effect of triangular inequality violations on medoid-based clustering of objects in a dissimilarity space is examined and the notion of rectifier is introduce to transform a dissimilarity space into a metric space.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Algorithms

سال: 2023

ISSN: ['1999-4893']

DOI: https://doi.org/10.3390/a16070349