Interpretable Clustering via Multi-Polytope Machines

نویسندگان

چکیده

Clustering is a popular unsupervised learning tool often used to discover groups within larger population such as customer segments, or patient subtypes. However, despite its use for subgroup discovery and description few state-of-the-art algorithms provide any rationale behind the clusters found. We propose novel approach interpretable clustering that both data points constructs polytopes around discovered explain them. Our framework allows additional constraints on including ensuring hyperplanes constructing polytope are axis-parallel sparse with integer coefficients. formulate problem of via Mixed-Integer Non-Linear Program (MINLP). To solve our formulation we two phase where first initialize using alternating minimization, then coordinate descent boost performance. benchmark suite synthetic real world problems, algorithm outperforms state art non-interpretable algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning

This paper proposes a learning strategy that extracts objectpart concepts from a pre-trained convolutional neural network (CNN), in an attempt to 1) explore explicit semantics hidden in CNN units and 2) gradually grow a semantically interpretable graphical model on the pre-trained CNN for hierarchical object understanding. Given part annotations on very few (e.g. 3–12) objects, our method mines...

متن کامل

Interpretable Sparse High-Order Boltzmann Machines

Fully-observable high-order Boltzmann Machines are capable of identifying explicit highorder feature interactions theoretically. However, they have never been used in practice due to their prohibitively high computational cost for inference and learning. In this paper, we propose an efficient approach for learning a fully-observable high-order Boltzmann Machine based on sparse learning and cont...

متن کامل

Interpretable support vector machines for functional data

Support Vector Machines (SVM) has been shown to be a powerful nonparametric classification technique even for high-dimensional data. Although predictive ability is important, obtaining an easy-to-interpret classifier is also crucial in many applications. Linear SVM provides a classifier based on a linear score. In the case of functional data, the coefficient function that defines such linear sc...

متن کامل

Multi-way Interacting Regression via Factorization Machines

Modeling interactions Definition 1. Let S = {e1, . . . , eD} be a set of D objects (e.g. indices of variables) and Z = {Z1, . . . ,ZJ} set of J subsets of S: Zj ⊂ S, for j = 1, . . . , J . Then we say that G = (S,Z) is a hypergraph with D vertices and J hyperedges. Interactions form a hypergraph. Z incidence matrix of interactions: Z ∈ {0, 1}D×J , where Zi1j = Zi2j = 1 iff i1 and i2 are part of...

متن کامل

Mixtures of Rectangles: Interpretable Soft Clustering

To be eeective, data-mining has to conclude with a succinct description of the data. To this end, we explore a clustering technique that nds dense regions in data. By constraining our model in a speciic way, we are able to represent the interesting regions as an intersection of intervals. This has the advantage of being easily read and understood by humans. Speciically, we t the data to a mixtu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i7.20693