Interpretable Clustering via Multi-Polytope Machines
نویسندگان
چکیده
Clustering is a popular unsupervised learning tool often used to discover groups within larger population such as customer segments, or patient subtypes. However, despite its use for subgroup discovery and description few state-of-the-art algorithms provide any rationale behind the clusters found. We propose novel approach interpretable clustering that both data points constructs polytopes around discovered explain them. Our framework allows additional constraints on including ensuring hyperplanes constructing polytope are axis-parallel sparse with integer coefficients. formulate problem of via Mixed-Integer Non-Linear Program (MINLP). To solve our formulation we two phase where first initialize using alternating minimization, then coordinate descent boost performance. benchmark suite synthetic real world problems, algorithm outperforms state art non-interpretable algorithms.
منابع مشابه
Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning
This paper proposes a learning strategy that extracts objectpart concepts from a pre-trained convolutional neural network (CNN), in an attempt to 1) explore explicit semantics hidden in CNN units and 2) gradually grow a semantically interpretable graphical model on the pre-trained CNN for hierarchical object understanding. Given part annotations on very few (e.g. 3–12) objects, our method mines...
متن کاملInterpretable Sparse High-Order Boltzmann Machines
Fully-observable high-order Boltzmann Machines are capable of identifying explicit highorder feature interactions theoretically. However, they have never been used in practice due to their prohibitively high computational cost for inference and learning. In this paper, we propose an efficient approach for learning a fully-observable high-order Boltzmann Machine based on sparse learning and cont...
متن کاملInterpretable support vector machines for functional data
Support Vector Machines (SVM) has been shown to be a powerful nonparametric classification technique even for high-dimensional data. Although predictive ability is important, obtaining an easy-to-interpret classifier is also crucial in many applications. Linear SVM provides a classifier based on a linear score. In the case of functional data, the coefficient function that defines such linear sc...
متن کاملMulti-way Interacting Regression via Factorization Machines
Modeling interactions Definition 1. Let S = {e1, . . . , eD} be a set of D objects (e.g. indices of variables) and Z = {Z1, . . . ,ZJ} set of J subsets of S: Zj ⊂ S, for j = 1, . . . , J . Then we say that G = (S,Z) is a hypergraph with D vertices and J hyperedges. Interactions form a hypergraph. Z incidence matrix of interactions: Z ∈ {0, 1}D×J , where Zi1j = Zi2j = 1 iff i1 and i2 are part of...
متن کاملMixtures of Rectangles: Interpretable Soft Clustering
To be eeective, data-mining has to conclude with a succinct description of the data. To this end, we explore a clustering technique that nds dense regions in data. By constraining our model in a speciic way, we are able to represent the interesting regions as an intersection of intervals. This has the advantage of being easily read and understood by humans. Speciically, we t the data to a mixtu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i7.20693