Multivariate outlier explanations using Shapley values and Mahalanobis distances

نویسندگان

چکیده

For the purpose of explaining multivariate outlyingness, it is shown that squared Mahalanobis distance an observation can be decomposed into outlyingness contributions originating from single variables. The decomposition obtained using Shapley value, a well-known concept game theory became popular in context Explainable AI. In addition to outlier explanation, this also relates recent formulation cellwise where values employed obtain variable for outlying observations with respect their “expected” position given data structure. combination distances, calculated at low numerical cost, making them even more attractive tool interpretation. Simulations and real-world examples demonstrate usefulness these concepts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Local Invariant Mahalanobis Distances

For many tasks and data types, there are natural transformations to which the data should be invariant or insensitive. For instance, in visual recognition, natural images should be insensitive to rotation and translation. This requirement and its implications have been important in many machine learning applications, and tolerance for image transformations was primarily achieved by using robust...

متن کامل

Large Margin Nearest Neighbor Classification using Curved Mahalanobis Distances

We consider the supervised classification problem of machine learning in Cayley-Klein projective geometries: We show how to learn a curved Mahalanobis metric distance corresponding to either the hyperbolic geometry or the elliptic geometry using the Large Margin Nearest Neighbor (LMNN) framework. We report on our experimental results, and further consider the case of learning a mixed curved Mah...

متن کامل

Multivariate Outlier Detection Using Independent Component Analysis

The recent developments by considering a rather unexpected application of the theory of Independent component analysis (ICA) found in outlier detection , data clustering and multivariate data visualization etc . Accurate identification of outliers plays an important role in statistical analysis. If classical statistical models are blindly applied to data containing outliers, the results can be ...

متن کامل

Weighted Shapley levels values

This paper presents a collection of four different classes of weighted Shapley levels values. All classes contain generalisations of the weighted Shapley values to cooperative games with a level structure. The first class is an upgrade of the weighted Shapley levels value in Gómez-Rúa and Vidal-Puga (2011), who use the size of components as weights. The following classes contain payoff vectors ...

متن کامل

Shapley Inconsistency Values

There are relatively few proposals for inconsistency measures for propositional belief bases. However inconsistency measures are potentially as important as information measures for artificial intelligence, and more generally for computer science. In particular, they can be useful to define various operators for belief revision, belief merging, and negotiation. The measures that have been propo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Econometrics and Statistics

سال: 2023

ISSN: ['2452-3062', '2468-0389']

DOI: https://doi.org/10.1016/j.ecosta.2023.04.003