Finding an unknown number of multivariate outliers
نویسندگان
چکیده
We use the forward search to provide robust Mahalanobis distances to detect the presence of outliers in a sample of multivariate normal data. Theoretical results on order statistics and on estimation in truncated samples provide the distribution of our test statistic. We also introduce several new robust distances with associated distributional results. Comparisons of our procedure with tests using other robust Mahalanobis distances show the good size and high power of our procedure. We also provide a unification of results on correction factors for estimation from truncated samples.
منابع مشابه
Identification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
متن کاملWho Should be Interviewed? A Response from Cluster Analysis
Objective: This article presents an application of cluster analysis for social sciences researches especially those studies that have an interview as part of their data collection. This application is more suitable for sequential mixed method researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews. Methods: In more detail, the algorithm (i....
متن کاملSequential Application of Multivariate Outlier Test : a Robust Approach
Identification of outliers in multivariate data is not trivial. especially when there exists several outliers in the data. The classical identification method based on the sample mean and sample covariance matrix cannot always find them, because the classicd rnean and covariance matris are themselves affected by outliers. This problem is termed as masting7 because the outliers get maslied by ea...
متن کاملOutliers in biometrical data: What's old, What's new
The problem of finding and identifying outliers in multivariate measurement data is as dealing with biometrical Nowadays the data sets are larger both in the number of specimens and in the number of features characterizing the given specimens. The knowledge about the outliers is especially important when constructing models describing the data. Therefore, since long, some methodologies (called ...
متن کاملLocal multivariate outliers as geochemical anomaly halos indicators, a case study: Hamich area, Southern Khorasan, Iran
Anomaly recognition has always been a prominent subject in preliminary geochemical explorations. Among the regional geochemical data processing, there are a range of statistical and data mining techniques as well as different mapping methods, which serve as presentations of the outputs. The outlier’s values are of interest in the investigations where data are gathered under controlled condition...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008