Filtering of multivariate samples containing “outliers” for clustering
نویسندگان
چکیده
منابع مشابه
Interpretation of multivariate outliers for compositional data
data Peter Filzmoser, Karel Hron, Clemens Reimann Department of Statistics and Probability Theory, Vienna University of Technology, Wiedner Hauptstraße 8-10, A-1040 Vienna, Austria. Tel +43 1 58801 10733, FAX +43 1 58801 10799 Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Faculty of Science, 17. listopadu 12, CZ-77146 Olomouc, Czech Republic Geological...
متن کاملIdentification of local multivariate outliers
Abstract The Mahalanobis distance between pairs of multivariate observations is used as a measure of similarity between the observations. The theoretical distribution is derived, and the result is used for judging on the degree of isolation of an observation. In case of spatially dependent data where spatial coordinates are available, different exploratory tools are introduced for studying the ...
متن کاملFiltering Outliers in Bayesian Optimization
Jarno Vanhatalo, Pasi Jylänki, and Aki Vehtari. Gaussian process regression with Student-t likelihood. In NIPS, pages 1910–1918, 2009. Amar Shah, Andrew Gordon Wilson, and Zoubin Ghahramani. Student-t processes as alternatives to Gaussian processes. In AISTATS, pages 877–885, 2014. Anthony O'Hagan. On outlier rejection phenomena in Bayes inference. Journal of the Royal Statistical Society. Seri...
متن کاملA monitoring display of multivariate outliers
A projection pursuit approach is considered for the detection and visualization of multivariate outliers in the wide sense. Such a framework leads to assess the significance of the displays themselves rather than the significance of suspected observations. The necessary mathematical results to implement this strategy are provided in the case of a generalized principal component analysis aimed t...
متن کاملIdentifying Outliers via Clustering for Anomaly Detection
Detecting known vulnerabilities (Signature Detection) is not sufficient for complete security. This has raised recent interest in Anomaly Detection (AD), in which a model is built from normal behavior and signification deviations from this model are flagged anomalous. However, most AD algorithm assumes clean training data, which could be hard to obtain. Our proposed algorithm relaxes. For this,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition Letters
سال: 1998
ISSN: 0167-8655
DOI: 10.1016/s0167-8655(98)00094-4