Finding the number of disparate clusters with background contamination

Atkinson, A. C., Cerioli, A., Morelli, G. & Riani, M. (2015). Finding the number of disparate clusters with background contamination. In Lausen, B., Krolak-Schwerdt, S. & Böhmer, M. (Eds.), Data Science, Learning by Latent Structures, and Knowledge Discovery (pp. 29-42). Springer Berlin / Heidelberg. https://doi.org/10.1007/978-3-662-44983-7

The Forward Search is used in an exploratory manner, with many random starts, to indicate the number of clusters and their membership in continuous data. The prospective clusters can readily be distinguished from background noise and from other forms of outliers. A confirmatory Forward Search, involving control on the sizes of statistical tests, establishes precise cluster membership. The method performs as well as robust methods such as TCLUST. However, it does not require prior specification of the number of clusters, nor of the level of trimming of outliers. In this way it is “user friendly”.
