Abstract

We provide a novel view on group structure in data. Projecting observations onto a subspace spanned by a small selection of observations, we calculate orthogonal distances as a measure for dissimilarity. Sequentially exchanging the observations, used to span the subspace, we receive a series of distances. Observations, taken from a similar group structure will behave similar along those projections. This leads to a visualisation of high dimensional data providing some basic diagnostic on group structures and outliers. The series of distances can be further utilized to perform cluster algorithms, leading to significant improvement when facing clusters located in different subspaces.

Reference

Ortner, T., Filzmoser, P., Brodinova, S., Zaharieva, M., & Breiteneder, C. (2016). Forward Projection for High-Dimensional Data. International Conference COMPUTER DATA ANALYSIS & MODELING, Minsk, Belarus, Non-EU. http://hdl.handle.net/20.500.12708/86324

Projects