Abstract
A set of n-principal points of a distribution is defined as a set of n points that optimally represent the distribution in terms of mean squared distance. It provides an optimal n-point-approximation of the distribution. However, it is in general difficult to find a set of principal points of a multivariate distribution. Tarpey et al. [T. Tarpey, L. Li, B. Flury, Principal points and self-consistent points of elliptical distributions, Ann. Statist. 23 (1995) 103-112] established a theorem which states that any set of n-principal points of an elliptically symmetric distribution is in the linear subspace spanned by some principal eigenvectors of the covariance matrix. This theorem, called a "principal subspace theorem", is a strong tool for the calculation of principal points. In practice, we often come across distributions consisting of several subgroups. Hence it is of interest to know whether the principal subspace theorem remains valid even under such complex distributions. In this paper, we define a multivariate location mixture model. A theorem is established that clarifies a linear subspace in which n-principal points exist.
Original language | English |
---|---|
Pages (from-to) | 213-224 |
Number of pages | 12 |
Journal | Journal of Multivariate Analysis |
Volume | 102 |
Issue number | 2 |
DOIs | |
Publication status | Published - 2011 Feb |
Externally published | Yes |
Keywords
- Location mixture
- Mean squared distance
- Primary
- Principal points
- Secondary
- Self-consistency
- Spherically symmetric distribution
ASJC Scopus subject areas
- Statistics and Probability
- Numerical Analysis
- Statistics, Probability and Uncertainty