Abstract
Hubness has been recently identified as a problematic phenomenon occurring in high-dimensional space. In this paper, we address a different type of hubness that occurs when the number of samples is large. We investigate the difference between the hubness in highdimensional data and the one in large-sample data. One finding is that centering, which is known to reduce the former, does not work for the latter. We then propose a new hub-reduction method, called localized centering. It is an extension of centering, yet works effectively for both types of hubness. Using real-world datasets consisting of a large number of documents, we demonstrate that the proposed method improves the accuracy of knearest neighbor classification.
Original language | English |
---|---|
Title of host publication | Proceedings of the 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015 |
Publisher | AI Access Foundation |
Pages | 2645-2651 |
Number of pages | 7 |
Volume | 4 |
ISBN (Electronic) | 9781577357025 |
Publication status | Published - 2015 Jun 1 |
Externally published | Yes |
Event | 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015 - Austin, United States Duration: 2015 Jan 25 → 2015 Jan 30 |
Other
Other | 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015 |
---|---|
Country/Territory | United States |
City | Austin |
Period | 15/1/25 → 15/1/30 |
ASJC Scopus subject areas
- Software
- Artificial Intelligence