Abstract
A new boosting method for a kind of noisy data is developed, where the probability of mislabeling depends on the label of a case. The mechanism of the model is based on a simple idea and gives natural interpretation as a mislabel model. The boosting algorithm is derived from an extension of the exponential loss function, which provides the AdaBoost algorithm. A connection between the proposed method and an asymmetric mislabel model is shown. It is also shown that the loss function proposed constructs a classifier which attains the minimum error rate for a true label. Numerical experiments illustrate how well the proposed method performs in comparison to existing methods.
Original language | English |
---|---|
Pages (from-to) | 203-218 |
Number of pages | 16 |
Journal | Computational Statistics |
Volume | 27 |
Issue number | 2 |
DOIs | |
Publication status | Published - 2012 Jun |
Externally published | Yes |
Keywords
- Asymmetric mislabeling mechanism
- Bayes error rate
- Boosting method
- Robustness
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty
- Computational Mathematics