TY - JOUR
T1 - Why Does a Hilbertian Metric Work Efficiently in Online Learning with Kernels?
AU - Yukawa, Masahiro
AU - Muller, Klaus Robert
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2016/10
Y1 - 2016/10
N2 - The autocorrelation matrix of the kernelized input vector is well approximated by the squared Gram matrix (scaled down by the dictionary size). This holds true under the condition that the input covariance matrix in the feature space is approximated by its sample estimate based on the dictionary elements, leading to a couple of fundamental insights into online learning with kernels. First, the eigenvalue spread of the autocorrelation matrix relevant to the hyperplane projection along affine subspace algorithm is approximately a square root of that for the kernel normalized least mean square algorithm. This clarifies the mechanism behind fast convergence due to the use of a Hilbertian metric. Second, for efficient function estimation, the dictionary needs to be constructed in general by taking into account the distribution of the input vector, so as to satisfy the condition. The theoretical results are justified by computer experiments.
AB - The autocorrelation matrix of the kernelized input vector is well approximated by the squared Gram matrix (scaled down by the dictionary size). This holds true under the condition that the input covariance matrix in the feature space is approximated by its sample estimate based on the dictionary elements, leading to a couple of fundamental insights into online learning with kernels. First, the eigenvalue spread of the autocorrelation matrix relevant to the hyperplane projection along affine subspace algorithm is approximately a square root of that for the kernel normalized least mean square algorithm. This clarifies the mechanism behind fast convergence due to the use of a Hilbertian metric. Second, for efficient function estimation, the dictionary needs to be constructed in general by taking into account the distribution of the input vector, so as to satisfy the condition. The theoretical results are justified by computer experiments.
KW - Kernel adaptive filter
KW - online learning
KW - reproducing kernel Hilbert space (RKHS)
UR - http://www.scopus.com/inward/record.url?scp=84990185861&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84990185861&partnerID=8YFLogxK
U2 - 10.1109/LSP.2016.2598615
DO - 10.1109/LSP.2016.2598615
M3 - Article
AN - SCOPUS:84990185861
SN - 1070-9908
VL - 23
SP - 1424
EP - 1428
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
IS - 10
M1 - 7536151
ER -