TY - GEN
T1 - Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition
AU - Okita, Shinsuke
AU - Mitsukura, Yasue
AU - Hamada, Nozomu
PY - 2013
Y1 - 2013
N2 - For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on 'viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.
AB - For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on 'viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.
KW - image processing
KW - pattern recognition
KW - visemes
KW - visual speech recognition
UR - http://www.scopus.com/inward/record.url?scp=84897786557&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84897786557&partnerID=8YFLogxK
U2 - 10.1109/SPC.2013.6735104
DO - 10.1109/SPC.2013.6735104
M3 - Conference contribution
AN - SCOPUS:84897786557
SN - 9781479922093
T3 - Proceedings - 2013 IEEE Conference on Systems, Process and Control, ICSPC 2013
SP - 62
EP - 67
BT - Proceedings - 2013 IEEE Conference on Systems, Process and Control, ICSPC 2013
PB - IEEE Computer Society
T2 - 2013 IEEE Conference on Systems, Process and Control, ICSPC 2013
Y2 - 13 December 2013 through 15 December 2013
ER -