Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

Shinsuke Okita, Yasue Mitsukura, Nozomu Hamada

研究成果: Conference contribution

抄録

For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on 'viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.

本文言語English
ホスト出版物のタイトルProceedings - 2013 IEEE Conference on Systems, Process and Control, ICSPC 2013
出版社IEEE Computer Society
ページ62-67
ページ数6
ISBN(印刷版)9781479922093
DOI
出版ステータスPublished - 2013
イベント2013 IEEE Conference on Systems, Process and Control, ICSPC 2013 - Kuala Lumpur, Malaysia
継続期間: 2013 12月 132013 12月 15

出版物シリーズ

名前Proceedings - 2013 IEEE Conference on Systems, Process and Control, ICSPC 2013

Other

Other2013 IEEE Conference on Systems, Process and Control, ICSPC 2013
国/地域Malaysia
CityKuala Lumpur
Period13/12/1313/12/15

ASJC Scopus subject areas

  • 制御およびシステム工学

フィンガープリント

「Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル