An improved speech/nonspeech classification based on feature combination for audio indexing

Ji Soo Keum, Hyon Soo Lee, Masafumi Hagiwara

研究成果: Article査読

抄録

In this letter, we propose an improved speech/nonspeech classification method to effectively classify a multimedia source. To improve performance, we introduce a feature based on spectral duration analysis, and combine recently proposed features such as high zero crossing rate ratio (HZCRR), low short time energy ratio (LSTER), and pitch ratio (PR). According to the results of our experiments on speech, music, and environmental sounds, the proposed method obtained high classification results when compared with conventional approaches.

本文言語English
ページ(範囲)830-832
ページ数3
ジャーナルIEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
E93-A
4
DOI
出版ステータスPublished - 2010 4月

ASJC Scopus subject areas

  • 信号処理
  • コンピュータ グラフィックスおよびコンピュータ支援設計
  • 電子工学および電気工学
  • 応用数学

フィンガープリント

「An improved speech/nonspeech classification based on feature combination for audio indexing」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル