Online sound structure analysis based on generative model of acoustic feature sequences

Keisuke Imoto, Nobutaka Ono, Masahiro Niitsuma, Yoichi Yamashita

研究成果: Conference contribution

抄録

We propose a method for the online sound structure analysis based on a Bayesian generative model of acoustic feature sequences, with which the hierarchical generative process of the sound clip, acoustic topic, acoustic word, and acoustic feature is assumed. In this model, it is assumed that sound clips are organized based on the combination of latent acoustic topics, and each acoustic topic is represented by a Gaussian mixture model (GMM) over an acoustic feature space, where the components of the GMM correspond to acoustic words. Since the conventional batch algorithm for learning this model requires a huge amount of calculation, it is difficult to analyze the massive amount of sound data. Moreover, the batch algorithm does not allow us to analyze the sequentially obtained data. Our variational Bayes-based online algorithm for this generative model can analyze the structure of sounds sound clip by sound clip. The experimental results show that the proposed online algorithm can reduce the calculation cost by about 90% and estimate the posterior distributions as efficiently as the conventional batch algorithm.

本文言語English
ホスト出版物のタイトルProceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
出版社Institute of Electrical and Electronics Engineers Inc.
ページ1316-1321
ページ数6
ISBN(電子版)9781538615423
DOI
出版ステータスPublished - 2018 2月 5
外部発表はい
イベント9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017 - Kuala Lumpur, Malaysia
継続期間: 2017 12月 122017 12月 15

出版物シリーズ

名前Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
2018-February

Other

Other9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
国/地域Malaysia
CityKuala Lumpur
Period17/12/1217/12/15

ASJC Scopus subject areas

  • 人工知能
  • 人間とコンピュータの相互作用
  • 情報システム
  • 信号処理

フィンガープリント

「Online sound structure analysis based on generative model of acoustic feature sequences」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル