Multi-Layer Combined Frequency and Periodicity Representations for Multi-Pitch Estimation of Multi-Instrument Music

Tomoki Matsunaga, Hiroaki Saito

Research output: Article (peer-reviewed)

1 citation (Scopus)

Abstract

Multi-pitch estimation (MPE) is one of the most important tasks in automatic music transcription (AMT). Since music generally involves a wide variety of instruments, MPE should be applicable to multi-instrument music. The combined frequency and periodicity (CFP) approach detects pitches in multi-instrument music by comparing the frequency-domain spectrum with the quefrency-domain cepstrum. CFP treats the positions where peaks of the spectrum and the cepstrum coincide, after both are mapped to the equal-tempered scale, as pitches; however, its pitch selection method is not well justified when dealing with stacked harmonics. In this paper, we propose an unsupervised method that effectively detects pitches from stacked harmonics by extending CFP with partial cepstra extracted by suitably liftering the cepstrum. The partial cepstra turn the frequency-domain and quefrency-domain features into multiple layers; the proposed method is therefore constructed as a multi-layer CFP (ML-CFP). We compare ML-CFP with existing state-of-the-art MPE methods on one single-instrument and three multi-instrument datasets and demonstrate that ML-CFP provides the best overall performance among unsupervised methods. In addition, the generalizability of ML-CFP with respect to the degree of polyphony, duration scale, and instrument type is evaluated on another large-scale multi-instrument dataset. The results show how performance varies with each of these measures and indicate the limitations on the musical properties of a signal that must hold for ML-CFP to perform well.
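
The basic CFP idea described in the abstract (keep a pitch only where a spectral peak and a cepstral peak agree on the equal-tempered scale) can be illustrated with a minimal single-frame sketch. This is not the authors' ML-CFP: it uses a plain real cepstrum, applies no liftering into partial cepstra, and has a single layer. The function names `harmonic_tone` and `cfp_pitch_salience`, the MIDI range, the frame length, and the synthetic test tone are all assumptions made for illustration; the abstract does not specify the exact features used in the paper.

```python
import numpy as np

def harmonic_tone(f0, sr, n, n_harmonics=17):
    """Synthetic tone with harmonics k*f0 at amplitude 1/k (illustrative only).
    The caller must keep n_harmonics * f0 below the Nyquist frequency sr / 2."""
    t = np.arange(n) / sr
    return sum(np.sin(2 * np.pi * f0 * k * t) / k
               for k in range(1, n_harmonics + 1))

def cfp_pitch_salience(frame, sr, midi_lo=36, midi_hi=96):
    """Simplified CFP-style salience for one frame (hypothetical helper).

    Frequency feature: magnitude spectrum sampled at each candidate f0.
    Periodicity feature: real cepstrum sampled at quefrency 1/f0.
    A candidate scores high only when both features are large.
    """
    n = len(frame)
    spectrum = np.abs(np.fft.rfft(frame * np.hanning(n)))
    # Real cepstrum: inverse FFT of the log-magnitude spectrum.
    cepstrum = np.fft.irfft(np.log(spectrum + 1e-8), n=n)
    cepstrum = np.maximum(cepstrum, 0.0)  # keep positive periodicity evidence only

    # Equal-tempered candidates, one per MIDI note number.
    midis = np.arange(midi_lo, midi_hi + 1)
    f0s = 440.0 * 2.0 ** ((midis - 69) / 12.0)

    freq_bins = np.round(f0s * n / sr).astype(int)  # nearest FFT bin of f0
    quef_bins = np.round(sr / f0s).astype(int)      # nearest period in samples

    # Harmonics of a true pitch show up in the spectrum but not in the cepstrum;
    # subharmonics show up in the cepstrum but not in the spectrum.
    salience = spectrum[freq_bins] * cepstrum[quef_bins]
    return midis, salience

sr, n = 16000, 4096
tone = harmonic_tone(440.0, sr, n)  # A4, MIDI note 69
midis, salience = cfp_pitch_salience(tone, sr)
print("strongest MIDI pitch:", int(midis[np.argmax(salience)]))
```

The product of the two features is what suppresses octave errors in this sketch: the second harmonic at 880 Hz is strong in the spectrum but has no matching cepstral peak, while the subharmonic at 220 Hz has a cepstral peak but no spectral energy. A real system would process many frames and select several pitches per frame rather than a single maximum.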

Original language: English
Pages (from-to): 3171-3184
Number of pages: 14
Journal: IEEE/ACM Transactions on Audio Speech and Language Processing
Volume: 32
DOI
Publication status: Published - 2024

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Acoustics and Ultrasonics
  • Computational Mathematics
  • Electrical and Electronic Engineering
