Multi-Layer Combined Frequency and Periodicity Representations for Multi-Pitch Estimation of Multi-Instrument Music

Tomoki Matsunaga, Hiroaki Saito

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Multi-pitch estimation (MPE) is one of the most important tasks in automatic music transcription (AMT). Since music generally involves a wide variety of instruments, MPE should be applied to multi-instrument music. The combined frequency and periodicity (CFP) approach detects pitches in multi-instrument music by comparing the frequency-domain spectrum with the quefrency-domain cepstrum. Although CFP considers the matching peak positions of the spectrum and cepstrum mapped according to the equal-tempered scale as pitches, its pitch selection method lacks sufficient rationality when dealing with stacked harmonics. In this paper, we propose an unsupervised method to effectively detect pitches from the stacked harmonics by extending CFP with partial cepstra extracted by suitably liftering the cepstrum. The frequency-domain and quefrency-domain features are multilayered by the partial cepstra; thus, the proposed method is constructed as a multi-layer CFP (ML-CFP). We compare the proposed ML-CFP with existing state-of-the-art MPE methods on one single-instrument and three multi-instrument datasets and demonstrate that ML-CFP provides the best overall performance among unsupervised methods. In addition, the generalizability of ML-CFP in terms of the degree of polyphony, duration scale, and instrument type is evaluated on another large-scale multi-instrument dataset. The results reveal the performance differences for different values of each measure with the limitations on musical properties of music signals required for ML-CFP to perform well.

Original languageEnglish
Pages (from-to)3171-3184
Number of pages14
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume32
DOIs
Publication statusPublished - 2024

Keywords

  • Automatic music transcription
  • multi-pitch estimation
  • music signal processing
  • partial cepstrum

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Acoustics and Ultrasonics
  • Computational Mathematics
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Multi-Layer Combined Frequency and Periodicity Representations for Multi-Pitch Estimation of Multi-Instrument Music'. Together they form a unique fingerprint.

Cite this