Automatic Bass Line Transcription from Streaming Polyphonic Audio, examples

Author: Matti Ryynänen, homepage
Last modified: September 2006

Transcription method

M. Ryynänen and A. Klapuri, "Automatic Bass Line Transcription from Streaming Polyphonic Audio," in Proc. 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'07), Honolulu, Hawaii, USA, April 2007.

We propose a method for the automatic transcription of the bass line in polyphonic music. The method uses a multiple-F0 estimator as a front-end and this is followed by acoustic and musicological models. The acoustic modeling consists of separate models for bass notes and rests. The musicological model estimates the key and determines probabilities for the transitions between notes using a conventional bigram or a variable-order Markov model. The transcription is obtained with Viterbi decoding through the note and rest models. In addition, a causal algorithm is presented which allows transcription of streaming audio. The method was evaluated using 87 minutes of music from the RWC Popular Music Database. Recall and precision rates of 64% and 60%, respectively, were achieved for discrete note events.

The transcription method produces a MIDI file containing the transcribed bass line. The following sound examples include the acoustic input signal in the left channel and the transcribed bass line (synthesized directly from the output MIDI file) in the right channel.

For each example, we show a score image of the transcription. The score images are automatically generated from the transcription MIDI files by using Lilypond music score typesetter. Please notice that the notes are not temporally quantized so that the score typesetting could be done unambiguously. The score images give some idea on the transcriptions, but they are not accurate in timing and assume time signature to be always 4/4.

RWC music database: popular music

Most of the following transcription examples are from the RWC (Real World Computing) popular music database which contains 100 acoustic recordings of typical pop songs [1]. The same database was used in the development and the evaluation of the transcription method. See RWC music database homepage for additional information on the music performance examples. The authors of the RWC database kindly granted a permission to set the following RWC input music files available.

Other transcription examples

Some other transcription examples are also included. The sound samples are from artists' web pages or web-store sound samples. Please notice that this copyrighted material is here used only for research purposes for short demonstrations.

References

[1] M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Popular, classical, and jazz music databases," in Proc. 3rd International Conference on Music Information Retrieval, Oct. 2002.


Quick links

RWC music database:
HORO (RWC-MDB-P-2001 No. 3)mp3score
Koino Ver.2.4 (RWC-MDB-P-2001 No. 5)mp3score
Tokimeki no syunkan (RWC-MDB-P-2001 No. 20) mp3 score
Feeling In My Heart (RWC-MDB-P-2001 No. 21) mp3 score
Koi ni ochiru jikan ni kansuru kousatsu (RWC-MDB-P-2001 No. 22) mp3 score
What could I do for you (RWC-MDB-P-2001 No. 32) mp3 score
Senro wa tsuzukuyo (RWC-MDB-P-2001 No. 46) mp3 score
Haru ga kurukara (RWC-MDB-P-2001 No. 52) mp3 score
For You (RWC-MDB-P-2001 No. 61) mp3 score
Life (RWC-MDB-P-2001 No. 94) mp3 score
Other song samples:
Billy Sheehan: Under My Thumb mp3 score
Jazzanova: Hanazono mp3 score
Niacin: Slapped Silly mp3 score
Planet X: Ignotus Per Ignotum mp3 score

Transcription examples from RWC music database

HORO (RWC-MDB-P-2001 No. 3)

Koino Ver.2.4 (RWC-MDB-P-2001 No. 5)

Tokimeki no syunkan (RWC-MDB-P-2001 No. 20)

Feeling In My Heart (RWC-MDB-P-2001 No. 21)

Koi ni ochiru jikan ni kansuru kousatsu (RWC-MDB-P-2001 No. 22)

What could I do for you (RWC-MDB-P-2001 No. 32)

Senro wa tsuzukuyo (RWC-MDB-P-2001 No. 46)

Haru ga kurukara (RWC-MDB-P-2001 No. 52)

For You (RWC-MDB-P-2001 No. 61)

Life (RWC-MDB-P-2001 No. 94)


Other transcription examples

Billy Sheehan: Under My Thumb (sample from www.billysheehan.com)

Jazzanova: Hanazono (sample from www.cduniverse.com)

Niacin: Slapped Silly (sample from www.billysheehan.com)

Planet X: Ignotus Per Ignotum (sample from www.virgildonati.com)