Tuomas Virtanen, researcher
Tampere University of Technology
P.O. Box 553, FI-33101 Tampere, FINLAND
Tel. +358 3 3115 4798
tuomas.virtanen@tut.fi
I am a senior researcher at the Department of Signal Processing, Audio Research Group, Tampere University of Technology. My
research topic is computational analysis of audio, especially
sound source separation, which
has several applications in the analysis, editing and
manipulation of audio signals. These include for example structured
audio coding, automatic transcription of music, and noise-robust automatic speech recognition.
I completed my PhD studies in November 2006. My doctoral thesis "Sound Source
Separation in
Monaural
Music Signals"
in pdf format: virtanen_phd.pdf.
Audio demonstrations.
Teaching
Publications
-
A. Mesaros and T. Virtanen. Automatic recognition of lyrics in
singing, EURASIP Journal on Audio, Speech and Music Processing, Volume
2010 (2010), (in press).
- A. Klapuri and T. Virtanen, "Representing Musical Sounds with an
Interpolating State Model," IEEE Trans. Audio, Speech and Language
Processing, to appear.
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj.
Voice Conversion Using Partial Least Squares. IEEE Transactions on
Audio, Speech, and Language Processing, accepted for publication.
- M. Helén and T. Virtanen, Audio query by example using similarity
measures between probability density functions of features, EURASIP
Journal on Audio, Speech and Music Processing, Volume 2010 (2010), (in
press).
- J. F. Gemmeke and T. Virtanen
Noise robust exemplar-based connected digit recognition, to be
presented at the 35th International Conference on Acoustics, Speech, and
Signal Processing (ICASSP),
Dallas, USA, 2010.
- A. Klapuri, T. Virtanen, and T. Heittola. Sound source separation
in monaural music signals using excitation-filter model and EM
algorithm, to be presented at the 35th International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), Dallas, USA, 2010.
- A. Mesaros and T. Virtanen, Recognition of phonemes and words in
singing, to be presented at
the 35th International Conference on Acoustics, Speech, and
Signal Processing (ICASSP),
Dallas, USA, 2010.
- J. Nikunen and T. Virtanen, Noise-to-mask ratio minimization by
weighted non-negative matrix factorization, to be presented at
the 35th International Conference on Acoustics, Speech, and
Signal Processing (ICASSP),
Dallas, USA, 2010.
- T. Heittola, A. Klapuri, and T. Virtanen.
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter
Model for Sound Separation, in Proc. 10th Int. Society for
Music Information Retrieval Conf. (ISMIR 2009), Kobe, Japan, 2009. The
paper won the best paper award of the conference.
- T. Virtanen and T. Heittola.
Interpolating Hidden Markov Model and Its Application to
Automatic Instrument Recognition, presented in ICASSP 2009.
- A. Mesaros. and T. Virtanen.
Adaptation of a singing recognizer for singing voice
, in EUSIPCO 2009.
- T. Virtanen.
Spectral Covariance in Prior Distributions of Non-Negative Matrix
Factorization Based Speech Separation
, in EUSIPCO 2009.
- M. Myllymäki and T. Virtanen.
Non-Stationary Noise Model Compensation in Voice Activity Detection
, in EUSIPCO 2009.
- T. Virtanen and A. T. Cemgil.
Mixtures of Gamma Priors for Non-Negative Matrix
Factorization Based Speech Separation, presented in ICA 2009. © Springer-Verlag. The
publication will become available at springerlink.com.
- T. Virtanen, A. Mesaros, M. Ryynänen. Combining
Pitch-Based Inference and
Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic
Music, SAPA 2008.
- T. Virtanen, A. T. Cemgil, and S. J. Godsill. Bayesian
Extensions to Non-negative Matrix Factorisation for Audio Signal
Modelling, ICASSP 2008. This work was carried out in University of Cambridge, Signal Processing and
Communications Laboratory.
- A. Mesaros and T. Virtanen. Automatic
Alignment of Music Audio and Lyrics, DAFX08.
- M. Myllymäki and T. Virtanen. Voice
Activity Detection
in the Presence of Breathing Noise Using Neural Network and Hidden Markov
Model, EUSIPCO
2008.
- M. Ryynänen, T. Virtanen, J. Paulus, and A. Klapuri, Accompaniment
Separation and Karaoke Application Based on Automatic Melody
Transcription, in Proc. 2008 IEEE International Conference on
Multimedia & Expo (ICME'08), Hannover, Germany, June 2008. (demonstrations)
- A. Klapuri and T. Virtanen, Progress towards automatic music
transcription, In Handbook of Signal Processing in Acoustics, David
Havelock, Sonoko Kuwano, and Michael Vorlander (Eds.), Springer-Verlag,
2008.
- Virtanen, Tuomas., Monaural Sound Source Separation by Non-Negative Matrix
Factorization with Temporal Continuity and Sparseness Criteria,
IEEE Transactions on Audio, Speech, and Language Processing, vol 15, no. 3, March 2007.
- Virtanen, T., Helén, M.,
Probabilistic Model Based Similarity Measures for Audio Query-by-Example
, in proc. WASPAA 2007.
- Mesaros, A., Virtanen, T., Klapuri, A.
Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition
Methods,
International Conference on Music Information Retrieval, Vienna, Austria, 2007.
- Helén, M., Virtanen, T., Query
by Example of Audio signals Using Euclidean Distance Between Gaussian
Mixture Models, in proc. ICASSP 2007. Note: two small
errors in equations (8) - (11) have been corrected. The corrections do
not appear in the ICASSP conference proceedings.
- Helén, M., Virtanen, T.,
A Similarity Measure for Audio Query by Example Based on Perceptual Coding and Compression, in proc. 10th International Conference on Digital Audio Effects (DAFx-07), September 10-15. 2007.
- Virtanen, Tuomas, Monaural
Sound Source Separation by Perceptually Weighted Non-Negative Matrix
Factorization, Technical report, Tampere University of Technology,
Institute of Signal Processing, 2007.
- Virtanen, T., Klapuri, A., Analysis of polyphonic audio using source-filter model and non-negative matrix factorization, in Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop, 2006 (extended abstract).
- Virtanen, Tuomas., Speech Recognition Using Factorial Hidden Markov Models for Separation in the Feature Space, in proc. Interspeech 2006, Pittsburgh, USA. (demonstrations). The second best results among the papers presented in Interspeech 2006 Speech Separation Challenge special session.
- Virtanen, Tuomas. "Unsupervised Learning Methods for Source Separation", in "Signal Processing Methods for Music Transcription", eds. Klapuri, A., Davy, M., Springer-Verlag, 2006.
- Helén, M., Virtanen, T., Separation of Drums From Polyphonic Music Using Non-Negative Matrix Factorization and Support Vector Machine, in proc. 13th European Signal Processing Conference Antalaya, Turkey, 2005.
(demonstrations)
- Klapuri, A., Virtanen, T., Helén, M., Modeling musical sounds with an interpolating state model, in proc. 13th European Signal Processing Conference, Antalya, Turkey, 2005.
- Paulus, J., Virtanen, T., Drum Transcription with Non-negative Spectrogram Factorisation, in proc. 13th European Signal Processing Conference Antalaya, Turkey, 2005
(demonstrations)
- Virtanen, Tuomas,
Separation of Sound Sources by Convolutive Sparse Coding,
ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, SAPA 2004.(demonstrations)
-
M.Helén, T.Virtanen, Perceptually Motivated Parametric Representation for Harmonic Sounds for Data Compression Purposes, 6th International conference on Digital Audio Effects (DAFx-03), 2003, London, UK.
- Virtanen, Tuomas,
Algorithm for the separation of harmonic sounds with
time-frequency smoothness constraint, in proc. the 6th
International Conference on Digital Audio Effects (DAFx-03), London, UK.
- Virtanen, Tuomas,
Sound Source Separation Using Sparse Coding with Temporal
Continuity Objective, International Computer Music
Conference, ICMC 2003.
(demonstrations)
- Parviainen, M., Virtanen, T.,
Two-channel separation of
speech
using direction-of-arrival estimation and sinusoids plus transients
modeling, IEEE International Symposium on Intelligent Signal
Processing and Communication Systems, ISPACS 2003.
- Virtanen, T., Klapuri A.,
Separation of Harmonic Sounds Using Linear Models for the Overtone
Series, IEEE International Conference on Acoustics, Speech and
Signal Processing, ICASSP 2002.
(demonstrations)
- Virtanen, Tuomas,
Accurate Sinusoidal Model Analysis and Parameter Reduction
by Fusion of Components, 110th Audio Engineering Society Convention,
Amsterdam, Netherlands 2001.
- Virtanen, T., Klapuri A.
Separation of Harmonic Sounds Using Multipitch Analysis and Iterative
Parameter Estimation, Proc. IEEE Workshop on Applications of
Signal Processing to Audio and Acoustics, New Paltz, New York, 2001.
(demonstrations)
- Klapuri, A., Virtanen, T., Holm, J.-M.,
Robust multipitch estimation for the analysis and manipulation of
polyphonic musical signals. In Proc. COST-G6 Conference
on Digital Audio Effects, DAFx-00, Verona, Italy, 2000.
- Sillanpää, J., Klapuri, A., Seppänen, J., Virtanen, T.,
Recognition of acoustic noise mixtures by combined bottom-up and
top-down processing. Proceedings of the European Signal
Processing Conference EUSIPCO, 2000.
- Virtanen, T., Klapuri, A.
Separation of Harmonic Sound Sources Using Sinusoidal Modeling,
IEEE International Conference on Acoustics, Speech and Signal Processing,
ICASSP 2000.
(demonstrations)
- Virtanen, Tuomas,
Audio Signal Modeling with Sinusoids Plus Noise, MSc
thesis, Tampere University of Technology 2001.
(demonstrations 1,
demonstrations 2)
IEEE-Copyrighted Material:
Personal use of this material is permitted. However, permission to
reprint/republish this material for advertising or promotional
purposes or for creating new collective works for resale or
redistribution to servers or lists, or to reuse any copyrighted
component of this work in other works, must be obtained from the
IEEE. Contact: Manager, Copyrights and Permissions / IEEE Service
Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ 08855-1331,
USA. Telephone: +Intl. 908-562-3966.
- Tuomas Virtanen, tuomas.virtanen@tut.fi