Faculty of Engineering and Natural Sciences
Probabilistic Models and Inference for Acoustic Processing
A. Taylan Cemgil
Signal Processing and Communications Laboratory, Dept. of Engineering,
University of Cambridge, UK
The analysis of audio signals is central to the scientific understanding of human hearing abilities and in a broad spectrum of engineering applications ranging from sound localisation, to hearing aids or music information retrieval. Historically, the main mathematical tools are from signal processing: digital filtering theory, system identification and various transform methods such as Fourier techniques. In recent years, there is an increasing interest for statistical approaches and tools from machine learning.
The application of statistical techniques is quite natural: acoustical time series can be conveniently modelled using hierarchical signal models by incorporating prior knowledge from various sources: from physics or studies of human cognition and perception. Once a realistic hierarchical model is constructed, many tasks such as coding, analysis, restoration, transcription, separation, identification or resynthesis can be formulated consistently as Bayesian posterior inference problems.
In this talk, I will sketch my past and current work on audio and music signal analysis. In particular, I will focus on factorial switching state space models and illustrate how using this formalism realistic generative signal models can be developed for many problems in audio and processing, such as transcription, restoration or source separation. In this model class, certain changepoint models admit exact inference, otherwise efficient algorithms based on variational and stochastic approximation methods can be developed.
December 28, 2006, 14:40, FENS G032