Last week I attended ICASSP 2011, which was held in Prague, and presented a poster there. As always, it was a conference full of people, well-organized, in a nice place and with many parallel sessions. This year's edition had a strong presence of European researchers (Europe, Middle East and Africa had 1236 submissions compared to the 914 of Asia and Australia or the 701 from the USA), and specially of French people (227 submissions). Curiously, Spain was the 9th country in number of submissions (98; USA, China and France were the three first). This is quite remarkable if we think on the percentage this represents over the population of each country (are we producing too much PhDs in Spain?).
As well as in 2010, the conference had 3 sessions entirely devoted to music signal processing (1 oral + 2 poster sessions). In fact, music signal processing was explicitly acknowledged to be one of the rising topics within the IEEE Signal Processing community. Apart from music processing, source separation and sparsity-related stuff were quite well represented. Across the whole conference I could spot some interesting papers related to music, for example:
- Yun Wang & Zijian Ou, "Combining HMM-based melody extraction and NMF-based soft masking for separating voice and accompaniment from monaural audio".
- Jinyu Han & Wei Chen, "Improving melody extraction using probabilistic latent component analysis".
- Thierry Bertin-Mahieux, Graham Grindlay, Ron J. Weiss & Daniel P. W. Ellis, "Evaluating music sequence models through missing data".
- Other posters of AASP-P2 that I couldn't check because I had to stay with mine.
- Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang & Shou-De Lin, "Cost-sensitive stacking for audio tag annotation and retrieval".
- Naoki Yasuraoka, Hirokazu Kameoka, Takuya Yoshioka & Hiroshi G. Okuno, "I-divergence-based dereverberation method with auxilliary function approach".
A new thing this year seemed to be the "Trends in XXX" sessions, which were half-hour sessions where three experts in a certain field discussed about the trends of such field. As always, this kind of sessions are very interesting for people outside the field, but I doubt they brought something new to people inside a particular field.
Last but not least, there was a very stimulating special session on "Innovative Representations of Audio", with all papers worth reading (I really enjoyed it!). In particular, I would highlight:
- Nima Mesgarani & Shihab Shamma, "Speech processing with a cortical representation of audio".
- Richard Lyon, Jay Ponte & Gal Chechik, "Sparse coding of auditory features for machine hearing in interference".
- Paris Smaragdis, "Approximate nearest-subspace representations for sound mixtures".