Melody, bassline and harmony representations for music version identification

19/4/2012

In this paper we compare the use of different musical representations for the task of version identification (i.e. retrieving alternative performances of the same musical piece). We automatically compute descriptors representing the melody and bass line using a state-of-the-art melody extraction algorithm, and compare them to a harmony-based descriptor. The similarity of descriptor sequences is computed using a dynamic programming algorithm based on nonlinear time series analysis which has been successfully used for version identification with harmony descriptors. After evaluating the accuracy of individual descriptors, we assess whether performance can be improved by descriptor fusion, for which we apply a classification approach, comparing different classification algorithms. We show that both melody and bass line descriptors carry useful information for version identification, and that combining them increases version detection accuracy. Whilst harmony remains the most reliable musical representation for version identification, we demonstrate how in some cases performance can be improved by combining it with melody and bass line descriptions. Finally, we identify some of the limitations of the proposed descriptor fusion approach, and discuss directions for future research.

J. Salamon, J. Serrà, and E. Gómez. Melody, bassline, and harmony representations for music version identification. Proc. of the Int. World Wide Web Conf., Workshop on Advances in Music Information Research (AdMIRe), pp. 887-894. Lyon, France. April 2012.

Melody, bassline and harmony representations for music version identification

Archives