Tools for Music Information Retrieval and Playing - Intelligent Music Information Systems: Tools and Methodologies

Information Technology Reference

In-Depth Information

In this contest it is difficult to point out the

accuracy rate of the various algorithms because

some widespread dataset and common method-

ological evaluation routine are not yet accepted.

This situation is partly due to the choice of dataset,

which often depends on the goals of the system

(Dixon, 2001).

Various models have been proposed in or-

der to extract the beat from performance data.

The primary distinction we want to point out

is between real-time and batch algorithms. For

example automatic accompaniment systems have

to use real-time algorithms. Transcription and

analysis software tends to process data off-line,

because rhythmically ambiguous sections can

be frequently determined analyzing all the beat

information found in the song. Thus the choice

between real-time and off-line systems it is directly

related to algorithm aim.

Actually the beat tracking system works on a

two-stage model. The first stage is an onset de-

tector, the second one is an interpretative system,

which gets the onset detector output and tries to

understand the tempo of the song and the correct

beat position.

For music with drums to develop an onset

detector system the simplest way is to high pass

the signal and then clustering the filtered signal

and introduce a threshold to the cluster energies.

Obviously this trivial algorithm does not work

very well, but it permits understanding that a

drums beat has a spectral frequency with rich

high components.

With the interpretative system, it is possible

to find many different solutions like the agents

model (Dixon, 2001; Goto, 2001), or probabilistic

systems (Sethares, Sethares, & Morris, 2005)

and so on.

We will present two different solutions to

the interpretative problem that use the agents

model.

In the next sections, four different tempo-

beat tracking algorithms (one solely dedicated to

tempo tracking) will be described. The algorithms

(Dixon 2001; Goto, 2001) work in PCM format,

the algorithms presented in (D'Aguanno, Haus,

& Vercellesi, 2006;Wang, & Vilernmo, 2001) are

dedicated to MP3 standard.

The Goto Algorithm

The first algorithm was developed by Masataka

and Goto (2001). This algorithm is based on the

previous Goto and Marauroka (1998; 1999) works,

one for music with drums and the other for mu-

sic without drums. In Figure 23 is presented the

algorithm scheme.

This algorithm describes a real-time beat-

tracking system that can deal with the audio

signals of popular-music compact discs in real

time regardless of whether or not those signals

contain drum sounds. The system can recognize

the hierarchical beat structure comprising the

quarter-note level (almost regularly spaced beat

times), the half-note level, and the measure level

(Goto, 2001)

The algorithm consists of two components: the

first one extracts the musical elements from audio

signals; the second component tries to understand

the beat structure.

The first step detects three kinds of musical

elements as the beat tracking cues:

•

Onset times

•

Chord changes

•

Drum patterns

These elements are extracted from the fre-

quency spectrum calculated with the FFT (1024

samples) of the input (16 bit / 22.05 kHz) using

the Hanning window. The frequency spectrum is

subdivided into seven critical bands. The onset

times can be detected by a frequency analysis

process that takes into account the rapidity of

an increase in power and the power present in

nearby time frequency bands. The results of this

algorithm are stored in an onset-time vector. By

using autocorrelation and crosscorrelation of the

Intelligent Music Information Systems: Tools and Methodologies

Search WWH ::

Custom Search

Home