Motivic Pattern Extraction in Symbolic Domain - Intelligent Music Information Systems: Tools and Methodologies

Information Technology Reference

In-Depth Information

the ratio of the number of notes in the pattern to

the total number of notes in the piece that occur

within the region spanned by the pattern. The

“region spanned by a pattern” is defined in a

number of different ways: either as the smallest

“time segment” that contains the pattern, or as

the “bounding-box” or the “convex hull” spanned

by the pattern in the bi-dimensional representa-

tion (time/pitch) of the score. See Figure 1 for

an illustration of these geometrical objects. The

underlying assumption is that typically at least

one occurrence of the pattern should have a high

compactness value, even if the other occurrences

are highly embellished.

For instance, in Mozart's Theme and Varia-

tions on Twinkle, Twinkle, Little Star (K. 300e),

the basic theme common to all the variations does

not contain the pitch repetition and is therefore

reduced to only one note per bar (circled in the

figure). During the exposition of the theme (first

occurrence), its compactness with respect to the

corresponding “time segment” (between crochets)

is very low: 8 notes out of 31, or 26%. The results

are better for “bounding box” (gray rectangle, 8

notes out of 16, or 50%) and for “convex hull”

(black triangle, 8 notes out of 14, or 57%) but

not particularly high. Hence the main theme is

likely to be rejected by the heuristics, unless the

minimum compactness threshold is fixed to a low

value, below 57%. But a low threshold value may

dramatically increase the proportion of false posi-

tives, without necessarily insuring a significant

reduction of false negatives.

(Saussure, 1916). Yet, in general, music takes the

form of a complex polyphonic flux that cannot

easily be reduced to a single string of events. For

instance, the Mozart example, as shown in Figure

2, features two strings, one for each voice 2 . The

adaptation of this approach to polyphony will

require extensive subsequent works.

Following the string paradigm, each voice is

considered separately, and motives are formalized

as substrings of the monodic string that contain

all the notes of the string between two given

limits. In its very simple version, this approach

can only formalize exact repetition of patterns,

transpositions, or similar simple transformations.

Ornamented motives on the contrary,—like in

the Mozart example, where secondary notes are

added in the temporal neighbourhood of the pri-

mary notes (i.e., the notes forming the reduced

theme itself) -, are beyond the compass of this

simple framework. Primary notes may be retrieved

through an automated filtering of secondary notes,

for instance, by focusing on metrically strong posi-

tions (Conklin & Anagnostopoulou, 2001; Eerola,

Järvinen, Louhivuori, & Toiviainen, 2001). This

heuristic does not, however, work correctly for

appoggiaturas, where secondary notes, instead

of primary notes, are placed on downbeats. For

instance, the pattern repetition of the theme could

not be detected in this way in the first variation of

the Twinkle theme (Figure 2, staves 2 and 3).

Another solution (Mongeau & Sankoff, 1990;

Rolland, 1999) consists of tolerating gaps, using

alignment strategies based on dynamic program-

ming. The idea is that a substring may be com-

pared to another substring through a successive

sequence of transformations, such as insertion of

notes from the second substring, suppression of

notes in the initial first substring, modification of

one note, and so forth. With each transformation

is associated a cost, whose summation defines

a global dissimilarity distance between the two

motives, from which identification decision can

be inferred. As explained in the next section, this

decision is based on a dissimilarity threshold,

string-Based formalization

The alternate approach is focused on monodic

sequences, which are formalized as strings of

notes. This follows the idea that music sequences

are perceived as a succession of events closely

connected one after the other. This hypothesis

can be associated with the phenomenological

idea of primary retention (Husserl, 1905) and

the linguistic concept of syntagmatic relation

Intelligent Music Information Systems: Tools and Methodologies

Search WWH ::

Custom Search

Home