Motion Database Retrieval with Application to Gesture Recognition in a Virtual Reality Dance Training System - Multimedia Database Retrieval: Technology and Applications - page 313

Database Reference

In-Depth Information

Table 11.5 Gesture

recognition results averaged

over the 30 gestures defined

in Table 11.2

Average recognition accuracy

Testing data L1 L2 HI

Teacher PO 77.7 74.3 77.7

PSC 58.0 61.3 57.7

PT 96.0 79.3 96.0

PTSC 83.0 84.3 83.3

Student PO 66.7 66.3 66.7

PSC 54.7 56.0 54.7

PT 88.3 73.3 88.3

PTSC 79.7 83.0 76.7

These include the reversals of the gestures

dataset can reach 96 %. However, the system has a poorer performance at about 88 %

for recognition of the Student dataset. This may be because the dance sequences

performed by the student may be inconsistent, as compared to the teacher.

11.6

Online Gesture Recognition and Segmentation

In order to perform matching between an incoming gesture and known templates,

the incoming set of postures is projected onto the SSOM to extract the unknown

posture sequence S

. This projection is conducted

online as the student is performing a set of moves. The task of recognition is non-

trivial, due to the differing lengths of gestures (across classes), and the differing

speeds with which they may be enacted (by the student/teacher). In order to address

this, an online probabilistic framework demonstrated in [ 352 ] is adopted. The

standard Bayesian framework is utilised for progressively estimating an updated

posterior probability P

=(

u 1 ,...,

u t ,...,

u T ) ,

t

∈ [

1

,

T

]

K gesture classes. The

likelihood is computed at each unit of time by considering the single posture

triggered on the map, and whether or not it occurred in each gesture template. In

[ 352 ], the likelihood P

(

k

|

S

)

for each of the k

=

1

,...,

was computed as the ratio of the existence of the current

posture in gesture class k , to the total number of different postures in class k .Inthe

current work, the likelihood is reframed as a histogram intersection [Eq. ( 11.17 )],

between a progressively growing sequence S (inclusive of postures from time t 0

to t ), which may be described as a histogram of either: PS, PO, PT or PTSC (defined

in Sect. 11.5 ), versus the corresponding templates for each gesture class.

Let h s be the input histogram for the current sample at time t , and h k to be the

reference template for the class k . We thus define (for time t ), the posterior P t (

(

S

|

k

)

k

|

h s )

,

likelihood P t (

h s |

k

)

, and prior probabilities P t (

k

)

according to the following:

P t (

h s |

k

)

P t (

k

)

P t (

h s |

k

)

P t (

k

)

P t (

k

|

h s )=

=

(11.14)

P t (

h s )

∑

P t (

h s |

k

)

P t (

k

)

Next Page

Multimedia Database Retrieval: Technology and Applications

Search WWH ::

Custom Search

Home