Self-adaptation in Image and Video Retrieval - Multimedia Database Retrieval: Technology and Applications

Database Reference

In-Depth Information

Table 3.7 Summary of competitive learning algorithm for template generation for video

database indexing

Input:

Set of feature vectors extracted from video frames in a given database:

,where h j = h j 1 ,

h jP t

P is the color

histogram vector of the j -th video frame, and H is the number of training

samples.

h 1 ,

h 2 ,...,

h H ]

h ji ]

h j 2 ,...,

∈ R

Output:

The set of weight vectors

= {

g r |

,...,

g r ∈ R

Initialization:

Maximum number of iterations

t f

Learning parameter

= ʷ 0

Weight vectors, g 1 ,

g 2 ,...,

g R

Computation:

h ji ₐ− ( h ji − μ i )

˃ i

(normalized all patterns)

where μ i and ˃ i are the mean value and the standard deviation of the i-th

column vector in H , respectively.

Repeat

Randomly select a pattern h , at iteration t

r ₐ− arg min

h − g r

(classify h )

g r ₐ− g r + ʷ ( h − g r )

(weight update)

ʷ ₐ− ʷ 0 1 −

t f (parameter update)

Until:

t = t f

g 1 , g 2 ,..., g R

Return:

templates that are the common templates among videos in a retrieved set. Then,

those templates considered to be the most significant for reweighting the existing

templates of the initially submitted query are weighted, to improve the ranking

performance. In other words, we allow the templates that are not presented by

the initially submitted query (i.e., w qr =

), but are common among

the top-ranked videos (i.e., the potentially relevant videos), to expand . This results

in reorganization of the degree of importance of the query's templates for better

measurement of video similarity.

The re-ranking process is performed by an adaptive cosine network [ 100 , 101 ],

with the network architecture presented in Fig. 3.13 . The network is composed of

three layers: one for the query templates, one for the video templates, and the third

for the videos themselves. Each node has a connection weight communicated to its

neighbors via the connection links. The query template nodes initiate the inference

process by sending signals to the video template nodes. The video template nodes

then themselves generate signals to the video nodes. Upon receiving this stimulus,

the video nodes, in turn, generate new signals directed back to the video template

nodes. This process might repeat itself several times, through the second and the

∈ [

,...,

]

Multimedia Database Retrieval: Technology and Applications

Search WWH ::

Custom Search

Home