Adaptive Retrieval in a P2P Cloud Datacenter - Multimedia Database Retrieval: Technology and Applications

Database Reference

In-Depth Information

A structured P2P network, as shown in Fig. 8.1 c, uses a DHT to link nodes so

that a query can be effectively and efficiently resolved. This will be discussed in the

following section.

8.3

Adaptive Image Retrieval in a Self-organizing

Chord P2P Network

8.3.1

System Architecture

In a distributed system, images are located at participating nodes across the network.

It is impractical for an image query to traverse all the participating nodes due to

the high communication cost. Thus, one way to make a distributed search system

efficient is by performing a pre-computation, a search index before queries are made.

While answering quires, these indices can be used to precisely locate the nodes

serving the matching images without having to communicate with all the nodes.

Instead of storing the entire index in one server, a distributed indexing system

distributes the index to multiple nodes in the network. Thus, the challenge is finding

the right scheme to partition the index across the nodes in the network. A solution is

to partition indexes by cluster identifications (IDs) of an image and store all index

entries of a given cluster ID on a specific node. In this scheme, a user searches for

images using the cluster ID of a query image, while the system utilizes the cluster

ID as a key . To process a query, the system needs to fetch the image list for the key

from the network. Once the list has arrived, the images that appear in the list are the

matching images for the query.

This section presents the Cluster-identification Search System (CSS), that gen-

erates index entries with cluster ID as a key. Figure 8.2 shows the overview of

the CSS, which proceeds in two stages, indexing and query processing. Due to

its simplicity and elegant approach to routing queries, the Chord protocol [ 226 ]

is selected to implement the system. The processing is conducted between the

application layer and the DHT layer. In the indexing stage in Fig. 8.2 a, each node has

a local image database to be shared with other nodes. The image feature extractor

accesses each image in the local database and then performs mapping via SOTM

(discussed in Chap. 3 ). This results in a cluster ID of the best matched SOTM cluster.

The mapping provides an index entry

for the input image, where

clusID is the cluster ID, and the URL contains IP addresses, port numbers, and

image names of the owner node. It also applies to all images in the local database,

resulting in index entries corresponding to various cluster IDs. All index entries

for a given cluster ID will be stored on a particular node. The node is chosen

by hashing the cluster ID, using a hashing function and Chord. Specifically, the

command put(key, data) is the application programing interface (API) that

lets the application layer put a data item (i.e., index entry) in the nodes in the P2P

network. DHash layer works by associating the keys with data items in the nodes.

<

clusID ,URL

>

Multimedia Database Retrieval: Technology and Applications

Search WWH ::

Custom Search

Home