Information Technology Reference
In-Depth Information
Fig. 4.1 A diagram outlining the two major components of DARCI. Image analysis learns how to
annotate new images with adjectives using a series of appreciation networks trained with labeled
images (outlined in blue ). Image generation uses a semantic memory model to identify nouns and
adjectives associated with a given concept. The nouns are composed into a source image that is
rendered to reflect the adjectives, using a genetic algorithm that is governed by a set of evaluation
metrics. The final product (outlined in red ) is an image that communicates the given concept
to provide adjectives that describe the image. When users input a word with multiple
senses, they are presented with a list of the available senses, along with the WordNet
gloss, and asked to select the most appropriate one. Additionally, for each image
presented to the user, DARCI lists seven adjectives that it associates with the image.
The user is then allowed to flag those labels that are not accurate.
Learning image to synset associations is a multi-label classification problem [ 19 ],
meaning each image can be associated with more than one synset. To handle this,
we use a collection of artificial neural networks (ANNs) that we call appreciation
networks , each of which outputs a single real value, between 0 and 1, indicating
the degree to which a given image can be described by the network's corresponding
synset (adjective). An appreciation network is created for each synset that has a
sufficient number of training data, and as data are incrementally accumulated, new
neural networks are dynamically added to the collection to accommodate any new
synsets. There are currently close to 300 appreciation networks in the system.
Semantic Memory Model The system also contains a simple cognitive model, built
as a semantic network forming a graph of associations between words [ 20 , 21 ]. These
word associations are acquired in one of two ways: from people and by automatic
inference from a corpus, with the idea being to use the human word associations to
capture general knowledge and then to fill in the gaps using the corpus associations.
For the human word associations, we use two pre-existing databases of free asso-
ciation norms (FANs): the Edinburgh Associative Thesaurus [ 22 ] and the University
of Florida's Word Association Norms [ 23 ]. These word associations were acquired
Search WWH ::




Custom Search