Information Technology Reference
In-Depth Information
that represents movie stars as vertices and edges connecting them if they ever
starred in a movie together. A version of the Hollywood graph in 2001 represents
355,848 actors and actresses from 170,479 movies. In this graph, the focus is on
the centrality of Hollywood actor Kevin Bacon in the film industry. This Hollywood
graph has gained widespread publicity, partly because researchers have found a way
to replicate some key characteristics of this graph (Watts and Strogatz 1998 ). Brett
Tjaden and Glenn Wasson of the University of Virginia maintain TheOracleof
Bacon on the Web that calculates Bacon numbers.
Small-world networks are defined by three properties: sparseness, clustering, and
small diameter (Watts 1999 ). Sparseness means that the graph has relatively few
edges. Clustering means that edges are not uniformly distributed among vertices;
instead there tend to be clumps in the graph. Small diameter means that the longest
shortest path across the graph is small.
In 1998, Duncan Watts and Steven Strogatz of Cornell University found these
properties in the Hollywood graph and several other huge graphs have (Watts and
Strogatz 1998 ). Watts and Strogatz used a rewiring strategy to produce a graph
somewhere between a random graph and a regular graph. The rewiring process
started with a regular lattice and then rewired some of the edges according to a
probability p , ranging from 0 to 1. If p is equal to 0, then everything remains
unchanged. If p is equal to 1, then every edge is re-arranged randomly and the lattice
becomes a random graph. They calculated the minimum path length L averaged
over all pairs of vertices and found that L dropped dramatically when just a few of
the edges were rewired. Watts and Strogatz also measured the degree of clustering
in their hybrid graphs using a clustering coefficient C . They found the clustering
coefficient C remained high until the rewiring probability was rather large. The
Hollywood graph demonstrated a good match to their model.
3.5.4
Semantic Networks
Semantic networks are useful tools as representations for semantic knowledge and
inference systems. Historically, semantic networks refer to the classic network the-
ory of Collins and Quillian ( 1969 ) in which concepts are represented as hierarchies
of interconnected nodes with nodes linked to certain attributes. It is important to
understand the organization of large-scale semantic networks. By applying graph-
theoretic analyses, the large-scale structure of semantic networks can be specified
by distributions over a few variables, such as the length of the shortest path between
two words and the number of connections per word. Researchers have shown that
the large-scale organization of semantic networks reveals a small-world structure
that is very similar to the structure of several other real-life networks such as the
neural network of the worm C. elegans, the collaboration network of film actors
and the WWW. We have seen examples of Erdos numbers and Bacon numbers. We
return to C. elegans in later chapters for an example of gene expression visualization
of C. elegans.
Search WWH ::




Custom Search