Towards Validating Social Network Simulations - Advances in Social Simulation

Information Technology Reference

In-Depth Information

3

Social Network Analysis Measures

Using social network analysis (SNA) measures to evaluate graphs that are output

from simulations is common. However, a single measure (or a small set of measures)

is unlikely to establish the validity of a simulated network. When a particular measure

captures some aspect that is crucial to the kind of process being investigated, the im-

plicit assumption is that the closer the value of the measure on the synthetic graph is

to that of the reference graph, the more similar the two graphs will be in that respect.

How well this works depends upon what we mean by the 'closeness' of a measure

when comparing networks. For instance, [21] reports on that centrality measures de-

grade gracefully with the introduction of random changes in the network, however,

this result might depend on the topology of the network [27]. Below, we briefly pre-

view some of the most popular approaches [32, 37] used by agent-based modelers.

For a more detailed review on networks in agent-based simulations, we refer to [5, 6],

and for formal definitions and extensive discussions of network measures [31, 37].

Node Degree Distribution: is the number of edges each node has in a graph. For a

directed graph, this includes incoming, outgoing and total edges; while for undirected

graphs, it means just the total edges of each node has. In order to determine the node

degree distribution of a network, one can measure its deviance from another distribu-

tion using such measures, such as the Least Square Error (LSE) or the Kolmogorov-

Smirnov statistic. When a theoretical distribution is assumed, against which the distri-

bution of a simulated network is compared, maximum likelihood estimation (MLE)

techniques are useful. The degree distribution gives a good idea of the prevalence of

different kinds of node but can be inadequate for validation purposes (e.g., [35]).

Assortativity Mixing: assortative mixing means that nodes with similar degrees tend

to be connected with each other, calculated by working out the correlation of the de-

grees of all connected node pairs in the graph. It ranges from -1 to 1: a positive value

indicating that nodes tend to connect with others with similar degrees and a negative

value otherwise [7]. Usually, the averaged value of all node pairs is calculated for the

whole graph for comparison - known as the global clustering coefficient. A value

approaching 1 would result from a graph where there are uniformly densely intercon-

nected nodes and other areas that are uniformly but sparsely connected. A negative

value might result from a graph where nodes with a high degree are distributed away

from each other. This measure is useful when the local uniformity of degree distribu-

tion (or otherwise) is being investigated.

Cluster Coefficient: is the ratio of the number of edges that exist between a node's

immediate neighbors and the maximum number of edges that could exist between

them. If all of one's friends know one another this would result in a cluster coefficient

of 1; in the other extreme, where none knew each other, would result in a value of 0.

This measure is important, for instance, if one had hypothesized that, the process of

making new friends is driven by the “friend of a friend”. The local cluster coefficients

Search WWH ::

Custom Search

Home