Biomedical Engineering Reference
In-Depth Information
Figure 5 . Distribution of 10,000 random numbers, generated according to a log-normal distri-
bution with E [log X ] = 0 and ) (log X ) = 3.
Figure 6. One might hope that it would be easy to tell that this data does not
come from a power law, since there are a rather large number of observations
(5,112), extending over a wide domain (more than four orders of magnitude).
Nonetheless, R 2 is 0.962. This, in and of itself, constitutes a demonstration that
getting a high R 2 is not a reliable indicator that one's data was generated by a
power law. 29
An Illustration: Blogging . An amusing empirical illustration of the diffi-
culty of distinguishing between power laws and other heavy-tailed distributions
is provided by political weblogs, or "blogs"—websites run by individuals or
small groups providing links and commentary on news, political events, and the
writings of other blogs. A rough indication of the prominence of a blog is pro-
vided by the number of other blogs linking to it—its in-degree . (For more on
network terminology, see Part II, chapter 4, by Wuchty, Ravasz and Barabási,
this volume.) A widely read essay by Shirky claimed that the distribution of in-
degree follows a power law, and used that fact, and the literature on the growth
of scale-free networks, to draw a number of conclusions about the social organi-
zation of the blogging community (207). A more recent paper by Drenzer and
Farrell (208), in the course of studying the role played by blogs in general politi-
cal debate, re-examined the supposed power-law distribution. 30 Using a large
population of inter-connected blogs, they found a definitely heavy-tailed distri-
bution which, on a log-log plot, was quite noticeably concave (Figure 7); none-
theless, R 2 for the conventional regression line was 0.898.
Search WWH ::




Custom Search