AndrianiPDW2010

advertisement
Academy of Management
Montreal , 6 August 2010
Empirical Exploration of
Complexity in Human Systems:
Data Collection & Interpretation Techniques
Power law statistics and Pareto Science
Pierpaolo Andriani
Durham Business School, University of Durham, UK
LIKELIHOOD distribution:
sampling & inference
Statistical inference:
Drawing conclusions
about the whole
population on the basis
of a sample
Precondition for
statistical inference:
A sample is randomly
selected from the
population (=probability
sample)
PROB (data given population)
take a
sample
population
mean
variance
etc.
size is
infinite
Representative
agent links
population to
sample level and
allows reduction
of population
complexity to
single agent
complexity
infer a
value
INFERENTIAL distribution:
PROB (population given data)
sample
sample mean
sample
variance
etc.
sample size
From Starbuck: The production of knowledge (2006)
Starbucks: The production of knowledge (2006)
• Consensus favoring use of null-hypothesis significance tests
affords a clear example of paradigm stability. Although
methodologists have been trying to discourage the use of
these tests since the 1950s, the tests have remained very
prevalent, and there is no sign that social scientists are
shifting to other criteria. …. Hubbard and Ryan (2000: 678)
concluded: ‘It seems inconceivable to admit that a
methodology as bereft of value as SST (statistical significance
tests) has survived, as the centerpiece of inductive inference
no less, more than four decades of criticism in the
psychological literature’.
p. 77
Starbucks: The production of knowledge (2006)
• Choosing two variables utterly at random, a researcher has 2-to-1 odds
of finding a significant correlation on the first try, and 24-to-1 odds of
finding a significant correlation within three tries. … the main inference I
drew from these statistics was that the social sciences are drowning in
statistically significant but meaningless noise. Because the differences
and correlations that social scientists test have distributions quite different
from those assumed in hypothesis test, social scientists are using tests that
assign statistical significance to confounding background relationships.
Because social scientists equate statistical significance with meaningful
relationships, they often mistake confounding background relationships for
theoretically important information. One result is that social science
research creates a cloud of statistically significant differences and
correlations that not only have no real meaning but also impede
scientific progress by obscuring the truly meaningful relationships.
p. 49
Starbucks: The production of knowledge (2006)
• I began to think of statistical tests as arcane
rituals that demonstrate membership in an
esoteric subculture
p. 18
Gaussian
A tale of two worlds
World
Gaussian
Paretian
Unit of analysis
‘Things’, entities
Relations
Relations btw UoA
Independence
(or
Paretian
Interdependence
weak interdepencdence)
Variability
Limited
Unbounded
Scaling property
Phenomena are fractal
Property of world
Phenomena have proper
scale
Closure
Statistics
Bell distribution
Pareto
(finite variance distributions)
Scientific ‘approach’
Linear science
Openness
(infinite variance)
Non-linear science
(principle of superposition)
Philosophical origin
Parmenides, Plato, Newton
Eraclitus, Aristotile
Bell curve distribution
Power-law distribution
of node linkages
of node linkages
Number of links
Exponential Network
Number of nodes (log scale)
No large number
Number of nodes
Number of nodes
Typical node
Number of links
Number of links (log scale)
Scale-free Network
From Barabasi/Bonabeau, Scientific American, May 2003
By assuming finite variability and
compressing data around mean/variance,the
Gaussian approach
but also ignores
or downplays
tiny initiating
events on the
left hand side of
the distribution
ignores or
downplays
extreme events
on the right
hand side of the
distribution
http://www.zazzle.com/statisticians_do_it_within_3_standard_deviations_
tshirt-235087605979353103
Rationality, stock market and the
butterfly effect
Growth-related power laws - ratio
imbalances
1
Surface /
volume Law
Organisms; villages: In organisms, surfaces absorbing energy grow by the square
but the organism grows by the volume, resulting in an imbalance (Galileo 1638,
Carneiro 1987); fractals emerge to bring surface/volume back into balance. West
and Brown (1997) show that several phenomena in biology such as metabolic
rate, height of trees, life span, etc. are described by allometric power law whose
exponent is a multiple of ±¼. The cause is a fractal distribution of resources.
Allometric power laws hold across 27 orders of magnitude (of mass).
2
Least effort
Language; transition: Word frequency is a function of ease of usage by both
speaker/writer and listener/reader; this gives rise to Zipf’s (power) Law (1949);
now found to apply to language, firms, and economies in transition (Ferrer i
Cancho & Solé, 2003; Dahui et al., 2005; Ishikawa, 2005; Podobnik et al., 2006).
3
Hierarchical
modularity
Growth unit connectivity: As cell fission occurs by the square, connectivity
increases by n(n–1)/2, producing an imbalance between the gains from fission vs.
the cost of maintaining connectivity; consequently organisms form modules so as
to reduce the cost of connectivity; Simon argued that adaptive advantage goes to
“nearly decomposable” systems (Simon, 1962; Bykoski, 2003). Complex adaptive
systems: Heterogeneous agents seeking out other agents to copy/learn from so as
to improve fitness generate networks; there is some probability of positive
feedback such that some networks become groups, some groups form larger
groups & hierarchies (Kauffman, 1969, 1993; Holland, 1995).
Combinations
4
Interactive
Breakage
theory
Wealth; mass extinctions/explosions: A few independent elements
having multiplicative effects produce lognormals; if the elements
become interactive with positive feedback loops materializing, a
power law results; based on Kolmogorov’s “breakage theory” of
wealth creation (1941).
# of exponentials; complexity: Multiple exponential or lognormal
5
distributions or increased complexity of components (subtasks,
Combination
processes) sets up, which results in a power law distribution
theory
(Mandelbrot, 1963; West & Deering, 1995; Newman, 2005).
6
Interacting
fractals
Food web; firm & industry size, heartbeats: The fractal structure of
a species is based on the food web (Pimm, 1982), which is a
function of the fractal structure of predators and niche resources
(Preston 1950; Halloy, 1998; Solé & Alonso, 1998; Camacho &
Solé, 2001; Kostylev & Erlandsson, 2001, West, 2006).
Positive feedback loops
7
Preferential
attachment
Nodes; gravitational attraction: Given newly arriving agents into a
system, larger nodes with an enhanced propensity to attract agents will
become disproportionately even larger, resulting in the power law
signature (Yule, 1925; Young, 1928; Arthur, 1988; Barabási, 2000).
8
Irregularity
generated
gradients
Coral growth; blockages: Starting with a random, insignificant
irregularity, coupled with positive feedback, the initial irregularity
increases its effect. This explains the growth of coral reefs, blockages
changing the course of rivers, (Juarrero, 1999; Turner, 2000; Barabási,
2005). Diffusion limited accretion (DLA). See also “niche
constructionism” in biology (Odling-Smee, 2003)
Contextual effects
9
Phase
transitions
Turbulent flows: Exogenous energy impositions cause autocatalytic,
interaction effects and percolation transitions at a specific energy
level—the 1st critical value—such that new interaction groupings form
with a Pareto distribution (Bénard, 1901; Prigogine, 1955; Stauffer,
1985; Newman, 2005).
10
Selforganized
criticality
Sandpiles; forests; heartbeats: Under constant tension of some kind
(gravity, ecological balance, delivery of oxygen), some systems reach a
critical state where they maintain stasis by preservative behaviors—
such as sand avalanches, forest fires, changing heartbeat rate—which
vary in size of effect according to a power law (Bak et al., 1987;
Drossel & Schwabl, 1992; Bak, 1996).
Markets: When production, distribution, and search become cheap and
11
easily available, markets develop a long tail of proliferating niches
Niche
containing fewer customers; they become Paretian with mass-market
proliferation products at one end and a long tail of niches at the other (Anderson,
2006).
Gaussian – heights of individuals
Tallest man (Robert Pershing Wadlow) 272 cm
Ratio:
= 3.7
Shortest man (He Pingping) 74 cm
Source : Lada Adamic - http://www.hpl.hp.com/research/idl/papers/ranking/ranking.html
Paretian: city size
Largest city (Mumbai) population 13,922,125
Ratio:
= 605310
Smallest city (Hum, Croatia ) pop. 23
Mumbai, India
Hum, Croatia
Krugman on the Zipf law:
“we are unused to seeing
regularities this exact in
economics – it is so exact
that I find it spooky”
(1996) p.40
Source: Bak (1996) “How Nature Works”
Two tails of a power law
Ricther-Gutenberg Law
Nc (Earthquakes/Year)
Small events tail
Casti _126
Find gutemberg
Earthquake magnitude (mb ) ~ Log E
Extreme events tail
Main properties of Paretian distributions
• Moments:
Pr[X ≥ x] = k*x-α
• Largest value:
• maximum value depends on size of sample
• highly skewed distribution (80/20 Rule)
• Scaling property:
p(bx) = g(b)p(x) for any b
Moments of distributions
 1st: average
– Representative?
– Stable?
number of AOL
visitors to other
websites in
1997*
 2nd: variance
– Finite or unbounded?
– Stable?
• 3rd: Skewness
• 4th: Kurtosis
* Lada Adamic, Zipf, Power-laws, and Pareto - a ranking tutorial, http://www.hpl.hp.com/research/idl/papers/ranking/ranking.html
Largest value
• Financial markets
Central limit theorem doesn’t apply. No convergence to the
mean, no central tendency.
The world shows an unlimited and irreducible stock of
surprises!
Scalability
Scalability
Scalability
Scalability
Scalability
Scalability
Scalability in financial markets
Which approach to statistics?
Traditional statistics assume bellshaped distribution, with typical scale
(mean) and rapidly decaying tails
Neo-classical economics and
equilibrium-based management
theories assume normal distributions
and descriptive/behavioral parameters
gathering around means. Extreme
events are very rare and therefore
negligible
Power-law distributions show no
mean (scale-free) and exhibit long fat
tails (infinite variance). A PL explores
the maximum dynamic range of
diversity of the variable, limited only
by size of network and agent.
Extreme events are more frequent
and their magnitude is
disproportionately bigger than in the
bell distribution case.
In a Gaussian world:
In a Paretian world:
Challenge: manage the population
Challenge: manage the frontier
– How: reduce population to the
representative agent and define
variance (of population)
– Manage around mean and variance
– Identify outliers and manage the tails
(together with the bulk) of the
distribution
– Manage the tails
Change: gradualism
Change: extreme events
– EEs are exceedingly rare and can be
treated as perturbation (system
restores equilibrium after transient)
– EEs arise in the tails and determine
the industry next structure
Scale-free theories
Scale-free theories
– Don’t exist in Gaussian systems
– The growth of most systems follows a
set of scaling trends that link tiny
initiating events with more significant
or even extreme outcomes.
The danger of averages
Any questions?
Thank you
Download