Multilevel networks and world ethnography Doug White and UC team

advertisement
Multilevel networks and world
ethnography
Doug White and UC+ team
SFI noon seminar 12:15
Wed, Sept 1, 2010
UC+=UCI_imbs+UCSD_econ+sfi, project team
Scott D. White, UCI , One Spot
Halbert White UCSD
Karim Chalak BU, Econ
B. Tolga Oztan, IMBS
and aunt
Assist from
Judea Pearl
UCLA
Laura Fortunato, SFI
Ren Feng. Xi’an
Xiaotung Univ.
Tony Eff, MS, Econ
Language families
Folded image: Core, Semi-periphery1, SP2, Periphery1-2
Core
Semi-Peri1
Semi-Peri2
Periphery1
Periphery2
A structurally endogamous kinship network core of
a Turkish nomad clan
(White and Johansen 2005: 379; 76-79).
Standard Cross-Cultural Sample
(wikipedia page maps by Tony Eff
Afro-Eurasia drawn to a
slightly smaller scale
Causal graph, Pearl’s regression method
“Say for three variables you are trying to estimate the direct
effect c of X on Z given an indirect effect of Y. The causal
diagram model gives you a license to do it by the regression
method, where, for example
E(y|x, z) – E(y|x´, z)
a
X Y
c = —————————————
(1)
c
b
x – x´
,
Z
Controlling for the change from x to x´, E(y|x, z) and E(y|x´,
z) are the changes in variable Z due to unit changes in X
controlling for Y.” (email from Pearl see Pearl 2000:151;
Chalak and White 2010). Because the x,z in (y|x,z) is a joint
distribution, eqn (1) means that x→x´ changes y which
through the x-y-x path, considered as a joint distribution,
changes z. From this it follows, given the single door
criterion (Pearl 2000:150). that c + a•b = rxy.z, the coef for
total effect of X on Z.
Solving Galton’s problem, Two stage OLS
Two stage regression with peer effects (different notation)
Stage 1
Calculate the
Instruments
Stage 2 use
Instruments
in OLS
X1
X2
Y
13 linked regressions out of 2000+ SCCS variables
http://eclectic.ss.uci.edu/~drwhite/courses/SCCCodes.htm
Nodes are variables in regression analyses of variables from the Standard Cross-Cultural Sample of 186 societies (SCCS).
Lines represent independent variables. They point down to 13 dependent variables in successive colored layers.
Black lines are positive effects, red lines negative effects from regression results.
Colors of nodes for variables show depth in a causal hierarchy with net effects estimated as causal graphs (Pearl 2000).
At level 4 the Evil eye dependent variable has a triangular relationship with money and milked domestic animals.
The regressions control for peer effects of spatial transmission (distance) and cultural transmission (language phylogeny),
incorporated as Instrumental Variables in a second-stage regression, with the IVs estimated in a first-stage regression.
Node sizes reflect the significance of spatial transmission peer effects. Language effects are sometimes negative.
v1189 Belief in evil eye
v238 Moral gods==4
238.
HIGH GODS
18
. = Missing data
68
1 = Absent or not reported
47
2 = Present but not active in human affairs
13
3 = Present and active in human affairs but not supportive
of human morality
40
4 = Present, active, and specifically supportive of human morality
v1189 Belief in evil eye (dichotomy)
Large nodes red
Small nodes orange
155. SCALE
77
14
43
27
25
v155 True money==5
7- MONEY (here, an independent variable)
1 = None
2 = Domestically usable articles
3 = Alien currency
4 = Elementary forms
5 = True money
v1189 Belief in evil eye
v272 Caste stratification
272. CASTE STRATIFICATION (ENDOGAMY) (two cases have secondary
castes)
5 . = Missing data
(154) 0 = (Omitted from map) Absent or insignificant
17 1 = Despised occupational group(s)
3 2 = Ethnic stratification
7 3 = Complex
v1189 Belief in evil eye
v245 Milked animals
v1189 Belief in evil eye
Model 1
Description Re: Evil eye
Eff-Dow
coef
pvalue
-0.247
0.775
NA
(Intercept)
VIF
Var.
Probit
coef
1.1715
0.12
0.6944
0.00004
-0.2267
0.38
Wy fydd
Spatial transmission
0.763
.0000022
3.452
NA
Wy fyll
Cultural transmission
-0.228
0.362
2.329
NA
Milk
Milking of animals
0.664
0.080
2.328
245
*
0.3235
0.48
CaststratLDg
Degree of caste stratification
1.372
0.017
1.225
272
**
0.5078
0.04
Money >1>3>4
Degree of monetization
0.597
0.017
1.152
155~v17
**
0.1011
0.05
Moral gods
Degree of morality of gods
0.294
0.020
1.664
238
**
0.1161
0.04
Diagnostics
Fstat
df
***
pvalue
pvalue
Fstat
RESET test. H0: model has right functional form
3.400
1801.470
0.065
0.717
0.397
Wald test. H0: appropriate variables dropped
0.476
308.949
0.491
0.431
0.512
Breusch-Pagan test. H0: residuals homoskedastic
1.193
3282.405
0.275
8.753
0.003
16.146
9268.270
0.000
3.618
0.057
LM test. H0: Cultural lag (language) not needed
0.713
1877017.
0.398
1.086
0.297
LM test. H0: Spatial lag (distance) not needed
1.768
20982.58
0.184
2.214
0.137
Shapiro-Wilk test. H0: residuals normal
Notes: R2 = 0.513; N=186; number of imputations=10; standard errors and R2 adjusted for two-stage least squares.
“***” p-value ≤ 0.01, “**” p-value ≤0.05, “*” p-value ≤ 0.10. Language non-significant (p > .33).
v155 Money
Model 2
Description Re: Money
(Intercept)
Eff-Dow
coef
pvalue
-0.775
VIF
Var.
Probit
coef
Pvalue
0.002
NA
0.2316
.0000279
3.644
NA
***
0.9057
3.758
***
-0.9220
4.309
0.1005
1.228
Wy fydd
Spatial transmission
0.954
Wy fyll
Cultural (language) transmission
-0.928
0.003
4.190
NA
Foodtrade
Imported food
0.430
0.134
1.219
819
Fratgrpstr
Fraternal interest group strength+
0.120
0.092
1.840
570
*
0.1663
1.757
Milk
Milking of animals
-0.393
0.012
1.560
245
*
-0.2394
1.478
Caststrat LGd
Degree of caste stratification+,++
0.430
0.134
1.219
272
Moral gods
Degree of morality of gods+.++
0.102
0.142
1.502
238
0.1021
0.142
Popdens
Population density
0.206
.0000053
1.552
156
0.3147
1.627
Superjh PCsize
Supra cmnty jurisdictional hier.
0.304
.0000002
1.633
Diagnostics
RESET test. H0: model has right functional form
Fstat
df
pvalue
237
***
***
Fstat
1.943
5301.617
0.163
2.187
0.139
Wald test. H0: appropriate variables dropped
15.266
17.503
0.001
13.332
0.000
Breusch-Pagan test. H0: residuals homoskedastic
13.833
950.560
0.000
4.995
0.027
Shapiro-Wilk test. H0: residuals normal
0.267
282.276
0.606
0.363
0.548
LM test. H0: Cltural lag (language) not needed
1.287
657642.
0.257
1.773
0.183
LM test. H0: Spatial lag (distance) not needed
1.352
991.504
0.245
1.902
0.168
Notes: R2 = 0.490; N=186; number of imputations=10; standard errors and R2 adjusted for two-stage least squares.
“***” p-value ≤ 0.01, “**” p-value ≤0.05, “*” p-value ≤ 0.10. Language non-significant (p > .33).
Probit note: R2 = 0.481; IV(distance)=0.9911; (language)=0.9957 see last two columns for coef and pvalue.
v238 Moral gods
Model 3
Description Re: Moral gods
(Intercept)
Eff-Dow
coef
pvalue
VIF
Var.
Probit
Coef
0.725
0.477
NA
0.917
.00000003
2.526
NA
***
pvalue
1.076
0.166
0.881
.0000007
-.471
0.311
Fydd
Spatial transmission
Fyll
Cultural-language-transmission
-0.700
0.140
2.579
NA
PCAP
PC Agricultural potential
-0.038
0.075
1.148
921
*
-.097
0.059
PCsize
PC Juris. Hierarchy
0.554
0.035
23.844
63^2
**
0.212
0.002
PCsize2
PC Juris. Hierarchy squared
-0.076
0.107
23.404
245
0.344
0.022
Milk
Milking of animals
0.403
0.065
2.287
245^2
0.135
0.031
Foodstress
Chronic food stress
0.207
0.152
1.103
1685
-.190
0.003
Eextwar
Frequency of external war
-0.032
0.006
1.124
1650
-.187
0.024
bridewealth
Bridewealth payments
0.194
0.221
1.307
208=1
0.155
0.146
caststratLgd
Log of Caste stratification
0.704
0.030
1.276
272
0.183
0.035
Diagnostics
Fstat
Df
Pvalue
*
**
**
Fstat
RESET test. H0: model has right functional form
0.236
182.981
0.628
0.287
0.593
Wald test. H0: appropriate variables dropped
1.602
24.460
0.217
1.958
0.165
Breusch-Pagan test. H0: residuals homoskedastic
3.115
121.839
0.080
2.133
0.144
Shapiro-Wilk test. H0: residuals normal
7.527
1106.014
0.006
1.201
0.273
LM test. H0: Cultural lag (language) not needed
1.486
29167.282
0.223
1.228
0.268
LM test. H0: Spatial lag (distance) not needed
0.921
46574.050
0.337
0.809
0.368
Notes: R2 = 0.504; N=186; number of imputations=10; standard errors and R2 adjusted for two-stage least squares.
“***” p-value ≤ 0.01, “**” p-value ≤0.05, “*” p-value ≤ 0.10. Language non-significant (p > .33).
Probit note: R2 = 0.481; IV(distance)=0.9942; (language)=0.9861 see last two columns for coef and pvalue.
Table 4: Transmission effects (Galton’s problem): Spatial and cultural
Peer Effect
Spatial
Transmission
(Distance)
Cultural
Transmission
(Language)
Variable
Money
Moral gods
Evil eye
Money
Moral gods
Evil eye
coef
.960
.824
.767
-.988
-.672
-.228
pvalue
.0000009
.0000014
.000002
.002
p > 0.14
p > 0.36
The negative peer effects for language indicate that, for each of these dependent
variables, there is a tendency, strong for Money and weak for the other two variables, NOT to
be the result of cultural tradition but of innovation that differentiates the societies with
Money, Moral gods and Evil eye from the norms in their respective language families. This
tendency is nearly significant (pvalue < 0.15) for societies with Moral gods.
Figure 3: Causal graph with multiple triangular regression coefficients,
excluding peer effects (numbers are the regression coefficients)
-0.393
Milking animals A
B Money (v155)
(v245)
0.484
0.102 p<0.14
Moral gods D
(v238)
0.294
0.792
0.430
0.597
0.664
1.372
C Evil eye (v1188)
Caststrat LGd E
Table 6: Causal graph total effects and bivariate table regression slopes
Independent
Variable
Dependent
Variable
Money
Moral gods
Milking
Evil eye
Evil eye
Evil eye
Net effects=Direct and Indirect Causal
=Tota Fig.
Graph Effects
l
Slope
effects
0.597
0.597 .741
0.294+(0.102*.597)
0.355 .950
0.664+(-.393*.597)+(.484*.104*.597)
0.744 .810
Moral gods
Milking
Money
Money
0.102
-.393+(0.484*.102)
0.102
-0.344
.482
.244
Other kinds of cross-cultural data structures and analyses:
Statistical Entailment Analyses:
Society sets for variables tend to form chains of sets
Galois duality lattice (Concept lattices):
Society sets for variables tend to form chains of sets
and intersections, and opposite ordering of
Sets of variables that tend to form chains of sets
VS1 VS2 VS3 VS4
A B C D
A B C D
Intrasocietal network structure overlays on genealogy
For each society these will define new variables such as
1) sidedness, reciprocal marriage to opposites.
2) structurally endogamous groups
3) marriage-type census as against random simulation
4) distribution of structural features over generations
Multilevel analysis e.g. regional or world system effects
local societies.
on
Fig. 3: An exact world
entailment digraph for the
sexual division of labor
Late Task A Early Task B
Female
Male
Female
Male
Fig. 3: An exact world
ethnographic lattice of kin
avoidances has a four-dimensional
partial ordering of distributions: 1)
parents of Hu, Wi (opp/same sex,
within circles), 2) siblings and
siblings-in-law of Hu and Wife
(opp/same sex, in parallelograms),
3) opposite sex siblings & parents
siblings & parallel cousins (White
1995). Lower types of avoidances
entail upper ones features in
perfect inclusion relations, found
by statistical entailment analysis
(White 1999b). Of the 250
societies, names attached to each
node show each subset of
avoidance relations.
Table 1
Pajek Repast Simulation
X
X
Peer Effects
ArcGIS.com
New Codes
New Ethnogr. Cases
X
X
3
400 foragers2
X
X
(Binford & Boehm)
85 World-system 3
X
X
1294 Atlas4
X
X
0
186 SCCS5
28 1945-19656
30 Post 19657
X
X
0
X
X
X
28 (SCCS)
X
X
X
308 (eSCCS)
80KinSources1
X
Cohesion
2
(country data)
1 http://kinsource.net/kinsrc/bin/view/KinSources archives kinship network data
contributed by anthropologists. Only three KS ethnographies remain for conversion from
paper-based genealogies to e-networks for analysis with Pajek, but others will be added.
2,5 Binford’s (2001) Constructing Frames of Reference forager database has been
spreadsheeted by Boehm and Hill. Non-foragers from the SCCS will be analyzed separately.
Extensive testing of “peer effects” methods have established their validity.
3 Smith and White (1992) have postwar WS commodity flow time series in 5yr intervals;
capital and migration flow will be added.
4 Murdock’s Ethnographic Atlas (EA) in Spss format has been supplemented by newly
authored installments 30-31.
5 Murdock and White’s (1969) Standard Cross-Cultural Sample dataset on 186 societies in
Spss and R formats has coded data contributions from 80+ different authors on 2008+
variables. Citations to SCCS are now 95+/year and growing.
Table 1
80 KinSources1
Pajek
X
Repast Simulation
X
Cohesion
X
Peer Effects
ArcGIS.com
New Codes
New Ethnogr. Cases
X
X
3
400 foragers2
X
X
(Binford & Boehm)
85 Wrld-system3
X
X
1294 Atlas4
X
X
0
186 SCCS5
X
X
0
28 1945-19656
X
X
X
28 (SCCS)
30 Post 19657
X
X
X
308 (eSCCS)
2
(country data)
5 Murdock and White’s (1969) Standard Cross-Cultural Sample dataset on 186 societies in
Spss and R formats has coded data contributions from 80+ different authors on 2008+
variables. Citations to SCCS are now 95+/year and growing.
6 109 missing codes for 28 SCCS variables 1006-1115 will be coded for 28 SCCS societies on
the world-system impacts variables partially coded in White and Burton’s (1985-1988) NSF
8507685 funded research on “World-Systems and Ethnological Theory.”
7 To bring the SCCS societies up to date for post-1965 societies, 30 well described post-1965
ethnographic cases will be added to an (expanded) eSCCS and coded for EA variables and
the CDC Cultural Diversity Codebook of 180 SCCS variables.
8 Given that the SCC Sample was published in 1969, the eSCCS additions to the sample will
bring it up to date temporally. This will allow study of world-system impacts on 37 welldescribed ethnographic cases in the contemporary post-war period.
A structurally endogamous kinship network core of
a Turkish nomad clan (White and Johansen 2005:
379; 76-79).
Fig. 1.A. Gmap of Cultural Survival
(2010) 100+ recent trouble spot
study cases: Gmaps extend to
networks at the global level,
clicking into cases at the local level.
Live: http://bit.ly/c1funC
Fig. 1.B. This google map tracks
cases of swine flu in 2009, types of
cases are color coded, fatal cases
have no dot, clicking a region gives
a more detailed map of cases within
the region.
Similarly, Wolf (1982) drills down at several
hundred ethnographically data points to
analyze how commodity exchange affected
indigenous societies in the 1500-1980 period
of overseas conquest and modern worldsystems.
Interactive maps provide for drilling down
from a network at one level (network spread
of disease not shown here) by clicking a node
to see a more detailed map or a network
within that node. The upper level nodes can
be societies with organizations networks
reached by a click of a given node.
Download