Mosaicplots and Variations Heike Hofmann I

advertisement
Mosaicplots
and Variations
Heike Hofmann
IOWA STATE UNIVERSITY
1
Outline
Construction of Mosaicplots
Associations and Mosaics
Variations:
Doubledecker plots, Fluctuation diagrams,
Same-Bin-Size displays
Generalisations
2
The Titanic Data
3
The Titanic Data
4
The Titanic Data
5
Mosaics
hierarchically built
area-based plots
default: “slice & dice”
order of variables matters
limiting factors: #variables, screen space
6
Properties of Mosaics
Class
First
Second
Third
Crew
Survived
P( Survival = yes | Class = First )
No
181.3 pt
Yes
P( Class = First )
47.7 pt
7
Properties of Mosaics
Class
First
Second
Third
Crew
Survived
P( Survival = yes | Class = First )
No
181.3 pt
Yes
P( Class = First )
47.7 pt
7
Associations
m11
m21
}d
m12
m22
no interaction
some interaction
}d
stronger interaction
8
Associations
m11
m21
}d
m12
m22
no interaction
X and Y
independent
some interaction
}d
stronger interaction
<=>
8
Associations
m11
m21
}d
}d
m12
m22
no interaction
X and Y
independent
some interaction
stronger interaction
<=>
(Hofmann 2000)
8
Associations
m11
m21
}d
}d
m12
m22
no interaction
X and Y
independent
some interaction
stronger interaction
<=>
(Hofmann 2000)
significance of interaction depends on number of cases
8
Higher-Order Interactions
Y
a1
c1
b1
a2
b2
d1
c2
d2
X
Z
Under control group constraints:
9
High-D Associations
Y
H
H
Z
Z
X
interaction: H*Z | X, Y
(Z ordinal)
10
Variations
Changes to properties of default mosaics
slice & dice build
area proportional to cell value
space-filling plot
11
Doubledecker Plot
splits in horizontal direction only
Survived
no
Double Decker Plot of Sex and Class
yes
Sex
Class
female
male
Crew
1st
2nd
3rd
highlighting: heights proportional to cond. probability
loose: relationship between Sex and Class
12
Same-Bin-Size Diagram
All rectangles have the same size
Highlighting gives conditional probabilities
highlighted: x2bar > 9
width
0-3
4
5
6
7-15
A
B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Letter
13
Fluctuation Diagram
Start from Same-Bin-Size display,
shrink each rectangle according to value of
corresponding cell
loose: screen space,
high sensitivity to aspect ratio
gain: comparisons along axes in two directions
14
Fluctuation Diagram
SES and IQ have positive association
SES (socio-economic status)
low
lower
middle
upper
middle
high
IQ
low
lower
middle
upper
middle
high
15
Fluctuation Diagram
SES and IQ have positive association
SES (socio-economic status)
low
lower
middle
College Plans
upper
middle
high
IQ
low
No
Yes
lower
middle
upper
middle
high
Either SES or IQ propel positive college plans
16
Fluctuation Diagram
Correlation
Matrix:
Monthly
Temperatures
over six years
in 576 locations
J F M A M J J A SO N D J F M A M J J A SO N D J F M A M J J A SO N D
J
F
M
A
M
J
J
A
S
O
N
D
J
F
M
A
M
J
J
A
S
O
N
D
J
F
M
A
M
J
J
A
S
O
N
D
17
Fluctuation Diagram
Data Matrix visualization
overall percentages
Ash
Fat
deviations from row averages
Lactose Protein Water
Ash
Fat
Lactose Protein Water
BISON
BUFFALO
CAMEL
CAT
DEER
DOG
DOLPHIN
DONKEY
ELEPHANT
FOX
GUINEA PIG
HIPPO
HORSE
LLAMA
MONKEY
MULE
ORANGUTAN
PIG
RABBIT
RAT
REINDEER
SEAL
SHEEP
WHALE
ZEBRA
18
Fluctuation Diagram
Ash
Fat
Lactose
Protein
Water
HORSE
Cluster 1
ORANGUTAN
MONKEY
DONKEY
MULE
HIPPO
CAMEL
Cluster 2
ZEBRA
BISON
LLAMA
BUFFALO
FOX
SHEEP
ELEPHANT
Cluster 3
GUINEA
PIG
CAT
DOG
Cluster 5 Cluster 4
RAT
RABBIT
DEER
REINDEER
WHALE
SEAL
DOLPHIN
20
10
1
# clusters
19
Multiple Barcharts
Vehicle
4 -wheeledvehicles
Bicycles
Mot orcycles
Pedest rians
Age
Gender
0 -1 0
1 0 -2 0 2 0 -3 0 3 0 -5 0
Female
50+
0 -1 0
1 0 -2 0 2 0 -3 0 3 0 -5 0
50+
Male
Barcharts in each panel
-> Mosaicplots area special case of Trellis displays
20
Multiple Barcharts
Vehicle
Male
4−wheeled−vehicles
4 -wheeledvehicles
Male
Bicycles
Male
Motorcycles
Male
Pedestrians
15000
10000
5000
Bicycles
Observed
0
Female
4−wheeled−vehicles
15000
Female
Bicycles
Female
Motorcycles
Female
Pedestrians
0−10 10−20 20−30 30−50 50+
0−10 10−20 20−30 30−50 50+
Mot orcycles
10000
5000
0
Pedest rians
0−10 10−20 20−30 30−50 50+
Age
Gender
0 -1 0
0−10 10−20 20−30 30−50 50+
1 0 -2 0 2 0 -3 0 3 0 -5 0
Female
50+
0 -1 0
1 0 -2 0 2 0 -3 0 3 0 -5 0
50+
Male
Barcharts in each panel
-> Mosaicplots area special case of Trellis displays
20
Some Generalisations
Mosaics are special cases of
Trellis Displays
type of display is more flexible
Treemaps
different variables in same level of hierarchy
21
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
22
22
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
23
23
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
24
24
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
25
25
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
26
26
Treemap - construction
eicosenoic
< 6.5
!6.5
linoleic
< 951
palmitoleic
!951
Apulia
< 1053.5
!95.5 < 95.5
stearic
Calabria
North
Sardinia
< 264.5
Apulia
AT&T Labs - Research
linoleic
Sicily
Visualizing Trees and Forests, Simon Urbanek
27
27
Treemaps in the
Infovis Community
Voronoi Treemap
Circle Treemap
Figure 10: Enhanced AW Voronoi Treemap layout of 4075 nodes at
10 hierarchy levels (a brighter color indicates a lower hierarchy level)
Figure 11: Enhanced PW Voronoi Treemap layout of 16288 nodes at
7 hierarchy levels (a brighter color indicates a lower hierarchy level)
become very large. Thus, with regard to computation time, other
Treemap layout algorithms outperform this method by far. This
problem is diminished by using distributed computing environ-
[6] Qiang Du and Xiaoqiang Wang. Centroidal voronoi tessellation based
algorithms for vector fields visualization and segmentation. In Proceedings of the IEEE Visualization, pages 43–50. IEEE Computer So-
Balzer et al. 2005
Kai Wetzel, 2005
28
Thank you!
Comments? Questions?
Software: Manet (all mosaic variations),
Mondrian (most),
iplots 2.0 (some)
29
Download