Plotting Data !!!

advertisement
Plotting Data
When Analyzing data, always plot the data!
• Always plot original data points.
– This is the first thing to do when analyzing
data
– This is very important!
*
Plotting helps avoid big mistakes
1000
Fetal Weight (g)
100
10
1
0.1
1
10
100
Placental Weight (g)
0.1
The decimal point was left out for some of the data points
Plotting Cancer Study Results
• The following plots are from a study by Dr.
Terry Rose-Hellekant in the Medical School
Duluth
• Treatments
– Tamoxifen
– Placebo
• Some mice develop breast cancer
• The data are RT-PCR expressions
corresponding to particular genes
– In RT-PCR the values are roughly a log base 2
scale of the RNA content.
• PUM1 Is a “housekeeping” gene
– Account for RNA quality in the sample
– For example time since death for a study of
schizophrenia on deceased patients’ brains
A dot diagram of fishing line strength
XL
11.8
11.7
11.6
11.5
11.4
11.3
11.2
11.1
11.0
10.9
**
**
***
**
*
XT
**
***
***
*
*
Stren
*
**
****
**
*
Two groups can be compared with back to
back stem and leaf diagrams
e.g. Stopping distances of bikes
Treaded tire
5
64
1
20
34
35
36
37
38
39
40
Or dot diagrams
|
|
| * | ** |
|*
340 350 360 370 380 390
|*** | * |
| * |
|*
Smooth tire
189
5
5
1
|**
400
|
Treaded
Smooth
When there are associations between sets of data values,
plot the data accordingly.
E.g., Snowfall for duluth and White Bear Lake 1972-2000
A not very good way to plot the data
WB Lake
**
*
******
***
**********
***
***
***
130
120
110
100
90
80
70
60
50
40
30
20
Duluth
*
*
**
***
*****
******
**
**
****
***
*
snow_total
Snowfall plot
140
130
120
110
100
90
80
70
60
50
40
30
20
10
0
Duluth
White Bear
1972
1977
1982
1987
year
1992
1997
A study of trace metals in South
Indian River
5
3
1
2
6
4
T=top water zinc concentration (mg/L)
B=bottom water zinc (mg/L)
1
2
3
4
5
6
Top
0.415 0.238 0.390 0.410 0.605 0.609
Bottom
0.430 0.266 0.567 0.531 0.707 0.716
• One of the first things to do when analyzing data is
to PLOT the data
Zinc In River
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0
1
2
Depth 1=Top 2=Bottom
3
• This is not a useful way to plot the data. There is not
a clear distinction between bottom water and top
water zinc
• even though Bottom>Top at all 6 locations.
A better way
0.7
0.6
Zinc 0.5
0.4
0.3
0.2
Top
Connect points in the same pair.
Bottom
A better way
0.8
0.6
Bottom=Top
0.4
0.2
0
0
0.2
0.4
0.6
0.8
This following plot would imply a natural ordering of sites from 1 to 6.
This would not be the best way to plot the data unless the sites 1-6 correspond to a
natural ordering such as distance downstream of a factory.
0.8
Zinc
0.7
0.6
0.5
0.4
Top
Bottom
0.3
0.2
0.1
0
0
1
2
3
4
Site
5
6
7
Related documents
Download