TWO TYPES OF STATISTICS Sections 2-1 and 2-2 8/4/2014

advertisement
8/4/2014
TWO TYPES OF STATISTICS
• Descriptivestatistics summarizeor
describe theimportantcharacteristicsof
data.
• Inferentialstatistics usesampledatato
makeinferences (orgeneralizations)about
apopulation.
Sections 2-1 and 2-2
Review and Preview
and
Frequency Distributions
IMPORTANT CHARACTERISTICS
OF DATA
FREQUENCY DISTRIBUTION
1. Center: Arepresentativeoraveragevaluethat
indicateswherethemiddleofthedatasetis.
2. Variation: Ameasureoftheamountthatthe
datavaluesvaryamongthemselves.
3. Distribution: Thenatureorshapeofthe
distributionofthedata(suchasbell‐shaped,
uniform,orskewed).
4. Outliers: Samplevaluesthatlieveryfaraway
fromthevastmajorityoftheothersamplevalues.
5. Time: Changingcharacteristicsofdataovertime.
Afrequencydistribution (orfrequency
table)showshowadatasetispartitioned
amongallofseveralcategories(orclasses)by
listingallofthecategoriesalongwiththe
number(frequency)ofdatavaluesineachof
them.
QWERTY KEYBOARD WORD
RATINGS
AnarticleinDiscover magazine(“Typecasting”by
ScottKim,Discover)suggeststhatyoucan
measuretheeaseoftypingbyusingthispoint
ratingsystem:
Counteachletteronthehomerowas0,each
letteronthetoprowas1,andeachletteron
thebottomrowas2.
Togetthewordrating,
EXAMPLE:
wesumthenumbers.
ST ATI STI CS
Thewordratingfor
0101101120
“statistics”is7.
QWERTY KEYBOARD WORD
RATINGS
2
2
5
1
2
6
3
3
4
2
4
0
5
7
7
5
6
6
8
10
7
2
2
10
5
8
2
5
4
2
6
2
6
1
7
2
7
2
3
8
1
5
2
5
2
14
2
2
6
3
1
7
Thistableshowsthewordratingsforeachofthe52
wordsinthePreambletotheConstitution.
1
8/4/2014
FREQUENCY DISTRIBUTION OF
QWERTY WORD RATINGS
Rating
Lowerclasslimits arethesmallestnumbers
thatcanbelongtodifferentclasses.
Frequency
0-2
20
3-5
14
6-8
15
9 - 11
2
12 - 14
1
Rating
Lower Class
Limits
UPPER CLASS LIMITS
Upper Class
Limits
Frequency
0-2
20
3-5
14
6-8
15
9 - 11
2
12 - 14
1
CLASS BOUNDARIES
Upperclasslimits arethelargestnumbers
thatcanbelongtodifferentclasses.
Rating
LOWER CLASS LIMITS
Frequency
0-2
20
3-5
14
6-8
15
9 - 11
2
12 - 14
1
Classboundaries arethenumbersusedto
separateclasses,butwithoutthegapscreated
byclasslimits.
Rating
Frequency
- 0.5
2.5
Class
Boundaries
5.5
0-2
20
3-5
14
6-8
15
8.5
11.5
9 - 11
2
12 - 14
1
14.5
FINDING CLASS BOUNDARIES
Tofindclassboundaries:
1. Findthesizeofthegapbetweentheupper
classlimitofoneclassandthelowerclass
limitofthenextclass.
2. Addhalfofthatamounttoeachupperclass
limittofindtheupperclassboundaries.
3. Subtracthalfthatamountfromeachlower
classlimittofindthelowerclass
boundaries.
CLASS MIDPOINTS
Classmidpoints arethevaluesinthemiddle
oftheclasses.
Rating
Class
Midpoints
Frequency
0- 1 2
20
3- 4 5
14
6- 7 8
15
9 - 10 11
2
12 - 13 14
1
2
8/4/2014
CLASS WIDTH
Classwidth isthedifferencebetweentwo
consecutivelowerlimits(ortwoconsecutive
lowerclassboundaries)inafrequency
Rating
Frequency
distribution.
Class Width
3
0-2
20
3
3-5
14
3
6-8
15
3 9 - 11
2
3 12 - 14
1
RELATIVE FREQUENCY
DISTRIBUTION
frequencyforaclass
sumofallfrequencies
percentagefrequency
frequencyforaclass
sumofallfrequencies
100%
CUMULATIVE FREQUENCY
DISTRIBUTION
Acumulativefrequency foraclassisthesum
ofthefrequenciesforthatclassandallprevious
classes.
Rating Frequency
Rating
Cumulative
Frequency
0-2
20
Less than 3
20
3-5
14
Less than 6
34
6-8
15
Less than 9
49
9 - 11
2
Less than 12
51
12 - 14
1
Less than 15
52
1. Selectthenumberofclasses,usuallybetween5and20.
2. Determinetheclasswidth
maximumdatavalue
minimumdatavalue
classwidth
numberofclasses
Roundthisresulttogetaconvenientnumber.(It’susuallybestto
roundup.)
3. Choosethevalueforthefirstlowerclasslimitbyusingeitherthe
minimumvalueoraconvenientvalueslightlybelowtheminimum.
4. Usingthefirstlowerclasslimitandtheclasswidth,listtheotherlower
classlimits.(Addtheclasswidthtothefirstlowerclasslimittogetthe
secondlowerclasslimit.Addthewidthtothesecondlowerlimitto
getthethird,andsoon.)
5. Listthelowerclasslimitsinaverticalcolumnandentertheupper
classlimits.
6. Takeeachindividualdatavalueandputatallymarkintheappropriate
class.Addthetallymarkstofindthetotalfrequencyforeachclass.
RELATIVE FREQUENCY
DISTRIBUTION
Arelativefrequencydistribution or
percentagefrequencydistribution includes
thesameclasslimitsasafrequencydistribution,
butrelativefrequenciesareusedinsteadofactual
frequencies.
relativefrequency
CONSTRUCTING A FREQUENCY
DISTRIBUTION
Cumulative
Frequencies
Rating Frequency
Relative
Rating Frequency
0-2
20
0-2
38.5%
20/52=38.5%
3-5
14
3-5
26.9%
14/52=26.9%
6-8
15
6-8
28.8%
9 - 11
2
9 - 11
3.8%
12 - 14
1
12 - 14
1.9%
etc.
Totalfrequency=52
CRITICALLY THINKING: USING
FREQUENCY DISTRIBUTIONS TO
UNDERSTAND DATA
Inlaterchapters,therewillbefrequentreference
todatawithanormaldistribution.Onekey
characteristicofanormaldistributionisthatit
hasa“bell”shape.
• Thefrequenciesstartlow,thenincreasetoone
ortwohighfrequencies,thendecreasetoalow
frequency.
• Thedistributionshouldbeapproximately
symmetric,withfrequenciesprecedingthe
maximumbeingroughlyamirrorimageof
thosethatfollowthemaximum.
3
Download