Statistics Prerequisite materials

advertisement
What is Statistics
Chapter 1
McGraw-Hill/Irwin
©The McGraw-Hill Companies, Inc. 2008
GOALS






2
What is meant by statistics?
Understand why we study statistics.
Explain what is meant by descriptive statistics and
inferential statistics.
Distinguish between a qualitative variable and a
quantitative variable.
Describe how a discrete variable is different from a
continuous variable.
Distinguish among the nominal, ordinal, interval, and
ratio levels of measurement.
What Is Meant By Statistics?
–
Common meaning:
–
Numerical information such as:


–
–
3
Mean time waiting on hold for technical support is 17
minutes
In a recent poll, 21% of respondents approved of the
President’s policies
Statistic: One figure
Statistics: more than one figure
What Is Meant By Statistics?
Statistics is interested in what is a typical value
and how much variation there is in the data
Typical value:
–
–
–
Variation
–
–
–
4
Some sort of average (Mean, Median, Mode)
How reliable is the average
How clustered are the data points around the mean
What is Meant by Statistics?
Statistics is the science of
collecting, organizing, presenting,
analyzing, and interpreting
numerical data to assist in making
more effective decisions.
5
Understand why we study statistics


Because numeric and non-numeric data are
everywhere
In marketing, accounting, finance,
economics, politics, sciences, and
elsewhere, there are statistics
–
–
–
6
We need to be able to understand statistics when
we encounter them
We need to not be tricked by misleading statistics
We need to use statistics to help us make
decisions under future uncertainty
Who Uses Statistics?
Statistical techniques are used
extensively by marketing,
accounting, quality control,
consumers, professional sports
people, hospital administrators,
educators, politicians, physicians,
etc...
7
Types of Statistics – Descriptive
Statistics
Descriptive Statistics - methods of
organizing, summarizing, and
presenting data in an informative
way.
8
Descriptive Statistics
EXAMPLE 2: According
to Consumer Reports,
General Electric
washing machine
owners reported 9
problems per 100
machines during
2001. The statistic 9
describes the number
of problems out of
every 100 machines.
9
Descriptive Statistics
18.50%
6.50%
4.50%
Nike
Adidas
Reebok
Asics
Other
24.50%
46%
EXAMPLE 3: Pie Chart (chapter 2) For Running
Shoes Sold At Big 5 Sports
10
Descriptive Statistics
Frequency Distribution of Selling Prices
at Whitner Pontiac Last Month
Selling Prices
($ thousands)
12 up to 15
15 up to 18
18 up to 21
21 up to 24
24 up to 27
27 up to 30
30 up to 33
Total
Number of
Vehicles Sold
(Frequency)
8
23
17
18
8
4
2
80
EXAMPLE 4: Frequency Distribution (chapter 2)
11
Types of Statistics – Descriptive
Statistics
 Inferential
Statistics definition 1: The
methods used to estimate a
property of a population on the basis
of a sample.
 Inferential Statistics definition 2: A
decision, estimate, prediction, or
generalization about a population,
based on a sample.
12
Population versus Sample
A population is a collection of all possible individuals, objects, or
measurements of interest.
A sample is a portion, or part, of the population of interest
13
Inferential Statistics
Example 1: TV networks
constantly monitor
the popularity of their
programs by hiring
Nielsen and other
organizations to
sample the
preferences of TV
viewers.
14
#1
Inferential Statistics
Example 2: Wine tasters
sip a few drops of
wine to make a
decision with respect
to all the wine waiting
to be released for
sale.
15
Example 3: The
accounting
department of a large
firm will select a
sample of the
invoices to check for
accuracy for all the
invoices of the
company.
Descriptive Statistics Or Inferential
Statistics?
16

There are a total of 42,796 miles of interstate
highways in the United States

Auditors take a sample of a firm’s invoices in
order to assess the magnitude of reliability of
the accounting invoicing system
Types of Variables

Qualitative or Attribute variable - the
characteristic being studied is
nonnumeric.

EXAMPLES: Gender, type of automobile owned,
state of birth, eye color are examples.

Qualitative data are usually summarized in
graphs or bar charts
(Nominal or ordinal level of measurement)

17
Types of Variables

Quantitative variable - information is
reported numerically.

EXAMPLES: balance in your checking account,
minutes remaining in class, or number of children in a
family.
 Quantitative
variables can be
classified as either Discrete or
Continuous.
18
Quantitative Variables - Classifications
 Discrete
variables: can only assume
certain values and there are usually
“gaps” between values.
–
19
EXAMPLE: the number of bedrooms in a house, or the number of
hammers sold at the local Home Depot (1,2,3,…,etc).
Quantitative Variables - Classifications
 Continuous
variable can assume any
value within a specified range.
–
–
–
20
EXAMPLE: The pressure in a tire, the weight of a pork chop, or
the height of students in a class.
Usually is measured (accuracy depends on measuring
instrument)
Money is often categorized as a continuous variable (even though
you can’t count between pennies)
Distinguish Between A Qualitative Variable
And A Quantitative Variable




21
Colors of M & M candies?
Amount of money in your retirement
account?
Score on test?
Type of bike you own?
Summary of Types of Variables
22
Four Levels of Measurement

Levels of measurement dictate:
–
–
The calculations that can be done to summarize & present the
data
The statistical tests that can be preformed
level – No order
 Ordinal level – Order but no set distance between
 Nominal
levels
 Interval
level - Order with set distances between
levels, zero just a point on the scale, no division
 Ratio
level - Order with set distances between levels,
inherent zero starting point, division OK
23
Nominal Level

Nominal level - data that is classified into
categories and cannot be arranged in any particular
order.



Observations of a qualitative variable can only be
classified and counted. There is no particular order
to the labels.
Nominal level properties:
1.
2.
24
EXAMPLES: Eye color, gender, car make
Data categories are represented by labels or names.
Even when the labels are numerically coded, the data
categories have no logical order.
# unemployed/100,000
Nominal Level
10000
8900
8900
9000
8200
8000 7300
6700
7000
5400
6000
5000
4000
3000
2000
1000
0
1
2
3
4
5
Atlanta
Boston
Chicago
Los Angeles
New York
Washington
6
Cities
25
Ordinal Level

Ordinal level – involves data arranged in some
order, but the differences between data values
cannot be determined or are meaningless.




Ordinal level properties:
1.
2.
26
EXAMPLE: During a taste test of 4 soft drinks, Mellow Yellow
was ranked number 1, Sprite number 2, Seven-up number 3,
and Orange Crush number 4.
EXAMPLE: How do you rate your instructor?
EXAMPLE: Order of finish in race.
Data classifications are represented by sets of labels or
names (high, medium, low or very good, good, poor) that
have relative values.
Because of the relative values, the classified data can be
ranked or ordered.
Ordinal Level
During a taste test
of 4 soft drinks,
Coca Cola was
ranked number 1,
Dr. Pepper number
2, Pepsi number 3,
and Root Beer
number 4.
4
2
1
3
Interval Level

Interval level:
–
–
–
–
–
28
One category is higher than
another (Ordered).
There is a constant unit of
measurement.
Zero is just a point on the
scale; or there is no natural
zero point.
Division of two numbers does
not make sense.
Scale or rank are good
examples

EXAMPLE: Temperature on
the Fahrenheit scale.
–

EXAMPLE: Shoe size and
dress size.
–

There is no natural zero point
EXAMPLE: Years in which
Whole Foods Market Inc. stock
split.
–

Zero is just a point on the
scale.
Division of 1992 and 1993
does not make sense.
EXAMPLES: Rank of Indi 500
results, Test scores.
Interval Level

Interval level properties:
1.
2.
Data classifications are ordered according to the
amount of the characteristic they possess.
Equal differences in the characteristic are
represented by equal differences in the
measurement.
1.
29
The increment amount up or down is always the same.
Ratio Level

Ratio level - the interval level with an
inherent zero starting point. Differences and
ratios are meaningful for this level of
measurement.
–
30
Practically all quantitative data are the ratio level
of measurement.
EXAMPLES: Monthly income of surgeons,
distance traveled by Sales Rep. per month, Bank
account amount, weight, height, wages, units of
production….
Ratio Level

Bank account dollars
–
–
–
–
Zero is not just a point on the scale, it is the
inherent starting point.
Zero means that you don’t have any money.
Zero means that there is a complete absence of
money.
Division has meaning:



31
Starting balance = $1000.
Ending balance = $1500.
Decimal equivalent change = 1500/1000-1 = .50.
Ratio Level

Ratio level properties:
1.
2.
3.
32
Data classifications are ordered according to the
amount of the characteristics they posses.
Equal differences in the characteristics are
represented by equal differences in the number
assigned to the classification.
The zero point is the absence of the
characteristic and the ratio between two
numbers is meaningful.
Summary of the Characteristics for
Levels of Measurement (example 1)
33
Summary of the Characteristics for
Levels of Measurement (example 2)
Levels of Data
Nominal
Data may only
be classified
(no order)
1.
2.
34
Jersey #
Make of car
Ordinal
Interval
Ratio
Meaningful 0 point
Data are ranked
Meaningful
differences
between values
1.
2.
Team standings
in the Pac 10
CPA exam rank
1.
2.
3.
Temperature
Shoe size
Score on Test
& ratio
between values
1.
2.
Checkbook Bal.
Stock values
End of Chapter 1
35
Download