Pertemuan 01 PENDAHULUAN: Data dan Statistika Matakuliah

advertisement
Matakuliah
Tahun
: I0262-Statiatik Probabilitas
: 2007
Pertemuan 01
PENDAHULUAN:
Data dan Statistika
1
Outline Materi:
• Peranan dan Jangkauan Statistika
• Diagram Dahan dan Daun
• Sebaran Frekuensi
2
Business Basic Statistics
Introduction and Data
Collection
3
PERANAN DAN Jangkauan
Statistika
• Why a Manager Needs to Know About
Statistics
• The Growth and Development of Modern
Statistics
• Some Important Definitions
• Descriptive Versus Inferential Statistics
4
Peranan dan Jangkauan Statistika
• Why Data are Needed
(continued)
• Types of Data and Their Sources
• Design of Survey Research
• Types of Sampling Methods
• Types of Survey Errors
5
Why a Manager Needs to Know
About Statistics
• To Know How to Properly Present
Information
• To Know How to Draw Conclusions about
Populations Based on Sample
Information
• To Know How to Improve Processes
• To Know How to Obtain Reliable
Forecasts
6
The Growth and Development of
Modern Statistics
Needs of government to
collect data on its citizenry
The development of the
mathematics of probability
theory
The advent of the computer
7
Some Important Definitions
• A Population (Universe) is the Whole
Collection of Things Under Consideration
• A Sample is a Portion of the Population
Selected for Analysis
• A Parameter is a Summary Measure
Computed to Describe a Characteristic of
the Population
• A Statistic is a Summary Measure
Computed to Describe a Characteristic of
the Sample
8
Population and Sample
Population
Sample
Use statistics to
summarize features
Use parameters to
summarize features
Inference on the population from the sample
9
Statistical Methods
• Descriptive Statistics
– Collecting and describing data
• Inferential Statistics
– Drawing conclusions and/or making decisions
concerning a population based only on
sample data
10
Descriptive Statistics
• Collect Data
– E.g., Survey
• Present Data
– E.g., Tables and graphs
• Characterize Data
– E.g., Sample Mean =
X
i
n
11
Inferential Statistics
• Estimation
– E.g., Estimate the
population mean weight
using the sample mean
weight
• Hypothesis Testing
– E.g., Test the claim that
theDrawing
population
mean and/or making decisions
conclusions
concerning
a population
weight
is 120
pounds based on sample results.
12
Why We Need Data
• To Provide Input to Survey
• To Provide Input to Study
• To Measure Performance of Ongoing
Service or Production Process
• To Evaluate Conformance to Standards
• To Assist in Formulating Alternative
Courses of Action
• To Satisfy Curiosity
13
Data Sources
Data Sources
Print or Electronic
Observation
Survey
Experimentation
14
Types of Data
Data
Categorical
(Qualitative)
Numerical
(Quantitative)
Discrete
Continuous
15
Design of Survey Research
• Choose an Appropriate Mode of
Response
– Reliable primary modes
• Personal interview
• Telephone interview
• Mail survey
– Less reliable self-selection modes (not
appropriate for making inferences about the
population)
• Television survey
• Internet survey
• Printed survey in newspapers and magazines
16
Reasons for Drawing a Sample
• Less Time Consuming Than a Census
• Less Costly to Administer Than a Census
• Less Cumbersome and More Practical to
Administer Than a Census of the
Targeted Population
17
Types of Sampling Methods
Samples
Non-Probability
Samples
(Convenience)
Judgement
Quota
Probability Samples
Simple
Random
Chunk
Stratified
Cluster
Systematic
18
Probability Sampling
• Subjects of the Sample are Chosen Based
on Known Probabilities
Probability Samples
Simple
Random
Systematic
Stratified
Cluster
19
Organizing Numerical Data
Numerical Data
Ordered Array
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
Stem and Leaf
Display
Frequency Distributions
Cumulative Distributions
2 144677
3 028
4 1
41, 24, 32, 26, 27, 27, 30, 24, 38, 21
Histograms
Tables
Ogive
Polygons
20
Stem and Leaf Display
(continued)
• Data in Raw Form (as Collected):
24, 26, 24, 21, 27, 27, 30, 41, 32, 38
• Data in Ordered Array from Smallest to
Largest:
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
• Stem-and-Leaf Display:
2
144677
3
028
4
1
21
Tabulating and Graphing
Numerical Data
Numerical Data
Ordered Array
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
41, 24, 32, 26, 27, 27, 30, 24, 38, 21
Frequency Distributions
Cumulative Distributions
O g ive
120
100
80
60
40
20
0
10
Stem and Leaf
Display
2 144677
3 028
4 1
Histograms
20
30
40
50
Ogive
7
6
5
4
Tables
Polygons
3
2
1
0
10
20
30
40
50
60
22
60
Tabulating Numerical Data:
Frequency Distributions
• Sort Raw Data in Ascending Order
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46,
53, 58
• Find Range: 58 - 12 = 46
• Select Number of Classes: 5 (usually between 5
and 15)
• Compute Class Interval (Width): 10 (46/5 then
round up)
• Determine Class Boundaries (Limits):10, 20, 2330,
40, 50, 60
Frequency Distributions, Relative Frequency
Distributions and Percentage Distributions
Data in Ordered Array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Class
10 but under 20
20 but under 30
30 but under 40
40 but under 50
50 but under 60
Total
Relative
Frequency Frequency Percentage
3
6
5
4
2
20
.15
.30
.25
.20
.10
1
15
30
25
20
10
100
24
Graphing Numerical Data:
The Histogram
Data in Ordered Array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Frequency
Histogram
7
6
5
4
3
2
1
0
6
5
3
2
0
5
Class Boundaries
No Gaps
Between
Bars
4
0
15
25
35
45
Class Midpoints
55
More
25
Graphing Numerical Data:
The Frequency Polygon
Data in Ordered Array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Frequency
7
6
5
4
3
2
1
0
5
15
25
35
45
Class Midpoints
55
More
26
Tabulating Numerical Data:
Cumulative Frequency
Data in Ordered Array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Lower
Limit
10
20
30
40
50
60
Cumulative
Frequency
0
3
9
14
18
20
Cumulative
% Frequency
0
15
45
70
90
100
27
Graphing Numerical Data:
The Ogive (Cumulative % Polygon)
Data in Ordered Array :
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Ogive
100
80
60
40
20
0
10
20
30
40
50
60
Class Boundaries (Not Midpoints)
28
Graphing Bivariate Numerical
Data (Scatter Plot)
Total Year to
Date Return (%)
Mutual Funds Scatter Plot
40
30
20
10
0
0
10
20
30
Net Asset Values
40
29
Tabulating and Graphing
Univariate Categorical Data
Categorical Data
Tabulating Data
The Summary Table
Graphing Data
Pie Charts
Bar Charts
Pareto Diagram
30
Graphing Univariate
Categorical Data
Categorical Data
Graphing Data
Tabulating Data
The Summary Table
Pie Charts
CD
Pareto Diagram
S a vi n g s
Bar Charts
B onds
S to c k s
0
10
20
30
40
50
45
120
40
100
35
30
80
25
60
20
15
40
10
20
5
0
0
S to c k s
B onds
S a vi n g s
CD
31
Bar Chart
(for an Investor’s Portfolio)
Investor's Portfolio
Savings
CD
Bonds
Stocks
0
10
20
30
40
50
Amount in K$
32
Pie Chart
(for an Investor’s Portfolio)
Amount Invested in K$
Savings
15%
Stocks
42%
CD
14%
Bonds
29%
Percentages are
rounded to the
nearest percent
33
Pareto Diagram
Axis for
bar
chart
shows
%
invested
in each
category
45%
100%
40%
90%
80%
35%
70%
30%
60%
25%
50%
20%
40%
15%
30%
10%
20%
5%
10%
0%
0%
Stocks
Bonds
Savings
CD
Axis for line
graph
shows
cumulative
% invested
34
Download