Uploaded by Weiran Hao

2.3 Exercise solutions

advertisement
FIN10002 Financial Statistics
eText solutions: Data collection
Note these solutions were taken from the textbook solutions manual.
3.2. a. The sample mean is computed using the following steps: Step
1: Collect the sample data.
The sample data are:
Step 2: Add the values in the population:
å x = 3 + 0 + 2 + .... + 4 + 0 = 58
Step 3: Divide the sum by the population size.
x 58
µ = å = = 2.42
N
24
The mean number of defects for this sample of 24 employees is 2.42.
b. The median is computed using the following steps:
Step 1: Collect the population data.
Step 2: Sort the data from smallest to largest, forming a data array.
Step 3: Find the median index.
The location index for the median is determined using the following equation:
i=
1
( n)
2
For n = 24, the index is:
1
1
i = (24) = 12
2
Step 4. Find the median
Because the index is 12 which is an integer, the median is the average of the 12th and 13th data
values going from either end. These two values are 2 and 3. This the median is:
Md =
2+3
= 2.5
2
Half the data values fall below 2.5 and half the data values fall above 2.5.
c. To determine if there is a mode and what the value of the mode is, we use the following
steps:
Step 1: Collect the population data.
See parts a. or b.
Step 2: Organize the data into a frequency distribution.
Mode = 3
3 occurs 6 times
Step 3: Determine the value that occurs most frequently.
The value 3 occurs 6 times which is the most of any value in the sample.
3.3.
Step 1: Sort the data from low to high.
11.5
14.4
12.8
14.4
13.1
14.7
13.2
15.1
13.5
15.5
13.6
15.9
13.8
16.2
14.2
16.3
14.2
17.1
14.3
18.7
Step 2: Determine the quartile location index and find the quartile value.
To determine the location index for the 1st quartile (p=25) we do the following:
i=
p
25
( n) =
(20) = 5
100
100
2
Since the index, 5, is an integer, the 1st quartile is determined by finding the average of the 5th and
6th values from the lower end of the sorted data. This is:
Q1 =
13.5 + 13.6
= 13.55
2
The location index for the 3rd quartile is:
i=
p
75
( n) =
(20) = 15
100
100
Since the index, 15, is an integer, the 3rd quartile is determined by finding the average of the 15th and
16th values from the lower end of the sorted data. This is:
Q3 =
15.5 + 15.9
= 15.7
2
Thus, the 1st quartile value is 13.55 meaning that 25 percent of the data values fall below 13.55. The
third quartile is 15.7 meaning that 75% of the data values fall below 15.7.
3.7.
46
73
Sorted data:
48
73
55
78
58
80
59
81
65
82
66
86
67
88
69
91
70
95
1
(20) = 10; the median is the average of the 10th and 11th values in
2
70 + 73
the data set. So median =
= 71.5
2
25
b. 25th percentile (Q1) position =
(20) = 5; Q1 is the average of the 5th and 6th values
100
59 + 65
= 62
in the data set. So Q1 =
2
75
(20) = 15; Q3 is the average of the 15th and 16th
75th percentile (Q3) position =
100
81 + 82
= 81.5
values in the data set. So Q3 =
2
60
60th percentile position =
(20) = 12; the 60th percentile is the average of the 12th
100
73 + 78
and 13th values in the data set. So the 60th percentile =
= 75.5
2
a. Median position =
3
3.11
a. The weighted mean is computed using:
xW =
åw x
åw
i
i
=
i
(7,400)(123) + (14,400)(402) + (12,300)(256) + (6,200)(109) + (3,100)(67)
= 11,213.48
123 + 402 + 256 + 109 + 67
b. It is reasonable that plants with more employees would have more medical issues. The
weighted average takes into account the number of employees and is a more reasonable
measure of the average payments than would be an unweighted average that treats all plants as
equals.
3.16
a. Average =
Sxi 4143.6
=
= 165.744
n
25
b. The ranked data are
112.6
123.9
131.0
134.0
141.9
145.4
155.2
156.5
159.3
161.3
161.9
162.7
164.9
165.8
168.3
171.2
173.1
177.4
178.8
182.0
185.8
192.0
211.1
213.1
214.4
The median’s index = (p/100)n = (50/100)25 = 12.5. The rule is to round this up to the 13th
observation = 164.9. This indicates that the mean is larger than the median. The data’s
distribution is slightly right-skewed.
c. The first quartile’s index = (25/100)25 = 6.25. The rule is to round this up to the 7th observation =
155.2.
The third quartile’s index = (75/100)25 = 18.75. The rule is to round this up to the 19th observation =
178.8.
4
For a Boxplot:
Lower limit = Q1 – 1.5(Q3 – Q1) = 155.2 - 1.5(178.8 – 155.2) = 119.8
Upper limit is Q3 + 1.5(Q3 – Q1) = 178.8 + 1.5(178.8 – 155.2) = 214.2.
There are no outliers shown in the plot although there are two, 214.4 and 112.6 in the data.
5
Download