Final solution - University of New Brunswick

advertisement
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
FACULTY OF ADMINISTRATION
UNIVERSITY OF NEW BRUNSWICK
ADM2623: Business Statistics
Final Examination
Instructions:
1. Please justify your answers by providing the methods and/or the procedures for your final
results. You will not get any credit for a single answer without proper explanation.
2. The exam is closed book and notes.
3. Formulas and tables are provided along with the exam papers at the end of this booklet.
4. Scratch papers will be supplied upon request.
5. There are 13 questions in total.
6. The total points are 35.
Name:_________________________________ID_______________________________
Question 1 (5/35 points) Short explanation (be brief and long explanations may reduce your
mark):
(a) (1 point) Explain the Central Limit Theorem
ANS: The sample mean as a random variable under SRR approximately follows a
normal distribution when the sample size is sufficiently large.
(b) (1 point) Explain the Empirical Rule.
ANS:
68% within the one standard deviation away from its mean
89% within the two standard deviations away from its mean
99.7% within the three standard deviations away from its mean
(c) (1 point) Explain the Chebychev’s Theorem.
𝟏
ANS: The interval [𝝁 − 𝒌𝝈, 𝝁 + 𝒌𝝈 ] contains at least 𝟏 − 𝒌𝟐 percentage of the
entire data.
(d) (1 point) What is a random experiment?
ANS: A random experiment is a process satisfying the following properties:
1.
2.
3.
4.
the process can be repeated as many trials as you want
the outcome of any trial is uncertain
well-defined set of possible outcomes
each outcome has a probability associated with it
Page 1 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
(e) (1 point) What is inferential statistics?
ANS: estimating attributes and drawing inference from a sample about the entire
population
Question 2 (5/35 points) Short calculations:
(a) (1 point) Suppose you invested in a stock with annual return of 7%. How many years
will it take to double your wealth (round your final answer up to the closest
integer)?
ANS: Suppose you invested $1 at the beginning, and after n years your wealth will
be $2 (doubled). 2=1(1+0.07)^n. So n=log2/log(1.07)= 10.24477. So approximately
10 years.
(b) (1 point) For the following grouped data as a sample, calculate the standard deviation
(keep two digits after the decimal):
Class
[-2,4)
[4,10)
[10,16)
[16,22)
[22,28]
Frequency
(f)
3
2
5
10
1
21
Midpoint
(x)
1
7
13
19
25
fx
fx^2
3
14
65
190
25
297
3
98
845
3610
625
5181
ANS: Var=(5181-297^2/21)/(21-1)= 49.02857. So SD=7.00.
(c) (1 point) In a math class of 30 students, 17 are boys and 13 are girls. On a unit test, 4
boys and 5 girls made an A grade. If a student is chosen at random from the class, what is
the probability of choosing a girl or an A student?
ANS:
P(girl or A) = P(girl) + P(A) - P(girl and A)
=
13
30
=
17
30
+
9
30
5
30
(d) (1 point) Find z such that 𝑃(𝑍 > 𝑧)=0.07 by using the normal table at the end of this
exam paper.
Page 2 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
ANS: 𝑷(𝒁 > 𝒛) = 𝟎. 𝟎𝟕 is equivalent to 𝑷(𝟎 < 𝒁 < 𝒛) = 𝟎. 𝟒𝟑 . Form the normal
table, we have z=1.48
(e) (1 point) X is a normally distributed random variable with E(X) = -10 and SD(X) = 2.
Find a number x such that P(-x-20<X<x ) = 0.84.
ANS: P(-x<X<x ) = 0.84 is equivalent to
−𝒙−𝟏𝟎
𝑷(
𝟐
<
𝑿−𝝁
𝝈
<
𝒙+𝟏𝟎
𝟐
) = 𝟎. 𝟖𝟒.
Let
z=
𝒙+𝟏𝟎
𝝈
.
Then
𝑷(−𝒛 < 𝒁 < 𝒛) = 𝟎. 𝟖𝟒.
Then
𝑷(𝟎 < 𝒁 < 𝒛) = 𝟎. 𝟒𝟐
From the normal table, we have z=1.41.
So
x = -10 + 1.41*2 = -7.18
Question 3 (5/35 points) Thirty AA batteries were tested to determine how long they would
last. The results, to the nearest minute, were recorded as follows:
423, 369, 387, 411, 393, 394, 371, 377, 389, 409,
392, 408, 431, 401, 363, 391, 405, 382, 400, 381,
399, 415, 428, 422, 396, 372, 410, 419, 386, 390
Organize the above data into a frequency distribution table
(a) (1 point). Calculate the number classes:
ANS: 5
(b) (1 point). Calculate the class width:
ANS: 14
(c) (1 point). Calculate the lower limit of the first class
ANS: 362
(d) (1point) Fill up the following frequency distribution table using the information you
obtained from (a), (b) and (c).
class
frequency
[362,376)
[376,390)
4
6
Page 3 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
[390,404)
[404,418)
[418,432]
9
6
5
(e) (1 point) Draw a cumulative frequency polygon (ogive).
Question 4 (3/35 points) Suppose you are a hospital manager, and you are considering the use
of a new method to diagnose a rare form of bowel syndrome. You know that only 0.1% of the
population suffers from that disease. You also know that if a person has the disease, the test has
99% of chance of turning out positive. If the person doesn’t have the disease, the test has a
98% chance of turning negative. How feasible is this diagnostics method? That is, given that a
test turned out positive, what are the chances of the person really having the disease?
ANS: Let’s say that event DIS is having the disease, and event POS is getting a positive
test. To solve the problem we want to find P(DIS|POS). The description of the problem
tells us that:
P(DIS) = 0.001 (population with the disease)
P(DIS') = 0.999 (population without the disease, so complement of event DIS)
P(POS|DIS) = 0.99 (positive test given patient has disease)
P(POS|DIS') = 0.01 (positive test given patient doesn't have disease)
P(POS'|DIS) = 0.02 (negative test given patient has disease)
P(POS'|DIS') = 0.98 (negative test given patient doesn't have the disease)
Bayes’ theorem is the following:
Page 4 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
P(DIS|POS) = P(POS|DIS) P(DIS) / P(POS)
Note that P(POS) is equal to
P(POS) = P(DIS)P(POS|DIS) + P(DIS')P(POS|DIS')
P(POS) = 0.001 * 0.99 + 0.999 * 0.01
P(POS) = 0.01098
Now we can apply Bayes’ theorem:
P(DIS|POS) = 0.99 * 0.001 / 0.01098 = 0.09016393
In other words, given the test turned out to be positive, the person only has a chance of
4.7% of actually having the disease. Clearly this is not a feasible method for diagnosing
the rare disease.
Question 5 (2/35 points) Every day a stock trader either makes 50% with probability 0.6 or
loses 50% with probability 0.4. Suppose this trader traded 20 days.
(a) (1 point) How many days can he/she expect to make 50%?
ANS: Let X be the number of days this trader makes 50% over the 20 days. Then
X ~ B(20, 0.6)
So E[X]=20(0.6)=12
(b) (1 point) What is the probability that he/she will make 50% for at least seven days?
ANS:
P(X>=7) =1-P(X<=6)
=1-P(X=0)-P(X=1)-P(X=2) -P(X=3)-P(X=4)-P(X=5)-P(X=6)
=1-0.001-0.005=0.994
Question 6 (5/35 points) Given a joint probability as follows:
X
Y
-3
4
10
-5
-3
-2
0.50
0.02
0.10
0.62
0.05
0.12
0.06
0.23
0.10
0.02
0.03
0.15
Page 5 of 12
0.65
0.16
0.19
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
(a) (2 points) Find the marginal distributions of X and Y respectively
ANS:
X
-5
-3
-2
P(X)
0.62
0.23
0.15
XP(X)
-3.1
-0.69
-0.3
Y
P(Y)
YP(Y)
-3
0.65
-1.95
4
0.16
0.64
10
0.19
1.9
(b) ( 1point) Compute the expectations of X and Y, respectively.
ANS: E[X]=-4.09 and E[Y]=0.59
(c) (1 mark) Are X and Y independent or not, justify your answer.
ANS: No because P(X=-5, Y=-3)=0.5 which is not equal to
P(X=-5)*P(Y=-3)=0.62*0.65=0.403
(d) (1 mark) Compute the expectation E [2+3X-4Y].
ANS: E [2+3X-4Y]=2+3E[X]-4E[Y]=2+3*(-4.09)-4*(0.59)=-12.63
Question 7 (4/35 points) A research firm conducted a survey to determine the mean amount
steady smokers spend on cigarettes during a week. A sample of 49 steady smokers revealed
that sample mean of $20 with a sample standard deviation of $5.
(i) (2 points) Obtain a 97% confidence interval estimate for mean amount steady smokers
spend on cigarettes during a week.
ANS: 𝟏 − 𝜶 = 𝟎. 𝟗𝟕 so 𝜶 = 𝟎. 𝟎𝟑 and 𝒛𝜶/𝟐 = 𝒛𝟎.𝟎𝟑/𝟐 = 𝒛𝟎.𝟎𝟏𝟓 = 𝟐. 𝟏𝟕
So the 97% CI is
[20-2.17*5/7, 20+2.17*5/7]=[18.45, 21.55]
(ii) (2 points) If it is desired to obtain a 97% confidence interval estimate for the mean amount
steady smokers spend on cigarettes during a week with a margin of error as $0.5, what
should be the sample size?
𝟓
𝟐
ANS: 𝒏 ≥ (𝟐. 𝟏𝟕 ∗ 𝟎.𝟓) = 𝟒𝟕𝟎. 𝟖𝟗. So n is at least471.
Page 6 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
Question 8 (4/35 points) Your investment advisor proposes you a monthly income investment
scheme which promises a variable return each month. You will invest in it only if you are
assured of exactly an average of a $180 monthly income. Your advisor also tells you that for
the past 300 months, the scheme had returns with an average value of $190 and standard
deviation of $75.
(a) (2 points) From this sample data, can we conclude, at 0.02 significance level, that the
you want to invest in this scheme?
ANS:
Step 1. The hypothesis is
𝐻0 : 𝜇 = 180
𝐻1 : 𝜇 ≠ 180
Step 2. 𝛼 = 0.02 so 𝑧𝛼 = 𝑧0.01 = 2.33
2
Step 3. Decision rule: reject the null hypothesis whenever |𝑧| ≥ 𝑧𝛼/2
190−180
Step 4. |𝑧| =
= 2.309401 < 2.33
75/√300
Step 5. So there is no enough evidence to reject the null hypothesis and hence you want to
invest in this scheme.
(b) (2 points) Suppose that you will invest in it only if you are assured of at least an
average of a $180 monthly income. From this sample data, can we conclude, at 0.02
significance level, that the you want to invest in this scheme?
ANS: This becomes an one-tailed test as follows:
Step 1.
𝐻0 : 𝜇 ≥ 180
𝐻1 : 𝜇 < 180
Step 2. 𝛼 = 0.02 so 𝑧𝛼 = 𝑧0.02 = 2.05
Step 3. Decision rule: reject the null hypothesis whenever |𝑧| ≥ 𝑧𝛼
190−180
Step 4. |𝑧| = 75/√300 = 2.309401 > 2.05
Step 5. So there is enough evidence to reject the null hypothesis and hence you do not want
to invest in this scheme.
Question 9 (2/35 points) Suppose you have a population of 4 persons with ages
18, 20, 25, 26
Obtain the sampling distribution of the sample mean age when a sample of two persons is
chosen using SRR.
ANS: The samples and their means are
Samples
Sample mean
18,18
18
18,20
19
20,18
19
18,25
21.5
Page 7 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
25,18
18,26
26,18
20,20
20, 25
25, 20
20,26
26,20
25,25
25,26
26,25
26,26
21.5
22
22
20
22.5
22.5
23
23
25
25.5
25.5
26
So the sample distribution of the sample mean is given as follows
Sample mean
18
19
20
21.5
22
22.5
23
25
25.5
26
Prob.
1/16
2/16
1/16
2/16
2/16
2/16
2/16
1/16
2/16
1/16
Page 8 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
Page 9 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
Page 10 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
Page 11 of 12
Time: 9:00am-12:00pm
Date: April 11, 2015
Place: CC (Performance Gym A&B 25-28)
Page 12 of 12
Download