SOLUTIONS Question 1. (i) The black offspring of an Aa x Aa mating is either AA with probability 0.25/(0.25+0.5) = 1/3, or Aa with probability 2/3. We can take these as prior probabilities. If the black bull is AA, the AA x aa mating produces Aa offspring, all black. If he is Aa, the offspring are Aa with probability ½ or aa with probability ½. prior lik prior x lik posterior AA 1/3 1 1/3 2/3 Aa 2/3 1/4 1/6 1/3 The probability that the bull is a carrier is 1/3. As soon as the bull produces a red offspring, we can eliminate the possibility that his genotype is AA, and the probability that he is a carrier is 1. (ii) Hamish and Dougal got their conditional probabilities the wrong way round. We need to calculate the probability of a pass, given nationality (0.67 if Scottish, 0.68 if English). A chi-squared test confirms that this is not a significant difference. Chi-square = 4(1/26 + 1/13 + 1/214 + 1/107) = 0.52 on 1 d.f. (iii) Of 12 changes, only one is negative. A two-sided sign test gives a p-value of 2*[1+12](1/2)^12 or 13/2048 (p < 0.01). The experiment demonstrates what is obvious, that pulse rate has a strong tendency to increase after exercise. The population median m lies between the 3rd and 10th order statistics if and only if the number of observations less than m (N) lies between 3 and 9 inclusive. Since N has a binomial distribution with n = 12 and p = ½, the confidence level is 1 – 2*(1/2)^12 [ 1 + 12 + 66] = 1 – 79/2048 = 0.961. Student 10 had a very high pulse rate before exercise. Perhaps he was late and had to rush to get to the start of the experiment on time. Question 2. (i) In standardized units, we require the probability of a value between –1 and +1 and the probability of a value less than –1.5. From tables of the normal distribution, the first of these is 2*(0.8413 – 0.5) = 0.6826 and the second is 1 – 0.9332 = 0.0668. (ii) The quartiles divide the distribution into 4 equal parts by probability. The second quartile (median) divides it into two equal parts. From normal tables, we find that q = 0.6745. The magnitude of Z is greater than q if either Z > q or Z < –q and the probability that the magnitude of Z is greater than q is pr(Z > q) + pr(Z < q) = ¼ + ¼ = ½. (iii) T, the time to the first event in a Poisson process has an exponential distribution, which is such that pr(T > t) = exp( – r t), where r is the rate of the process (events per unit time). If r = 12 events per minute, and t = 10/60 minutes, the required probability is exp( –2) or 0.13. Question 3 Part i Analysis of Variance Table Source Regression Residual Error Total DF 2 27 29 SS 362.6 596.10 958.70 MS 181.30 22.08 F 8.21 P 0.002 H0: group1=group2=group3 HA: not all equal The assumptions are that the data are independent, normally distributed and the group variances are the same F=8.21 F2,27; α=0.05=3.353 As the calculated F is greater than the critical value of 3.353 then we can reject the H0 of no difference between groups and conclude that there is at least one group mean that is different. Is there evidence that students who attended both revision sessions do better than those who attended none? Group 3 attended 2 revision sessions. Mean grade = 80.0% Group 1 attended no revision sessions. Mean grade = 71.5 1 1 𝑡 = (𝑥3 − 𝑥1 )/𝑠𝑝(√𝑛3 + 𝑛1) where sp=√𝑀𝑆𝐸 = √22.08 = 4.698936 t = 4.044868 t0.025,27 = 2.052 The calculated t is 4.70 which is greater than the critical value of 2.052 so there is evidence that the group 3 which attended 2 revision sessions did better than the group 1 that attended no sessions. Question 3 Part ii The slope=9.790. This means that we would predict the waiting time to increase by 9.790 min for every additional 1 min increase in duration. The t statistic = 9.970/1.30=7.669231 t0.025,19 = 2.093 7.67 is > 2.093 therefore there is evidence that there a significant relationship between waiting time and duration. The coefficient of determination or r2 = 74.9% (r2 = sum of squares regression / total sum of squares = 2130.6/2844.3=0.749). This means that 74.9% of the variation in waiting time is explained by duration. 95% prediction interval if duration is 3 min: If the duration of the last eruption was 3 min our equation would estimate that the next eruption would occur in 60.8min with 95% prediction interval of 47.24min – 73.53 min. If a tourist can only wait 1 hour then this interval may be too wide