Uploaded by Thành An Lâm

Exercises 3

advertisement
Exercises
Which type of data (categorical, discrete
numerical, continuous numerical) is each of the
following variables?
a. Age of a randomly chosen tennis player in the
Wimbledon tennis tournament.
b. Nationality of a randomly chosen tennis player in
the Wimbledon tennis tournament.
c. Number of double faults in a randomly chosen
tennis game at Wimbledon.
Which measurement level (nominal, ordinal,
interval, ratio) is each of the following variables?
a. A customer’s ranking of fi ve new hybrid
vehicles.
b. Noise level 100 meters from the Dan Ryan
Expressway at a randomly chosen moment.
c. Number of occupants in a randomly chosen
commuter vehicle on the San Diego Freeway.
There are 327 official ports of entry in the United
States. The Department of Homeland Security
selects 15 ports of entry at random to be audited
for compliance with screening procedures of
incoming travelers through the primary and
secondary vehicle and pedestrian lanes. What
kind of sample is this (simple random, systematic,
stratifi ed, cluster)?
The durations (minutes) of 26 electric power
outages in the community of Sonando Heights
over the past five years are shown below.
(a) Duration Prepare a stem-and-leaf plot.
(b) Prepare a dot plot.
(c) Prepare a frequency distribution and
histogram.
(d) Describe the distribution, based on these
displays.
Data file: Duration
Revenue and Net Income (millions) for 27 Randomly
Chosen Fortune 1000 Companies.
(a) Use Excel to make a scatter plot, placing Revenue
on the X-axis and Net Income on the Y-axis. Add
titles and modify the default colors, fonts, etc., as
you judge appropriate to make the scatter plot
effective.
(b) Describe the relationship (if any) between X and Y.
Weak? Strong? Negative? Positive? Linear?
Nonlinear?
Data file: RevenueIncome
The table below shows average daily sales of Rice
Krispies in the month of June in 74 Noodles &
Company restaurants.
(a) Make a histogram for the data.
(b) Would you say the distribution is skewed?
(c) Calculate the mean and standard deviation.
(d) Are there any outliers?
Data file: RiceKrispies
Analysis of annualized returns over a 10-year
period showed that prepaid tuition plans had a
mean return of 6.3 percent with a standard
deviation of 2.7 percent, while the Standard &
Poor’s 500 stock index had a mean return of 12.9
percent with a standard deviation of 15.8 percent.
(a) Calculate and compare the coefficients of
variation.
(b) Why would we use a coefficient of variation?
Why not just compare the standard deviations?
Analysis of annualized returns over a 10-year
period showed that prepaid tuition plans had a
mean return of 6.3 percent with a standard
deviation of 2.7 percent, while the Standard &
Poor’s 500 stock index had a mean return of 12.9
percent with a standard deviation of 15.8 percent.
(a) Calculate and compare the coefficients of
variation.
(b) Why would we use a coefficient of variation?
Why not just compare the standard deviations?
The contingency table below shows the results of a survey of
online video viewing by age. Find the following probabilities or
percentages:
a. Probability that a viewer is aged 18–34.
b. Probability that a viewer prefers watching TV videos.
c. Percentage of viewers who are 18–34 and prefer watching
user-created videos.
d. Percentage of viewers aged 18–34 who prefer watching
user-created videos.
e. Percentage of viewers who are 35–54 or prefer user
created-videos.
In the last 50 years, the average number of deaths
due to alligators in Florida is 0.3 death per year.
Assuming no change in this average, in a given
year find the probability of
(a) no alligator deaths;
(b) (b) at least 2 alligator deaths.
The diameter of bushings turned out by a
manufacturing process is a normally distributed
random variable with a mean of 4.035 mm and a
standard deviation of 0.005 mm. A sample of 25
bushings is taken once an hour.
(a) Within what interval should 95 percent of the
b ushing diameters fall?
(b) Within what interval should 95 percent of the
sample means fall?
(c) What conclusion would you reach if you saw a
sample mean of 4.020? A sample mean of
4.055?
A poll of 125 college students who watch The Big
Bang Theory showed that 83 of them usually
watch on a mobile device (e.g., laptop).
(a) Assuming that this was a random sample,
construct a 90 percent confidence interval for
the proportion of all college students who
usually watch this show on a mobile device.
(b) Would a finite population correction be
required? Explain.
To encourage telephone efficiency, a catalog call
center issues a guideline that at least half of all
telephone orders should be completed within 2
minutes. Subsequently, a random sample of 64
telephone calls showed that only 24 calls lasted 2
minutes or less.
(a) At α 5 .05 is this a significant departure from
the guideline in a left-tailed test? State your
hypotheses and decision rule.
(b) Find the p-value.
An experimental surgical procedure is being studied
as an alternative to the old method. Both methods
are considered safe. Five surgeons perform the
operation on two patients matched by age, sex, and
other relevant factors, with the results shown. The
time to complete the surgery (in minutes) is recorded.
(a) At the 5 percent significance level, is the new way
faster? State your hypotheses and show all steps
clearly.
(b) Is the decision close?
Data file: Surgery
In 2006, a sample of 200 in-store shoppers
showed that 42 paid by debit card. In 2009, a
sample of the same size showed that 62 paid by
debit card.
(a) Formulate appropriate hypotheses to test
whether the percentage of debit card
shoppers increased.
(b) Carry out the test at α 5 .01.
(c) Find the p-value.
The XYZ Corporation is interested in possible
differences in days worked by salaried employees
in three departments in the financial area. A
survey of 23 randomly chosen employees reveals
the data shown below. Because of the casual
sampling methodology in this survey, the sample
sizes are unequal.
Research question: Are the mean annual
attendance rates the same for employees in these
three departments?
Data file: DaysWorked
Several friends go bowling several times per month. They keep
track of their scores over several months. An ANOVA was
performed.
(a) What kind of ANOVA is this (one-factor, two-factor, etc.)?
(b) How many friends were there? How many months were
observed? How many observations per bowler per month?
Explain how you know.
(c) At α 5 .01, what are your conclusions about bowling scores?
Explain, referring either to the F tests or p-values.
In the following regression, X = weekly pay, Y = income
tax withheld, and n = 35 McDonald’s employees.
(a) Write the fitted regression equation.
(b) State the degrees of freedom for a two-tailed test
for zero slope, and use Appendix D to find the
critical value at α 5 .05.
(c) What is your conclusion about the slope?
(d) Interpret the 95 percent confidence limits for the
slope.
(e) Verify that F = t2 for the slope.
(f) In your own words, describe the fit of this
regression.
Download