Project 3

advertisement

Fall 2014 Project 3 BIO4835

BIOSTATISTICS

BIO 4835

Dr. Osborne

Project 3: Collection and analysis of data

Name ____________________________

Due December 15, 2014

Introduction.

The purpose of this project is to obtain and analyze bivariate data that you will collect.

The project is a team project although each member of the team is required to submit a separate report on their own data. Each member of the team individually collects a sample of 60 items. Your sample will be the heights and weights of 30 men and the heights and weights of 30 women. Each student will collect the 60 data points.

Data and analysis.

Each member of your research team records the height and weight of their 30 men and their 30 women but they keep the samples separate from the other members of the team. Each member of the team collects the 60 measurements which constitutes their sample. You will not need anyone’s data except for the ANOVA section. For your own sample of 60 heights and weights, you will prepare each of the following:--

 1. basic statistical analysis on the heights: including the sample mean, median, mode, standard deviation, range, quartiles, skewness, kurtosis. One set for the men, the other for the women

 2. histogram of the heights of each set, men and women

 3. frequency polygon of the heights of each set, men and women

 4. box plot of the heights of each set, men and women

5. x-y scatter plot based on heights and weights

 6. confidence intervals for population parameters based on the sample statistics of each set of heights

7. hypothesis test for population mean of the heights of each set, men and women

8. ANOVA on the multiple samples to find results about the population of the heights. This is the only place where you need the heights from the other members of your group of each set, men and women

9. regression and correlation analysis including calculations of the regression equation based on the heights and weights taken as (x-y) pairs, correlation coefficient (r), coefficient of determination (r 2 ) of each set, men and women

Report.

Each member of the group writes a separate report on their own data using the following template. This will include the following parts.

 an abstract which summarizes the results of the report

 a description of the methods used to obtain the data and to perform the analysis

 results of the analysis on the individual data sample

 conclusions based on the results of the project

The report must be written correctly using proper standard format for papers in the Biological Sciences.

Length is not a factor. Brevity is generally stressed, especially in situations where you have to pay by the page to get it into print. However, all of the required calculations, tables, graphs and results have to be in the paper and must be explained well enough to make them understandable. Write answers using

Microsoft Word. Graphs can be done on the graph papers provided, or you can do them using Microsoft

Excel and insert them into the document in place of the graph papers.

1

Fall 2014 Project 3 BIO4835

A. Abstract: Write an abstract of the report in this space.

B. Methods Describe how your data were collected and analyzed in the space below. List your data in the tables on the next two pages.

2

Fall 2014 Project 3 BIO4835

C. Results

1. (a) Data and Basic Statistical Analysis: Data from 30 Men

Data Table: Heights and Weights of 30 Men

Person

Number

1

Height (x)

(in)

Weight (y)

(lbs)

Person

Number

16

Height (x)

(in)

Weight (y)

(lbs)

2

3

4

5

17

18

19

20

6

7

8

9

10

11

12

13

14

21

22

23

24

25

26

27

28

29

15 30

Basic Statistical Analysis: Data from 30 Men

Using your values for heights from the data table above, find the following basis statistics for your sample. Use Microsoft Excel. Either enter the data in the table below, or erase the table and replace it with the table provided by Microsoft Excel.

Sample mean Quartiles ----

Median First

Mode

Standard Deviation

Range

Min

Max

Count

Skewness

Second

Third

Fourth

Kurtosis

3

Fall 2014 Project 3 BIO4835

1. (b) Data and Basic Statistical Analysis: Data from 30 Women

Data Table: Heights and Weights of 30 Women

Person

Number

1

2

Height (x)

(in)

Weight (y)

(lbs)

Person

Number

16

17

Height (x)

(in)

Weight (y)

(lbs)

3

4

5

6

7

8

9

10

18

19

20

21

22

23

24

25

11

12

13

14

15

26

27

28

29

30

Basic Statistical Analysis: Data from 30 Women

Using your values for heights from the data table above, find the following basis statistics for your sample. Use Microsoft Excel. Either enter the data in the table below, or erase it and replace it with the table provided by Microsoft Excel.

Sample mean Quartiles ----

Median First

Mode Second

Standard Deviation Third

Fourth Range

Min

Max

Count

Skewness

Kurtosis

4

Fall 2014 Project 3 BIO4835

2. (a) Histogram of the Heights of 30 Men

In the space below, draw a histogram of the heights based on your data sample. Use class intervals (bins) of two inches (58-59, 60-61, 62-63, etc.) as needed. Use the graph provided or create it using Microsoft Excel and paste it where the graph is.

2. (b) Histogram of the Heights of 30 Women

In the space below, draw a histogram of the heights based on your data sample. Use class intervals (bins) of two inches (58-59, 60-61, 62-63, etc.) as needed. Use the graph provided or create it using Microsoft Excel and paste it where the graph is.

5

Fall 2014 Project 3 BIO4835

3. (a) Frequency Polygon of the Heights of 30 Men

In the space below, draw a frequency polygon of the heights based on your data sample. Use class intervals (bins) of two inches (58-59, 60-61, 62-63, etc.) as needed. Use the graph provided or create it using Microsoft Excel and paste it where the graph is.

3. (b) Frequency Polygon of the Heights of 30 Women

In the space below, draw a frequency polygon of the heights based on your data sample. Use class intervals (bins) of two inches (58-59, 60-61, 62-63, etc.) as needed. Use the graph provided or create it using Microsoft Excel and paste it where the graph is.

6

Fall 2014 Project 3

4. (a) Box Plot of the Heights of 30 Men

In the space below, draw a box plot of the heights.

BIO4835

4. (b) Box Plot of the Heights of 30 Women

In the space below, draw a box plot of the heights.

7

Fall 2014 Project 3 BIO4835

5. (a) x-y Scatter Plot of the Heights of 30 Men

In the space below, draw an x-y scatter plot of the weights against the heights. Use the graph provided or create it using Microsoft Excel and past it where the graph is.

5. (b) x-y Scatter Plot of the Heights of 30 Women

In the space below, draw an x-y scatter plot of the weights against the heights. Use the graph provided or create it using Microsoft Excel and past it where the graph is.

8

Fall 2014 Project 3 BIO4835

6. (a) Confidence Interval for Mean Height of 30 Men

Assume that the population mean height for males in the United States is 70 inches. Create a confidence interval for your sample of men with

= .05.

6. (b) Confidence Interval for Mean Height of 30 Women

Assume that the population mean height for females in the United States is 65 inches. Create a confidence interval for your sample of women with

= .05.

9

Fall 2014 Project 3 BIO4835

7. (a) Hypothesis Test of the Mean Height of 30 Men

The population mean height for males in the United States is 70 inches. Can you conclude that the mean height of your sample of men is not 70 inches?

7. (b) Hypothesis Test of the Mean Height of 30 Women

The population mean height for females in the United States is 65 inches. Can you conclude that the mean height of your sample of women is not 65 inches?

10

Fall 2014 Project 3 BIO4835

8. Regression and Correlation Analysis

The methodology of these calculations begins with a table of values in the form of (x,y) pairs. The example in lecture dealt with breathing data from goldfish where x was the temperature of the water and y was the breathing rate. The calculations are done in four steps.

1. .First, the data were placed into two lists of the TI-83 calculator. The x values were in L1 and their corresponding y values were in L2.

2. From TI-83 calculations, this table containing values for calculations was prepared.

3. The slope (b) was calculated using appropriate data from the table. This regression equation will be in the form: y = a + bx.

4. The y-intercept (a) was calculated.

Result: Regression equation was: y = 4.54x – 1.57

11

Fall 2014 Project 3 BIO4835

(a) Regression and Correlation Analysis of Data for Men.

We begin with your tables of heights and weights back in Part C, Section 1. Note that the data are in the form of (x,y) pairs as (height, weight) for each person sampled. You have one table of

(x,y) pairs for men and another table of (x,y) pairs for women.

Begin by using the TI-83. Place your list of Heights into list L1. Then place their corresponding

Weights into list L2. With the two lists, you can complete the data table below for the terms needed for the calculations.

Data item for calculation Symbol From TI-83 Calc Value for men

Mean of x values

Sum of x values

Sum of x values squared

 x

(

 x)

2

 x

2

1-Var Stats L1

1-Var Stats L1

Square

 x value

Sum of squared x values

Mean of y values

Sum of y values

Sum of y values squared

1-Var Stats L1

1-Var Stats L2

Sum of squared y values

Number of ordered pairs

Sum of xy products

 y

(

 y)

2

 y 2

 n xy

1-Var Stats L2

Square

 y value

1-Var Stats L2

1-Var Stats L2

L1*l2

L3

1-Var Stats L3

Regression Equation Calculation

Using the terms in the table, calculate the regression equation for your data of men.

Calculate Correlation Coefficient and Coefficient of Correlation

Calculate the correlation coefficient and coefficient of correlation for your data of men.

12

Fall 2014 Project 3 BIO4835

(a) Regression and Correlation Analysis of Data for Women.

We begin with your tables of heights and weights back in Part C, Section 1. Note that the data are in the form of (x,y) pairs as (height, weight) for each person sampled. You have one table of

(x,y) pairs for men and another table of (x,y) pairs for women.

Begin by using the TI-83. Place your list of Heights into list L1. Then place their corresponding

Weights into list L2. With the two lists, you can complete the data table below for the terms needed for the calculations.

Data item for calculation Symbol From TI-83 Calc Value for women

Mean of x values

Sum of x values

Sum of x values squared

 x

(

 x)

2

 x

2

1-Var Stats L1

1-Var Stats L1

Square

 x value

Sum of squared x values

Mean of y values

Sum of y values

Sum of y values squared

1-Var Stats L1

1-Var Stats L2

Sum of squared y values

Number of ordered pairs

Sum of xy products

 y

(

 y)

2

 y 2

 n xy

1-Var Stats L2

Square

 y value

1-Var Stats L2

1-Var Stats L2

L1*l2

L3

1-Var Stats L3

Regression Equation Calculation

Using the terms in the table, calculate the regression equation for your data of women.

Calculate Correlation Coefficient and Coefficient of Correlation

Calculate the correlation coefficient and coefficient of correlation for your data of women.

13

Fall 2014 Project 3 BIO4835

9; ANOVA

(a) ANOVA for Data of Men.

Complete the ANOVA Data Table using data of the heights from your research team members.

Write your name on line one and each other student’s name on the lines in the subsequent column headings.

Data Table for men.

1. ________ 2. ________ 3. ________ 4. ________ 5. ________ 6. ________

 x

 x 2

N

Determine values for degrees of freedom and write them in the df column of the ANOVA table.

A. Calculate the Correction Factor. Write it here. __________________________

Calculate the necessary values required for ANOVA and write each result in its correct cell of the ANOVA Table.

B. Calculate Sum of Squares Total

C. Calculate Sum of Squares Group

14

Fall 2014 Project 3

D. Calculate Sum of Squares Error

E. Calculate Mean Squares Group

F. Calculate Mean Squares Error

G. Calculate Variance Ratio

ANOVA Table. Calculations for men.

Source Df

Total

Group

Error

SS

15

MS V.R.

BIO4835

Fall 2014 Project 3 BIO4835

9; ANOVA

(b) ANOVA for Data of women.

Complete the ANOVA Data Table using data of the heights from your research team members.

Write your name on line one and each other student’s name on the lines in the subsequent column headings.

Data Table for women.

1. ________ 2. ________ 3. ________ 4. ________ 5. ________ 6. ________

 x

 x 2

N

Determine values for degrees of freedom and write them in the df column of the ANOVA table.

A. Calculate the Correction Factor. Write it here. __________________________

Calculate the necessary values required for ANOVA and write each result in its correct cell of the ANOVA Table.

B. Calculate Sum of Squares Total

C. Calculate Sum of Squares Group

16

Fall 2014

D. Calculate Sum of Squares Error

E. Calculate Mean Squares Group

F. Calculate Mean Squares Error

G. Calculate Variance Ratio

ANOVA Table for women.

Source Df

Total

Group

Error

Project 3

SS

17

MS V.R.

BIO4835

Fall 2014 Project 3 BIO4835

D. Write conclusions based on the result of the project.

1. Describe and compare the histograms, frequency polygons and box plots for the data for the heights of men and women.

2. Describe and compare the scatter plots for the heights and weights of the data of men and women.

3. Describe and compare the results of the confidence intervals and hypothesis tests for the data of men and women.

4. Describe and compare the regression and correlation analysis for the data of men and women.

5. Describe and compare the ANOVA calculations for the data of men and women.

18

Download