Spot the Analysis ~ Covering “Classic” Inferential Procedures Example 1: I wish to examine the effect of a certain drug injected into red worms on such outcome variables as protein and lipid levels. How should the study be conducted and what method of analysis that we have examined would be most appropriate for analyzing the data? Example 2: We wish to compare the survival time (in weeks) of heart patients that have had heart catheterization performed versus those that haven’t. In an attempt to create a fair comparison and control for potential confounding factors patients in each group are matched according race, age, socio-economic status, smoking history, cholesterol level, blood gas levels, etc... How could/should the resulting data be analyzed? Example 3: We wish to compare the percentage of frogs in two large wetland areas that exhibit mutation. How could the study be conducted and what method of statistical analysis should be used? Example 4: We wish to compare the average seed production of flowers located at the top of a plant vs. those located at the bottom of the plant. How could we conduct the study and how should the data be analyzed? Example 5: We wish estimate the percentage of wetland areas in the Twin Cities Metro area that have Methoprin levels above 2 ppm. How could we conduct the study and how should the data be analyzed? Following up on the same example, suppose 10 years ago that percentage was 15% and wish to determine whether that percentage has increased. What type of analysis should be conducted? Example 6: We have two suppliers of laboratory rats and we are trying to decide which provides us with the best “product”. One of the primary concerns is that the rats should all have the same weights upon arrival, thus we would like to choose the supplier that provides us with the rats having the most consistent weight. What kind of study should we conduct and how should we analyze the resulting data? Example 7: Neuropsychological Dysfunction in Children with Chronic Low-Level Lead Absorption The paper "Neuropsychological Dysfunction in Children with Chronic Low-Level Lead Absorption" published in Lancet, 1, pp. 708-715, March 1975, examined the effects of exposure to lead on the psychological and neurological well-being of children. In summary, a group of children who lived near a lead smelter in El Paso, Texas, were identified and their blood levels of lead were measured. An exposed group of 46 children were identified who had blood-lead levels > 40 micrograms/milliliter in 1972 (or in few cases 1973). A control group of 78 children was also identified with blood-lead levels < 40 micrograms/milliliter in both 1972 and 1973. All children in the study lived in close proximity to the lead smelter. The Data The three sets of variables used in the study could be classified as (1) IQ related measures (e.g. verbal IQ), (2) symptomatic variables (e.g. colic) and (3) neurological function variables (e.g. # finger taps in 10 seconds). In your analysis of these data you should try to answer the following question. General Research Question: WHAT, IF ANY, EFFECT DOES LEAD EXPOSURE HAVE ON THE THREE SETS VARIABLES USED IN THE STUDY? There are potential confounding factors present such as socio-economic status, proximity to plant, proximity to plant in early years of life, total years of exposure, gender, etc. The data is located in the file Leadep.JMP in the Biometry JMP folder The variables in the data file are: Demographic Variables area - residence of child in August 1972 (1 = 0-1 miles from smelter, 2 = 1 - 2.5 miles, 3 = 2.5 - 4.1 miles) First 2 Years? - did the child live near the plant during first two years of life (Yes, No) age - age of child in years followed by months (e.g. 1011 = 10 yrs. 11 mo.) agegroup - age group of child (1 = 0 - 4 yrs., 2 = 5 - 9 yrs, 3 = 10 - 14 yrs., 4 = 15 - 19 yrs.) Gender – Male, Female Social Index - Hollingshead Index of Social Status Psychological Function Variables (i.e. IQs) The variables preceding those defined below are the different individual component scores for the verbal and performance IQ tests (we will ignore these for the purposes of this discussion). iqv - verbal IQ score iqp - performance IQ score (block arrangements, pattern recognition etc.) iqf - total IQ (this is not a direct sum or average of the verbal and performance IQ) Lead Exposure Variables leadgrp - blood lead level group (1 = blood lead level below 40 in both 1972 and 1973, 2 = greater than 40 in both 1972 and 1973 or greater than 40 in 1973 alone, 3 = greater than 40 in 1972 but less than 40 in 1973) Lead Level - indicator of high lead in 1972 or 1973 (High = greater than 40 in 1972 and/or 1973, Low = below 40 in both 1972 and 1973) ld72 - actual lead level in 1972 ld73 - actual lead level in 1973 totyrs - total years of life living in close proximity to the lead smelter Symptomatic Variables Pica - indicator of pica (Yes, No) Colic - did the child have colic (Yes, No) Clumsy - does the child seem abnormally clumsy (Yes, No) Irritated - does the child seem abnormally irritable (Yes, No) Convulsions - does the child have a history of convulsions (Yes, No) Neurological Function maxtap - maximum # of taps from left and right hand trials (removes hand dominance) minvis - minimum reaction time to a visual stimulus from left and right hand trials (removes hand dominance) minaud - minumum reaction time to an auditory stimulus from left and right hand trials (removes hand dominance) maxfwt - maximum # of finger to wrist taps from left and right hand trials (removes hand dominance) POSSIBLE QUESTIONS AND ANALYSES (Questions and possible ways to statistically analyze the data to answer theses questions)