Teaching Statistics By Example - BU

advertisement
Teaching Statistics By
Example
Lisa M Sullivan, PhD
Boston University
Outline
 Statistics in the News
 The Introductory Statistics
Requirement
 Course Content
 Recent Efforts at Reform
 Modifications?
 Examples for Specific Topics
News – Week of May 29, 2006
 Preserve brain function with spicy
foods.
Kicking your food up a notch with spices
could preserve brain function and keep your
brain sharp and strong as you age.
Turmeric, a spice that lends curries their
yellow tint, can curb mental decline and
even slow the effects of neurodegenerative
diseases such as Alzheimer’s.
News – Week of May 29, 2006
 Education linked to better fathers
study
U.S. data shows trend between education
and time spent with kids
Well-educated men tend to make better
fathers, according a new U.S. government
report on fatherhood.
News – Week of May 29, 2006
 U.S. Releases Bird Flu Response Plan Details outline containment policies; focus
on worst-case scenarios may spread fear,
experts say
 Breast Cancer Survivors Lax About
Mammograms - Only 33% get them
annually for five years after diagnosis,
study finds
News – Week of May 29, 2006
 The FDA issued a long-awaited
approval on a new human growth
product.
Approval of Omnitrope, made by Sandoz,
was announced Tuesday in a statement on
the FDA’s Web site.
Omnitrope, also known as somatropin, is a
hormone used to treat growth disorders in
children and adults.
Statistics Requirement
 Numerical literacy
 Provide quantitative foundations for
study in specific disciplines
 Understand and interpret data
 Perform independent research
Careers in Statistics
 Business and Industry (Manufacturing, Marketing,
Engineering)
 Health and Medicine (Public Health, Clinical Trials,
Epidemiology, Genetics, Health Communication)
 Government (Census, Surveys)
 Academia
 Social Sciences
 Health Insurance
Demand for statisticians far exceeds supply
today and this is expected to increase
through 2008.
The Introductory Statistics Course
 Difficult and frustrating for students
 Difficult and frustrating for instructors
 Hundreds of thousands of
undergraduates across a variety of
majors are required to take statistics
- most not mathematically inclined
Typical Course Content
 Descriptive Statistics
 Classification of Variables
 Means, Standard Deviations, Medians
 Graphical Displays
 Principles of Probability
 Probability Models
 Binomial, Poisson, Normal
 Central Limit Theorem
Course Content
 Estimation
 Point Estimates
 Margin of Error
 Precision
 Hypothesis Testing




Hypotheses
Test Statistic
Critical Region
Level of Significance, P-values
Course Content
 Associations Between Variables
 Regression Analysis
 Analysis of Variance
 Chi-Square Tests
Reforms in Undergraduate
Education in Statistics
 NSF funded various projects to
improve teaching of undergraduate
statistics courses
 May 1999 the ASA’s Undergraduate
Statistics Education Initiative (USEI)
was launched
Focus of the Reform
 Emphasize concepts over procedures
 Teach students to:




Formulate research questions
Collect data
Analyze data
Interpret results
Focus of the Reform
 Gain experience working with real
data
 Focus on active learning
 Build communication skills
 MAJOR FOCUS on statistical literacy/
statistical thinking
Statistical Thinking
 Process of using wide ranging and interacting
data to understand processes, problems, and
solutions. The opposite of one factor at a time,
where ones natural born tendency is to change
one factor and “see” what happens. Statistical
thinking is the tendency to want to understand
how several control factors may be interacting
at once to produce an outcome. Common
cause variation becomes your friend and
special cause variation your enemy. Attribute
judgments of good and bad are replaced with
estimates of significance with given confidence.
---Six Sigma
http://www.isixsigma.com/dictionary/Statistical_Thinking-454.htm
Statistical Thinking
 Recognize and attempt to
understand/explain variation
 The process of asking a “good”
question, collecting, analyzing and
interpreting data and appropriately
recognizing limitations
 How do we teach statistical thinking?
Recommendations for Instructors
 Provide working examples that
include questions and processes to
solve statistical problems
 Allow students to practice using
statistical thinking with open-ended
questions and problems
 Use technology to collect, manage
and analyze data
Recommendations for Instructors
 Use real data
 Choose data and questions that are of
interest to students
 Reduce content to focus on key
concepts in greater detail
What’s Missing






Study Design Issues and Implications
Real Data
Interpretation of Results
Practical Implications
Limitations of Inferences
Statistical Computing
Limitations/Interpretation
 Association is not causation
 Statistical significance is not practical
importance
 Lack of statistical significance does
not imply no difference
 Understand how to interpret news
stories/articles with statistical
information
Modifications?
 Include design and analysis issues in
curriculum
 Sharpen skills in interpretation of
results
 Include projects with real data
 Stress communication skills
 Focus more on big picture
Big Picture
What is a statistical study?
How is sample constructed?
What are the key questions?
How is information collected – analyzed
– interpreted?
 What makes a good study?




 A poorly analyzed study can be reanalyzed. A poorly designed study cannot
be re-designed.
Biostatistics
 A specialized branch of applied
mathematics/statistics that deals with
the statistical evaluation of
experimental research or clinical trial
results.
 Statistical applications in the medical
or public health arena.
Biostatistics
Mathematics/Statistics
Medicine/Public Health
Biostatistics
Computer Science
Examples
 What proportion of college students
drink alcohol, use illegal drugs?
 Should driving age be increased?
 Are cell phones safe for children?
 How can we address these questions?
Research Teams
 Principal Investigator (Clinicians,
Scientists)
 Statistician/Biostatistician
 Co-Investigators
 Project Manager
 Statistical Programmers
 Research Assistants
Statistician’s Role on Team
 Develop Study Design





Research question
Study sample
Sample size
Enrollment/Follow-up strategies
On-going monitoring
 Perform Interim and Final Analysis
 Interpret and Report Results
Cell Phones for Children?
 University of Washington scientist states
70-80% of the energy emitted from the
antenna of a mobile phone is absorbed in
the head: Children’s skulls are thinner and
their growing brains may be much more
susceptible to radiation exposure.
 FDA states that scientific evidence does not
show a danger to users of wireless
communication devices including children.
Study of Cell Phone Safety
 What is the outcome?
 What is exposure/risk factor?
 How can we assess relationship
between cell phone use and health
outcomes?
 Study Design
 Data Collection and Analysis
 Interpretation/Attribution
Issues for Biostatisticians
 Children - Obesity, Immunizations, Asthma,
Autism…..
 Adolescents – Alcohol & Tobacco Use,
Depression, STDs, Traffic Accidents….
 Adults – Cancer, CVD, Substance Abuse,
HIV/AIDS, Mental Health…
 What is #1 killer of men and women in US?
 What are the risk factors?
Research – Set Context
 Framingham Heart Study
 Pharmacologic Clinical Trials in
Children with Autism
 Effect of Alcohol Exposure in
Pregnancy on SIDS
The Framingham Heart Study
 5000+ men and women enrolled in 1948
 Longitudinal cohort study
 Exams every 2 years for cardiovascular risk
factors - surveillance
 Ancillary studies – hearing, exercise,
nutrition, neurological studies
 5000+ offspring & spouses enrolled in 1976
 Third generation enrolled in 2002
http://www.nhlbi.nih.gov/about/framingham/
Milestones from Framingham
1960 Cigarette smoking increases risk of heart
disease
1961 Cholesterol & blood pressure increase risk of
heart disease
1967 Physical activity reduces risk of heart disease,
obesity increases risk of heart disease
1970 High blood pressure increases risk of stroke
1978 Psychosocial factors increases heart disease
1988 High levels of HDL cholesterol reduces risk of
death
More than 1500 scientific papers published
Framingham Study Risk
Functions
 Risk prediction models
 Predict likelihood that a person will have
coronary heart disease in the next 10 years
 Models designed to include risk factors
that are readily available
 Age, blood pressure, cholesterol, smoking,
diabetes, treatment for hypertension & high
cholesterol, obesity
Risk Calculator
http://hp2010.nhlbihin.net/atpiii/calculator.asp
Clinical Trial in Children with Autism
 Autism-brain disorder usually diagnosed
before age 3 that affects communication,
social interaction, and creative play.
 Affects over 500,000 children in the US
 Trial to assess the efficacy of drug
treatment in reducing repetitive behaviors
 Children randomized to receive study drug
or placebo
Clinical Trial in Children with Autism
 144 children with autism aged 5-17 years
followed every 2 weeks for 12 weeks for
improvements in repetitive behaviors
 Issues
 Randomization/Blinding
 Measurement of outcome (child, parent,
teacher)
 Safety/Ethical issues
Effect of Alcohol Exposure in
Pregnancy on SIDS
 SIDS – Unexplained infant death
before 1 year of life
 Extremely high rates of SIDS among
American Indians in Northern Plains
of North and South Dakota and in
Cape Town South Africa
 High rates of alcohol consumption
Effect of Alcohol Exposure in
Pregnancy on SIDS
 SIDS – 0.57/1000 in US
3.4/1000 in Northern Plains
3.5/1000 in Cape Town
 In US – 13% of women report drinking
alcohol in pregnancy
58% in Northern Plains
41% in Cape Town
Effect of Alcohol Exposure in
Pregnancy on SIDS
 Study of 12,000 pregnant women in
Northern Plains and Cape Town
 Assess relationship between alcohol
and SIDS
 Issues
 Measuring alcohol exposure
 Ethical Issues – e.g., Autopsies
Examples for Specific Topics
 Conditional Probability
 Performance of screening tests for
prenatal diagnosis, prostate cancer,
breast cancer, HIV
Prenatal Diagnosis
 Your family is pregnant – should you
have a screening test?
 Standard of Care in the US is serum
screen
 68% sensitivity
 5% false positive rate
Performance Characteristics of
Screening Tests
Test +
Test -
Disease +
a
c
Disease –
b
d
Performance Characteristics
 Sensitivity = True Positive Fraction =
P(Test + | Disease)
 False Positive Fraction =
P(Test + | No Disease)
For the Patient
 Positive Predictive Value =
P(Disease | Test +)
 Negative Predictive Value =
P(No Disease | Test -)
Examples for Specific Topics
 Normal Probability Model
 Percentiles
Height, Weight, BMI for age
http://www.cdc.gov/growthcharts/
Statistical Inference
 Estimation and Hypothesis Testing
 Clinical Trials
 Search for clinical trials
 Recent results (press releases and scientific
articles)
http://www.clinicaltrials.gov/
Introductory Statistics
 Big Picture and Make it Real!
 Real Data
 Relevant Examples
 Focus on Interpretation – Practical
Importance
http://health.msn.com/
Academic Programs at Boston U
www.bu.edu
 BA in Mathematics/Statistics
 Minor in Applied Statistics
 Summer Institute for Training in Biostatistics
 MA in Biostatistics
 PhD in Biostatistics
 Minor in Public Health
(Biostatistics, Epidemiology, Environmental Health,
International Health, Health Law, Maternal and
Child Health, Health Services, Social and Behavioral
Science)
Download