Analytic Epidemiology

Analytic Epidemiology

Determining the

Etiology of Disease

Study Development Process

Descriptive Studies:

Data Collection and

Analysis

Analyze results and retest

Model Building and hypothesis formulation

Analytic Studies for

Hypothesis testing

Causality and Causal

Relationships







Must have statistical significance

Association may be either positive or negative (if positive, the association is higher than expected; if negative the association is lower than expected)

Must try to rule out “noise” (assuring the comparison of “apples to apples” by controlling confounding factors)

Artifactual or Spurious

Associations









A false or fictitious association can result from chance occurrence or bias in the study methods

Type 1error occurs from random fluctuation

Through retesting, we can determine

“spurious relationships.

Non-causal associations take place when a factor and disease are associated indirectly

Causal Association













Strength of association

Dose-Response Relationship

Consistency of the Association

Temporally Correct Association

Specificity of the Association

Coherence with Existing Information

Sources of Data





Primary data – information collected directly by the researcher

Secondary data – data that has already been collected and stored for analysis

Types of Surveys









Administrative surveys, medical records, vital records and statistical data

Telephone surveys

Self-administered surveys

Personal interviews

Measurement Issues







Measurement is an attempt to assign numbers to observations according to a set of rules

Variables can be categorical or continuous

Intent is to translate observations into a system that allows assessment of the hypothesis

Types of Categorical

Variables









Nominal variables – assigns name or number purely on arbitrary basis (e.g., race, sex)

Ordinal variables – measures assigned from (typically) a lesser to greater value

Interval variables – scale that assigns a number to an observation based on a constant unit of measurement

Ratio – assigns numbers to observations to reflect a true point

Improving the Survey















How is measure administered?

Has it been used on similar situations with success?

Is measure understandable by those being surveyed?

Is sample accessible and identifiable?

Is special training required?

What is length of time in measurement?

Are results available in timely manner?

Reliability and Validity

Issues





Reliability – the extent to which a measurement has stability and homogeneity

Validity – represents the precision to which the measure is truly measuring the phenomena being measured (measure must be reliable to be valid)

Reliability and Validity

Issues





Content Validity – the extent to which the measure reflects the full concept being studied

Criterion Validity – assessed by comparing the test measure of the phenomenon

Sensitivity

 Sensitivity (Se) – measures how accurately the test identifies those with the condition or trait, i.e., correctly identifies or captures true positives

 High sensitivity is needed:

 When early treatment is important

 When identification of every case is important

Specificity

 Specificity (Sp) – measures how accurately the test identifies those without the condition or trait, i.e., correctly identifies or excludes the true negatives.

 High specificity is needed when:

 Re-screening is impractical

 When reducing false positive is important

Factors to consider in setting cutoffs

 Cost of false positives v. false negatives

 Importance of capturing all cases

 Likelihood population will be re-screened

 Prevalence of the disease (Pe): a.

Low Pe requires high Sp, otherwise too many false positives b.

High Pe requires high Se, otherwise too many false negatives

Disease State

Screening

Test Disease No Disease

Positive True Positive

(TP)

A

C

Negative False Neg.

(FN)

False Positive

(FN)

B

D

True Neg.

(TN)

Determining SE and SP

Rates













SE = TP / (TP + FN)

SP = Specificity = TN / (TN + FP)

False Neg. Rate = 1 – SE

False Pos. Rate = 1 - SP

Positive Predictive Value

TP/(TP + FP)

Negative Predictive Value

TN/(TN + FN)

SE and PE Example



You need to test the validity of cervical smears – pap smears – to determine the presence of cancer of the cervix. Smears were taken from 120 women known to have cancer of the cervix and from 580 women who did not have cancer of the cervix. In the laboratory, the smears were read “blind” as positive or negative.

Of the total 700 smears, 200 were read as positive, 110 of which came from the proven cancer cases.

The Research Cycle

Theory

Empirical findings

Operational hypothesis

Statistical Tests Observations and measurements

Types of Studies









Cross-sectional – prevalence rates that may suggest association (good for developing theory, but no causal association)

Retrospective (Case-control) – good for rare diseases and initial etiologic studies

Prospective (cohort, longitudinal, followup) – yields incidence rates and estimates for risk. Better for causal association.

Experimental (intervention studies) – strongest evidence for etiology

Considerations for Study

Design











Stage of hypothesis development

Nature of disease

Nature of Exposure

Nature of study population

Context of research

Cross-Sectional Studies







Single point in time (snapshot studies)

Risk factors and disease measured at the same time

Determines prevalence ratios

Cross-Sectional Study

Design

Sample

Population

Exposed

Non-

Exposed

Cases

Non-

Cases

Cases

Non-

Cases

Advantages and Disadvantages of

Cross Sectional Studies

Advantages



Gives general description or scope of problem



Useful in health service evaluation and planning







Baseline for prospective study

Identifies cases and controls for retrospective study

Low-cost

Disadvantages



No calculation of risk





Temporal sequence is unclear

Not good for rare diseases







Selective survival can lead to bias

Selective recall can lead to bias

Cohort effect may be misleading

Prospective Study Desgin

Disease free persons are classified on exposure at beginning of follow-up period then tracked to ascertain the occurrence of disease.

Question of Study: Do persons with the factor of interest develop or avoid the disease more frequently than those without the factor or exposure

Prospective Study Design

Sample

Population

Exposure

+

Exposure -

Cases

Non-

Cases

Cases

Non-

Cases

Prospective Study

Criteria









Obtain Incidence data

Obtain the incidence among the exposed A/A+B

Obtain incidence among the nonexposed to determine relative risk

C/C+D

Determine Relative Risk

[A/(A+B)]/[C/(C+D)]


Prospective Studies

Advantages



Provides good assessment of temporal sequence



Evaluate before onset of disease and watch for disease

Disadvantages







Selection bias

Loss to follow-up

Expensive

Retrospective Study

Design

Subjects are selected on the basis of disease status: either cases or controls then classified on the basis of past exposure

Question of Study: Do persons with the outcome of interest (cases) have the exposure characteristic (or history of exposure) more frequently than those without the outcomes (controls)

Retrospective Study Design

Exposure Positive A

Cases

Exposure Negative B

Exposure Positive C

Controls

Exposure Negative D

Retrospective Study Method









Compare the odds of exposure among the cases with the odds of exposure among the controls

Odds of Exposure Among Cases =

[A/(A+C)]/[C/A+C)] or A/C

Odds of Exposure Among Controls

=[B/(B+D)]/[D/B+D)] or B/D

Get Odds Ratio or odds of expose among cases/Odds of exposure among controls

(A/C)/(B/D)


Retrospective Studies

Advantages



Less expensive than cohort (retrospective)

Studies





Quicker than cohort

Can identify more than one exposure





Good for rare diseases

Well design leads to good etiologic investigation

Disadvantages



Selective Survival





Selective recall

Temporal sequence not as clear









Not suited for rare exposures

Gives an indirect measure of risk

More susceptible to bias

Limited to single outcome

Experimental Studies





Uses an intervention in which the investigator manipulates a factor and measures the outcome

Elements of a complete experiment







Manipulation of data

Use of a control group

Ability to randomize subjects to treatment groups


Experimental Studies

Advantages



Prospective direction





Ability to randomize subjects

Temporal sequence of cause and effect





Can control extraneous variables

Best evidence of causality

Disadvantages



Contrive situation









Impossible to control human behavior

Ethical Constraints

External validity uncertain

Expensive

Attributable Risk

The rate of disease in the exposed group attributable to exposure.



Relative risk measures the strength of the association



Attributable risk identifies risk of the disease attributable to exposure or the proportion of incidence in exposed group attributable to exposure

Attributable Risk

Calculation





Begins with (Incidence in the Exposed

Group) - (Incidence in the non-exposed group).

Search for the proportion of AR

Attributable Risk

Calculation

Incidence in the

Exposed Group

-

Incidence in the

Exposed Group

Incidence in the Exposed Group

Population Attributable Risk

Requirements







Incidence rate of disease among those exposed to a trait or characteristic

Incidence rate of disease among those not exposed to the trait or characteristic

The proportion of the population that has the trait or characteristic

PAR Example

Incidence lung cancer, smokers

% of smokers in population

+

Inc. lung cancer, nonsmokers

% of nonsmokers in pop.

EXAMPLE Using NV Rates

[(28.0/1000)(.32)] + [(17.0/1000) (.68)] = 20.5

Attributable Risk Example

Incidence, total population

-

Incidence, nonexposed population


(20.5/1000) (17.0/1000) = 3.5/1000

PAR

Incidence, total population

-

Incidence, nonexposed pop.

Incidence in total population


[(20.5/1000)- (17.0/1000)]

20.5

= 3.5/20.5 =

17%

Intervention Comparisons







To demonstrate any therapeutic effect uses a PLACEBO

To demonstrate improved therapy compare to CONVENTIONAL

TREATMENT

To demonstrate the most effective regimen compare DIFFERENT

REGIMENS

Blinding in Experimental

Studies









The importance of blinding depends on the needed outcome. Less important if the outcome is clear.

Non-blinded – both subject & investigator know the treatment allocation

Single-blinded – investigator knows, subject does not know

Double-blinded – neither investigator and subject

Sources of Bias











During selection of participants

Absence of blinding allocation can lead to differential classification

Other sources of miscalculation

Withdrawals, ineligible sources, loss to follow-up

Premature termination

Selection Bias







Cases and controls, or exposed and nonexposed individuals were selected is such that an apparent association is observed even if there is no association.

Biased selection - taking from a pool in which we know the risk is higher is selection bias.

Small sample size or small response size

Information Bias

Methods of information about the subjects in the study are inadequate and results show information gathered regarding exposures and/or disease is incorrect.













Reporting bias

Abstracting records

Bias in interviewing

Bias from surrogate interviews

Surveillance bias

Recall bias

Other Issues





Confounding Variables







To prove that Factor A is a result of disease B, we say that a third factor, Factor X is a

Confounder if the following is true:

Factor X is a known risk factor for Disease B.

Factor X is associated with Factor A bit is not a result of Factor A.

Interactions

Analytic Epidemiology

Analytic Epidemiology

Study Development Process

Causality and Causal

Relationships

Artifactual or Spurious

Associations

Causal Association

Sources of Data

Types of Surveys

Measurement Issues

Types of Categorical

Variables

Improving the Survey

Reliability and Validity

Issues

Reliability and Validity

Issues

Sensitivity

Specificity

Factors to consider in setting cutoffs

Determining SE and SP

Rates

SE and PE Example

The Research Cycle

Types of Studies

Considerations for Study

Design

Cross-Sectional Studies

Cross-Sectional Study

Design

Prospective Study Desgin

Prospective Study Design

Prospective Study

Criteria

Retrospective Study

Design

Retrospective Study Design

Retrospective Study Method

Experimental Studies

Attributable Risk

Attributable Risk

Calculation

Attributable Risk

Calculation

Population Attributable Risk

Requirements

PAR Example

Attributable Risk Example

PAR

Intervention Comparisons

Blinding in Experimental

Studies

Sources of Bias

Selection Bias

Information Bias

Other Issues

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib