Statistics By Z S Chaudry Why do I need to know about statistics ? Tested in AKT To understand Journal articles and research papers Data Qualitative (Descriptive) Quantitative(Numeric) Discrete Continuous (range) Mean/Median/Mode Mean : Median : Mode : occurring value Average middle value of data Most Frequent Distributions and Ranges Gaussian distribution normal Positively Skewed Negatively Skewed Range Lower quartile Upper quartile Interquartile range – around median Standard deviation – spread around mean Square root of the variance Variance = sum of the square deviations from the mean / n 65% of values lie within 1 SD 95% of values lie within 2 SD 99% of values lie within 3 SD Key Terms Probability - likelihood or uncertainty of an event occurring Add probabilities if EITHER/OR events Multiply probabilities if AND events Power Related to size of study if study too small may not be able to detect a significant significance Errors Random Error Systematic Error (bias) Key Terms - contd Hypothesis Null hypothesis – NO DIFFERENCE between 2 groups under study Rejecting Hypothesis when true –Type 1 error Accepting Hypothesis when false – Type 2 error Compare test results T-test Chi-squared test Produce p-value Probability of result occurring by chance alone – p<0.05 significant – p<0.01 highly significant Key Terms - contd Confidence interval Level of uncertainty in following : Odds ratios, relative risk,risk difference,sensitivity,specificity The wider the range the less certain/significant the results CI usually 95 % i.e. 2 SD from mean in either direction. Provided study not biased true value can be expected to lie in the CI. Key Terms - contd The more people in a study the smaller the CI. CI range including zero not statistically significant or if results expressed as ratios a CI including 1 is not statistically significant. Measures of Risk INCIDENCE – New cases (New cases/population at risk over specific time) X 100 PREVALENCE-Existing cases (No of individuals with disease/population size during specific time) X 100 Measures of Association Risk varies from 0 to 1 Risk = probability of disease/death (R) Risk = No with disease/no at risk of disease Risk Difference = R1 – R2 Relative Risk = R1/R2 <1 intervention reduces risk of outcome =1 no effect on outcome >1 intervention increases risk of outcome Absolute Risk = R1 – R2 / R2 ODDs and ODDs Ratios Odds – ratio of probability of an event happening to that of it not happening Odds Ratio – measure of effectiveness of treatment compared to control OR = ODDs in treated grp/ODDs in control grp <1 effects of treatment less than control group =1 effect of treatment same as control group >1 effect of treatment greater than control group Diagnostic Testing SENSITIVITY – Positive test /total number of positives SPECIFICITY- Negative test when disease free Positive Predictive Value – likelihood that positive test will be a true positive Negative Predictive Value – likelihood that a negative test is a true negative NNT= Number needed to treat = 1/ ARR So the smaller the ARR the greater the NNT Bias Publication –positive results more likely to be published Selection – systematic differences between sample and target population. Information – systematic errors in measures of outcome or exposure ? Language – may be bias in inclusion of studies to be selected in metaanalysis.(combine results of several studies to answer a question) Validity Study validity Internal and external bias Internal validity Extent to which conclusions in a study are legitimate. External validity Degree to which conclusions generated from a study can be generalised to a target population. Study designs Experimental RCT Cohort Longitudinal follow-up of 2 or more groups with recorded exposure to risk Provides comparative incidence estimates between groups Can have surveillance bias Case controlled Used when prevalence low Study designs Observational Cross-sectional Gives prevalence estimates Forest plots Pictorial representation of ODDs ratios in form of a horizontal line If horizontal line crosses vertical line results are not significant! Horizontal line represents the 95% CI of each trial being plotted Further Reading High-Yield Biostatistics by Lippincott Williams and Wilkins The Complete nMRCGP Study Guide by Sarah Gear CASP tools – Critical Analysis to review papers – available on the web THE END THANK YOU