CORRECT TABLE 4 IN PAPER II (errors in reference numbers in version printed in thesis) Table 4. Measurement properties of the back-specific functional status questionnaires Questionnaire abb. Internal consistency BDS BACK-ILL Good 43 Rasch good BPFS Good 11 Good 11 BPI BQ Good 46 Good 46 CBPQ Good 39 75 DPQ Good 82 Good, 75 inadequate 39 Acceptable to good, 82 inadequate 41 DRI 0.84 40 FOQSD - Test-retest reliability 49 FRI Good GFS Acceptable 35 47 External construct validity Good 43 Good 49 Discriminant validity Responsiveness Acceptable 49 Acceptable 49 Good 11 Good 46 Good 39 75;81 Acceptable 11 Acceptable 11 - Acceptable, 80 inadequate 46 Acceptable 39;75;81 Good 39 Good, 41;83 moderate to weak 82 Acceptable 41;84 Acceptable for daily activities and work/leisure scales, 82;83 inadequate for social 82 and anxiety/depression scales 83 85 ICC 0.95. 40 Good 40 86 Good, 62 acceptable 40 - - Good, 40 acceptable, 62 87 inadequate 88 - Good 47 Good 35;90 Good 35 Acceptable 49 Good 47 Good 35;61;89 Good 47 Good 35 61 81 - 89;90 FFbH-R Good 42 Inadequate 42;71 Good, 42;71 moderate (psychological measure) - Acceptable 42 Acceptable 19 91 JVB - Good? 26 Good 19 - JOA - Good 54 Good 92 54 Good 27;94 Inadequate 92 - LBOS Good RS - Good 28 Good 28 - - Million - Inadequate 29 Moderate 35;95 Good 90;95 Good, 90 acceptable 29 NASS Good 30 96 76 Good 30 97 96 Good 30;96 97 76 Acceptable 30 Acceptable 76 ORQ Good 38 Good 38 Moderate 38 - ODI 1.0 Acceptable to good 36 98 54;99 Good, 36 101 102 56 100 inadequate 22 Good 27 40 19;35;36;94;99;103 54;61;61;100;104 moderate (pain, 101;105 global change index, 106 physical impairment measures, 107;108 109 110;110;111 generic measure, 102) weak (psychological measure, 112 work 113 103 114) Acceptable for satisfaction scale, inadequate for productivity scale 38 Good, 115 90;95;116 acceptable Good, 125 acceptable to good 77 - Good, 125;126 inadequate 23 Good, 23 125 moderate, 77 weak (depression) 95 Good, 62;126 acceptable 77 Good, 62 acceptable 23 77 126 - - - - ODI 2.0 ODI AAOS 56;93 Good 93 100 54;61;89 Acceptable 27;55 36;102;113;117-121 122 Acceptable 27 55 56 Good, 104;116 123 85 95 89;90;124 acceptable, 22;115 36 19 113 102 inadequate 114 ODI Chir Acceptable 25 Good, 127 inadequate 25 Good, 128 moderate, 129 83;130 weak (depression, Good 127 Good, 128;132 127 acceptable, inadequate 25 80;83 - Good for physical function scale 83 OMLSS Acceptable to good 34 Acceptable to good 34 PRAP - PSA NA48 Acceptable to good 45 - QBPDI Good 37 36 133 Acceptable to good 36;126;127;133;133 radiographical findings 131 ) Moderate 34 34 Good 45 Inadequate 45 - Good, 36 50;133;134 weak (physical impairment) Good 48 Good 48 Good 37;127 36 126 Good, 85 50;127 acceptable 36;37 133 126 135 126 127 RADL Good RMQ Good 136 137 inadequate 37 Good 44 44 138 139 Acceptable to good, 17 36;136 0.87 11 78 140 141 11 78 140 126 125 99 79 54 125 SE 8.8%, 11 inadequate 72 17 79;136 RMQ-two RMQ-7p Good - 20 Good 20 Inadequate 21 RMQ-23 Good 6 153 Good 153 RM-18 Good 19 Good 19 RM-16 - SPIM - WDI Acceptable Good 50 56 Acceptable to good 31 126 Moderate to good 44 Acceptable 44 Good 44 Good 72 23 19 136 36 99 11 81 75 136 103 35 54 125 moderate to good, 44 moderate (pain, 17 142 physical impairment measures, 78 143 144 145 global rating, 146) weak (work, 103 physical impairment measures, 147 physical activity in daily life 148) Good, 72;115 19;146 62;116 acceptable 36;149 Good, 115;116 19 146 149 44 11;81;119 acceptable 152 36 11 126 Good 20 Good 21 Good, 6 154 moderate (pain) 35 Good 19 Good 18 Good 50 Good, 51 27 weak (lumbar motion) 108 - Good 21 Inadequate 155 Good 6 Good 19 Acceptable 19 Acceptable to good 18 Good 18 Acceptable 50 Good, 48 62;78;81;150;151 126 acceptable Acceptable 51;126 51 Questionnaire abbrevations (abb): the same abbrevations as in Table 3 are used. Internal consistency: Considered as good when Chronbach’s Alpha ≥ 0.80 or Rasch analysis is good, acceptable when Chronbach’s Alpha 0.80-0.70 or Rasch shows some problems, inadequate when Chronbach’s Alpha < 0.70, only inter-item correlations or inadequate statistical analyses. 14 Test-retest reliability: Considered as good when intraclass correlation coefficients (ICC) ≥ 0.80, Kappa ≥ 0.75, and/or good evidence for minimal detectable change (MDC) or coefficient of variation (CV), acceptable when ICC 0.80-0.40 and/or Kappa 0.75-0.40, and inadequate when ICC and/or Kappa < 0.40, and when Pearson statistics is used. 14 External construct validity: Considered as good when high convergent correlations with r ≥ 0.60, moderate when r 0.60-0.30, and weak when r <0.30. 14 Discriminant validity: Considered as good when evidence meets high standard such as area under receiver operating curves or group differences by means or %, acceptable when adequate standard, and inadequate or low when evidence do not support the discriminant abilities, or none/few analyses have been carried out. 14 Responsiveness (sensitivity to change): Considered as good when evidence is strong, in the expected direction, supported by patient or clinical evidence, and/or effect size statistics good (SRM ≥ 0.80), acceptable when moderate effect size (SRM 0.40-0.80), conflicting evidence or only used associations/correlations, inadequate or weak evidence when SRM< 0.40, when evidence is based solely on statistical significance, or inadequate statistical analyses. 14 NA= not applicable The details of this table can be ordered by contacting the authors.