2013 Standard Protocol for Evaluating Screening Tools

CENTER ON RESPONSE TO INTERVENTION AT AIR
STANDARD PROTOCOL
FOR EVALUATING RESPONSE TO
INTERVENTION TOOLS:
READING AND MATH SCREENING
August 2013
Screening involves brief assessments that are valid, reliable, and evidence-based. They are
conducted with all students or targeted groups of students to identify students who are at risk of
academic failure and, therefore, likely to need additional or alternative forms of instruction to
supplement the conventional general education approach.
Please Read and Complete Before You Start
In order to know if your tool qualifies for review, please answer the following four
questions regarding the research evidence for your tool:
(1) Can you provide direct evidence* on the effects of using your tool?
    YES_____  NO_____

(2) Do you have the following classification data for your tool?
    a. Specificity                                      YES_____  NO_____
    b. Sensitivity                                      YES_____  NO_____
    c. Positive predictive power                        YES_____  NO_____
    d. Negative predictive power                        YES_____  NO_____
    e. Area Under the Curve (AUC) derived from a
       Receiver Operating Characteristic (ROC)
       curve analysis                                   YES_____  NO_____

(3) Is your outcome variable a reading (for reading screeners) or math (for math
    screeners) measure?
    YES_____  NO_____

(4) Are there at least three months between the screening and your outcome measure?
    YES_____  NO_____

*Direct evidence refers to data from a study that has been conducted based on the tool
submitted for evaluation. Studies that use data from the use of another tool, even if it
is similar, are considered indirect evidence and will not be considered as adequate
evidence for the purposes of this review.
If you cannot answer yes to all of these questions, we will not review your tool. If
you are able to answer YES to all of the above questions, then your tool qualifies
for review. Please proceed to the following page and begin filling out the protocol.
PLEASE NOTE: If your tool assesses more than one component skill of reading,
you MUST submit separate protocols for EACH sub-test that is used to identify
students.
The Center on Response to Intervention at AIR will review all of your submitted materials to
ensure that they adhere to the qualifications for review stated above. If your submission
packet is missing critical information or requires a substantial amount of supplemental
information, the entire packet will be returned to you. A revised protocol packet with
additional information may be resubmitted.
Results of the review will be posted on the Center’s website, in the Screening Tools Chart.
Once the review has begun, withdrawal of tools from the process will not be permitted.
If you are unfamiliar with any of the terms in the protocol or unsure of what information to
provide, Center staff will be available to answer your questions or to assist you in completing
the protocol for submission. Please contact the Center on Response to Intervention at AIR:
Center on Response to Intervention at AIR
American Institutes for Research
1000 Thomas Jefferson Street, NW
Washington, DC 20007
Phone: 1-877-RTI-4-ALL (1-877-784-4255)
E-mail: [email protected]
In order to be eligible for review, you must read and sign the following marketing
language agreement.
Marketing Language Agreement
By signing this agreement I have indicated my understanding of the intent and purpose of
the Center on Response to Intervention at AIR’s tools chart, and my agreement to use language
that is consistent with this purpose in any marketing materials that will be used to publicize my
product’s presence and ratings on the chart.
Specifically, I understand the following:
1. The Technical Review Committee (TRC) rated each submitted tool against
established criteria but did not compare it to other tools on the chart. The presence of
a particular tool on the chart does not constitute endorsement and should not be
viewed as a recommendation from either the TRC or the Center on Response to
Intervention at AIR.
2. All tools submitted for review are posted on the chart, regardless of results. The chart
represents all tools that were reviewed, not those that were “approved.”
When marketing my product, I will not use any language that is inconsistent with the above.
Examples of inappropriate marketing language include, but are not limited to, the following:
(a) Reference to a “top-ranked” product in comparison to other products on the chart
(b) Reference to “approval” or “endorsement” of the product by the Center on RTI at AIR
If the Center becomes aware of any marketing material for my product that violates this
agreement, I understand that I risk removal of the product from the chart. I also understand that
I may draft language and submit it to Center staff for review in advance of releasing it, in order
to ensure compliance with this agreement.
I have read and understand the terms and conditions of this Agreement. By signing
below, I signify my agreement to comply with all requirements contained herein.
Signature
Date
Print Name
Organization
SECTION I: BASIC INFORMATION
A. TOOL
Title:
Developer:
Publisher:
Pub. Date:
Contact Person:
    Name:
    Telephone:
    Email Address:
B. DESCRIPTIVE INFORMATION
Description of tool:
1. The tool is intended for use in grade(s) (Check all that apply):
☐ Kindergarten   ☐ First Grade   ☐ Second Grade   ☐ Third Grade
☐ Fourth Grade   ☐ Fifth Grade   ☐ Beyond Fifth Grade

2. The tool is intended for use with age(s) (Check all that apply):
☐ 4   ☐ 5   ☐ 6   ☐ 7   ☐ 8   ☐ 9   ☐ 10+
3. What is the language of administration?
4. What is the area of skill screened? (Check all that apply)

Reading
☐ Subword processes:
    ☐ Phonological processing
    ☐ Phonological Memory
    ☐ Phonological Awareness
    ☐ RAN
    ☐ Letter sound correspondence
    ☐ Structural analysis
☐ Word ID:
    ☐ Accuracy
    ☐ Speed
☐ Nonword:
    ☐ Accuracy
    ☐ Speed
☐ Spelling:
    ☐ Accuracy
    ☐ Speed
☐ Passage:
    ☐ Accuracy
    ☐ Speed
☐ Reading comprehension:
    ☐ Multiple choice questions
    ☐ Constructed Response
    ☐ Maze
    ☐ Cloze
    ☐ Retell
    ☐ Sentence verification
    ☐ Other:
☐ Listening comprehension:
    ☐ Multiple choice questions
    ☐ Constructed Response
    ☐ Cloze
    ☐ Retell
    ☐ Sentence verification
    ☐ Other:
☐ Vocabulary:
    ☐ Expressive
    ☐ Receptive
☐ Phonics
Mathematics
☐ Global Indicator of Mathematics Competence
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Early Numeracy
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Mathematics Concepts
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Mathematics Computation
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Mathematics Application
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Fractions/Decimals
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Algebra
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Geometry
☐ Accuracy
☐ Speed
☐ Multiple choice questions
☐ Constructed Response
☐ Other (List specific skills or subtest)
NOTES:
5. Acquisition Information:
Where to Obtain:
Address:
Phone #:
Web Site:
Cost per student for Year 1:
Including:
$_____ Complete Kit (describe contents)
$_____ Manuals and Test Materials
$_____ Directions for Administration
$_____ Test Forms
$_____ Technical Manuals
$_____ Protocol per Student
Other (Describe below):
$_____
$_____
$_____
$_____
Cost per student for subsequent years:
Including:
$_____ Complete Kit (describe contents)
$_____ Manuals and Test Materials
$_____ Directions for Administration
$_____ Test Forms
$_____ Technical Manuals
$_____ Protocol per Student
Other (Describe below):
$_____
$_____
$_____
$_____
$_____
Additional Information on the tools:
Describe basic pricing plan and/or structure of the tools. Also, provide information on what is included in
the published tools, including information about special accommodations for students with disabilities.
SECTION II: EFFICIENCY
A. TIME, ADMINISTRATION, AND FREQUENCY
Assessment format (Check all that apply):
☐ Check lists
☐ Questionnaire
☐ Direct (please specify):
☐ Computerized
☐ One to one
☐ Group Administered
Administration time: _____ minutes per student; _____ minutes per total group
Additional scoring time: _____ minutes per student; _____ minutes per total group
Discontinue Rules:
☐ yes
☐ no
☐ basals
☐ ceilings
☐ other:
Alternate forms available: ☐ yes ☐ no
If yes, number per grade, test level, or units:
Are norms available?
☐ yes
☐ no
Are benchmarks available?
☐ yes
☐ no
How many benchmarks per year?
What months during year?
B. TRAINING
Time required for training tester:
☐ less than 1 hour of training
☐ 1-4 hours of training
☐ 4-8 hours of training
☐ information not available
Minimum qualifications of the examiner:
☐ professional
☐ paraprofessional
☐ information not available
Training manuals and materials available:
☐ yes
☐ no
Training manuals/materials field-tested: ☐ yes ☐ no
Training manuals/materials included in cost of tools: ☐ yes ☐ no
Sources for ongoing technical support available: ☐ yes ☐ no
C. SCORING
Types of scores available (Check all that apply):
☐ raw score
☐ standard score
☐ percentile score
☐ grade equivalents
☐ IRT-based score
☐ age equivalents
☐ stanines
☐ normal curve equivalents
☐ developmental benchmarks:
☐ cut points
☐ benchmarks
☐ equated
☐ probability
☐ lexile score
☐ subscale/subtest scores
☐ composite scores
☐ error analysis
☐ Other (specify):
Is a scoring key available? ☐ yes ☐ no
Basis for calculating standard & percentile scores:
☐ age norms
☐ grade norms
☐ stanines
☐ normal curve equivalents
Scoring Structure (specify how raw scores are calculated and what comprises cluster/composite score):
Describe the tool’s approach to screening, behavior samples, test format, and/or scoring practices, including steps
taken to ensure that it is appropriate for use with culturally and linguistically diverse populations and students with
disabilities.
Describe the tool’s theoretical justification.
SECTION III: TECHNICAL INFORMATION
A. CLASSIFICATION ACCURACY
1a. Please provide the following classification data[1] for your tool:
              Screen: At-Risk        Screen: Not At-Risk
RD            True Positive (a)      False Negative (c)
Not RD        False Positive (b)     True Negative (d)
Total         a+b                    c+d                    N = a+b+c+d
False Positive Rate [b/(b+d)] =
False Negative Rate [c/(a+c)] =
Sensitivity [a/(a+c)] =
Specificity [d/(b+d)] =
Positive Predictive Power [a/(a+b)] =
Negative Predictive Power [d/(c+d)] =
Overall Classification Rate [(a+d)/(a+b+c+d)] =
Area Under the Curve (AUC) = ____________________
What is the base rate in the sample [(a+c)/(a+b+c+d)] =
What were the cut-points?
___________________________________________________________________________________
___________________________________________________________________________________
___________________________________________________________________________________
Please provide specificity for the following three sensitivity levels:
Sensitivity 90%: Specificity =_____________
Sensitivity 80%: Specificity =_____________
Sensitivity 70%: Specificity =_____________
[1] For assistance in calculating these values, please use the following worksheet: Classification Data. For assistance in conducting a ROC curve analysis and computing AUC, please see the following guidance document: Receiver Operating Characteristic.
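The formulas in the chart above can be computed mechanically from the four cells of the 2×2 table. The sketch below is illustrative only: the cell counts and screening scores are hypothetical, and the `auc` helper uses the rank-based definition of AUC (the probability that a randomly chosen at-risk student receives a more at-risk score than a randomly chosen not-at-risk student, with ties counting half), which is equivalent to the area under the empirical ROC curve.

```python
# Hypothetical cell counts: a = true positives, b = false positives,
# c = false negatives, d = true negatives (the 2x2 table above).
a, b, c, d = 40, 25, 10, 125
n = a + b + c + d

false_positive_rate = b / (b + d)            # b/(b+d)
false_negative_rate = c / (a + c)            # c/(a+c)
sensitivity = a / (a + c)                    # a/(a+c)
specificity = d / (b + d)                    # d/(b+d)
positive_predictive_power = a / (a + b)      # a/(a+b)
negative_predictive_power = d / (c + d)      # d/(c+d)
overall_classification_rate = (a + d) / n    # (a+d)/n
base_rate = (a + c) / n                      # (a+c)/n

def auc(at_risk_scores, not_at_risk_scores):
    """AUC as the probability an at-risk student outscores a not-at-risk
    student (scores oriented so higher = more at risk); ties count 0.5."""
    pairs = [(p, q) for p in at_risk_scores for q in not_at_risk_scores]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p, q in pairs)
    return wins / len(pairs)

print(round(sensitivity, 3), round(specificity, 3))   # 0.8 0.833
print(round(auc([9, 8, 7], [7, 5, 3]), 3))            # 0.944
```

With these made-up counts, sensitivity is 40/50 = 0.80 and specificity is 125/150 ≈ 0.83; the same arithmetic applies to whatever counts your validation study produced.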
1b. Do you provide, in your user’s manual, classification data that are disaggregated by race, ethnicity, or
language proficiency? If so, complete below for each race/ethnicity/LEP group for which you provide
disaggregated classification data. Copy additional pages as needed.
Race-ethnicity/LEP group: __________________________________
              Screen: At-Risk        Screen: Not At-Risk
RD            True Positive (a)      False Negative (c)
Not RD        False Positive (b)     True Negative (d)
Total         a+b                    c+d                    N = a+b+c+d
False Positive Rate [b/(b+d)] =
False Negative Rate [c/(a+c)] =
Sensitivity [a/(a+c)] =
Specificity [d/(b+d)] =
Positive Predictive Power [a/(a+b)] =
Negative Predictive Power [d/(c+d)] =
Overall Classification Rate [(a+d)/(a+b+c+d)] =
Area Under the Curve (AUC) = ____________________
What is the base rate in the sample [(a+c)/(a+b+c+d)] =
What were the cut-points?
___________________________________________________________________________________
___________________________________________________________________________________
___________________________________________________________________________________
Please provide specificity for the following three sensitivity levels:
Sensitivity 90%: Specificity =_____________
Sensitivity 80%: Specificity =_____________
Sensitivity 70%: Specificity =_____________
2. Please provide a description of the study sample on which the preceding data are based.
National representation: ☐ Yes ☐ No
Local representation: ☐ Yes ☐ No
Date:
Number of States:
Size:
Regions:
Gender (Percent):
Male
Female
Unknown
SES (Percent, measured as free or reduced-price lunch)
Eligible for free or reduced-price lunch
Other SES Indicators:
Race/Ethnicity (Percent):
White, Non-Hispanic
Black, Non-Hispanic
Hispanic
American Indian/Alaska Native
Asian/Pacific Islander
Other
Unknown
Disability classification (Please describe):
First language (Please describe):
Language proficiency status (Please describe):
3. In the aforementioned study, was a separate external reading or pre-reading measure with national
norms administered at the beginning of the study?
☐ Yes
☐ No
If yes, please describe:
If yes, what percentage of children in the sample scored:
___ below the 25th percentile
___ between the 25th and 75th percentile
___ above the 75th percentile
4. Were the children in the study involved in an intervention in addition to typical classroom instruction
between the screening measure and outcome assessment?
☐ Yes
☐ No
If yes, please describe the intervention, which children received the intervention, and how they
were chosen:
5. Describe the criterion (outcome) measure(s) and how scores are broken down into risk and no-risk categories.
6. Describe how the classification analyses were performed and cut-points determined.
B. CROSS-VALIDATION
1. Has a cross-validation study been conducted?
☐ Yes ☐ No
Note: In a cross-validation study, previously identified classification rules are used to predict risk and non-risk in the
new sample.
If yes, provide the following classification data for the cross-validation study:
              Screen: At-Risk        Screen: Not At-Risk
RD            True Positive (a)      False Negative (c)
Not RD        False Positive (b)     True Negative (d)
Total         a+b                    c+d                    N = a+b+c+d
False Positive Rate [b/(b+d)] =
False Negative Rate [c/(a+c)] =
Sensitivity [a/(a+c)] =
Specificity [d/(b+d)] =
Positive Predictive Power [a/(a+b)] =
Negative Predictive Power [d/(c+d)] =
Overall Classification Rate [(a+d)/(a+b+c+d)] =
Area Under the Curve (AUC) = ____________________
What is the base rate in the sample [(a+c)/(a+b+c+d)] =
What were the cut-points?
___________________________________________________________________________________
___________________________________________________________________________________
___________________________________________________________________________________
Please provide specificity for the following three sensitivity levels:
Sensitivity 90%: Specificity =_____________
Sensitivity 80%: Specificity =_____________
Sensitivity 70%: Specificity =_____________
2. Please provide a description of the cross-validation study sample.
National representation: ☐ Yes ☐ No
Local representation: ☐ Yes ☐ No
Date:
Number of States:
Size:
Regions:
Gender (Percent):
Male
Female
Unknown
SES (Percent, measured as free or reduced-price lunch)
Eligible for free or reduced-price lunch
Other SES Indicators:
Race/Ethnicity (Percent):
White, Non-Hispanic
Black, Non-Hispanic
Hispanic
American Indian/Alaska Native
Asian/Pacific Islander
Other
Unknown
Disability classification (Please describe):
First language (Please describe):
Language proficiency status (Please describe):
3. In the cross-validation study, was a separate external reading or pre-reading measure with national
norms administered at the beginning of the study?
☐ Yes
☐ No
If yes, please describe:
If yes, what percentage of children in the sample scored:
___ below the 25th percentile
___ between the 25th and 75th percentile
___ above the 75th percentile
4. Were the children in the cross-validation study involved in an intervention in addition to typical
classroom instruction between the screening measure and outcome assessment?
☐ Yes
☐ No
If yes, please describe the intervention, which children received the intervention, and how they
were chosen:
5. Describe the criterion measure(s) for the cross-validation study and how scores are broken down
into risk and no-risk categories.
6. Describe how the classification analyses for the cross-validation study were performed and cut-points
determined.
C. RELIABILITY
1a. In the chart below, report reliability information (e.g., Cronbach’s alpha, test-retest, split-half, and/or
inter-rater reliability estimates).
If you include inter-rater reliability, provide formula:
Type of       Age or   n         Coefficient        SEM   Information (including
Reliability   Grade    (range)   (range / median)         normative data)/Subjects


Manual cites other published reliability studies: ☐ yes ☐ no
*Attach manual.
*Attach published studies.
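For internal-consistency reliability of the kind reported in the chart above, Cronbach's alpha is a common choice. The sketch below shows the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of totals), applied to a small set of entirely hypothetical item scores; a real submission would of course report coefficients from the actual norming data.

```python
# Hypothetical data: scores[i][j] = score of student j on item i.
scores = [
    [3, 4, 3, 5, 4, 2],
    [2, 4, 4, 5, 3, 2],
    [3, 5, 4, 4, 4, 1],
]

def variance(xs):
    """Sample variance (n - 1 denominator)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

k = len(scores)                                   # number of items
item_var_sum = sum(variance(item) for item in scores)
totals = [sum(item[j] for item in scores) for j in range(len(scores[0]))]
alpha = k / (k - 1) * (1 - item_var_sum / variance(totals))
print(round(alpha, 3))                            # 0.895
```

Test-retest and alternate-form reliability are instead simple correlations between the two administrations, and would slot into the same chart rows.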
1b. Do you provide, in your user’s manual, reliability data that are disaggregated by diverse populations
(e.g., race-ethnicity, ELL students)? If so, complete below for each group for which you have
disaggregated reliability data.
If you include inter-rater reliability, provide formula:
Type of       Age or   n         Coefficient        SEM   Information (including
Reliability   Grade    (range)   (range / median)         normative data)/Subjects


Manual cites other published reliability studies: ☐ yes ☐ no
*Attach manual.
*Attach published studies.
Copy additional pages as needed.
D. VALIDITY
1a. In the chart below, report information on predictive, content, and construct validity.[2]
Type of    Age or   Test or     n         Coefficient (if applicable)   Information (including
Validity   Grade    Criterion   (range)   (range / median)              normative data)/Subjects


Other forms of validity:
Manual cites other published validity studies: ☐ yes ☐ no
*Attach manual.
*Attach published studies.
[2] For predictive validity, the TRC suggests predicting to an outcome between 3 months and 1 year after the screen. For construct validity, the TRC suggests predicting to an outcome that is within 3 months of the screen.
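A predictive validity coefficient of the kind the chart requests is typically a Pearson correlation between screening scores and the later criterion measure. The sketch below is a hypothetical illustration (the fall and spring scores are made up), not data from any actual tool.

```python
def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (sd_x * sd_y)

# Hypothetical fall screening scores and spring outcome scores
# (roughly 6 months apart, within the TRC's suggested window).
fall_screen = [12, 25, 31, 40, 44, 52]
spring_outcome = [88, 95, 101, 104, 112, 118]
print(round(pearson_r(fall_screen, spring_outcome), 3))   # 0.983
```

The same computation, run against a criterion administered within 3 months of the screen, would serve as a construct (concurrent) validity coefficient for the chart.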
1b. Do you provide, in your user’s manual, validity data that are disaggregated by diverse populations (e.g.,
race-ethnicity, ELL students)? If so, complete below for each group for which you have disaggregated
validity data.
Type of    Age or   Test or     n         Coefficient (if applicable)   Information (including
Validity   Grade    Criterion   (range)   (range / median)              normative data)/Subjects


Other forms of validity:
Manual cites other published validity studies: ☐ yes ☐ no
*Attach manual.
*Attach published studies.
Copy additional pages as needed.