Chapter 6 Construction of Knowledge Tests

advertisement
Chapter 6
Construction of Knowledge Tests
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
Chapter Objectives
After completing this chapter, you should be able to
1. List and describe the steps for knowledge test
construction.
2. Construct a table of specifications and explain its use.
3. State the purposes of item analysis.
4. Define item difficulty, index of discrimination, and
response quality.
5. Conduct item analysis.
6. Contrast the advantages and disadvantages of various
test items.
7. Construct true-false, multiple-choice, short answer,
completion, matching, and essay test items.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-2
Steps in Construction of a Test
Knowledge test refers to tests that measure thought
processes.
TEST PLANNING
1. Consider content validity.
2. Develop a table of test specifications (serves as an
outline for construction of the test).
Includes:
- kinds and number of test items
- kinds of tasks (though processes) the items will present
and number of each kind of tasks
- content area and number of items in each area
(See table 6.1.)
6-3
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
Steps in Construction of a Test
Table 6.1 Specifications for a 50-item multiple-choice test
Content Area
History
Rules
Technique
Offensive
strategy
Defensive
strategy
Task
(Number of Questions and Percentages of Total)
Knowledge Comprehension Analysis Application
2(4%)
5(10%)
5(10%)
5(10%)
2(4%)
5(10%)
8(16%)
4(8%)
5(10%)
4(8%)
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
5(10%)
6-4
Steps in Construction of a Test
*Most commonly used kinds of objective items are
multiple-choice, true-false, matching, and completion.
The total number of items usually determined by:
- length of time to take to take test
- the length of the items
- the difficulty of the items
- the conditions under which the test is to be administered
- age of individuals taking the test
*In addition to factual information, test takers may be
asked to demonstrate ability (thought processes) to:
- comprehend
- apply
- evaluate
- analyze
- synthesize
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-5
Table 6.2 Bloom’s Classification of Thought
Processes (Cognitive Behavior)
1.
2.
3.
4.
5.
6.
Knowledge – recall
Comprehension – lowest level of understanding
Application – use of knowledge and understanding
Analysis – separate whole into parts
Synthesis – rearrange
Evaluation (most advanced) – make judgments about
value of information
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-6
Steps in Construction of a Test
*Content area deals with areas covered during instruction.
In physical education test might include:
- history
- technique or mechanical analysis
- terminology
- strategy
- rules
- benefits of participation
- equipment
*Item difficulty should be related to the purpose of the
test; determined only after test is administered; but
through experience you can develop the ability to
estimate.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-7
Test Item Construction
General Guidelines
1. Allow enough time to complete the test construction.
2. No item should be included on a test unless it covers an
important fact, concept, principle, or skill. Why is the test
taker responsible for this? What is the value of this point?
What future benefit will it have?
3. Items should be independent of each other.
4. Write simply and clearly.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-8
Test Item Construction
General Guidelines
5. Be flexible; as a general rule the test should include more
than one type of item.
6. Place easy items first.
7. Record the test number of each item in the table of test
specifications.
8. Prepare clear, concise, and complete directions.
9. Ask your peers to review the test.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-9
Test Administration
Guidelines
1. Provide a typed copy of the test.
2. Start the test on time.
3. Be sure the test is administered under normal conditions.
4. Read the directions to the test takers.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-10
Item Analysis
Determines the quality of the items.
Purposes
• Indicates which items may be too easy or too difficult
• Indicates which items may fail to discriminate clearly
between the better and poor students for reasons other than
item difficulty
• Indicates why an item has not functioned effectively and
how it might be improved
• Improves your skills in test construction
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-11
Steps In Item Analysis
1. Arrange the scored tests in order from high score to low
score.
2. Determine the upper 27% of the test scores and place
them in one group (referred as upper group). Do the
same for the bottom 27% of the test scores (referred to
as lower group). Although upper and lower groups of
27% are considered the best for maximizing the
difference between the two groups, any percentage
between 25% and 33% may be used.
3. Tally the number of times the correct response to each
item was chosen on the tests of each group.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-12
Item Difficulty
Defined as the proportion of test takers who answer an
item correctly.
If upper and lower groups are not formed, the difficulty index
(p) may by found by:
p = number answering correctly
total number in group
Example
If 60 test takers completed a test and 41 correctly answered
an item, the
difficulty index would be:
p = 41 = .68
60
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-13
Item Difficulty
Use of upper group (UG) and lower group (LG)
p = number correct in UG + number correct in LG
number in UG + number in LG
Example
number of test takers = 150
number in UG = .27 x 150 = 40.5 = 41
number in LG = .27 x 150 = 40.5 = 41
number correct in UG = 33
number correct in LG = 17
p = 33 + 17 = 50 = .61
41 + 41 82
INTERPRETATION!
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-14
Item Difficulty
Table 6.3 Evaluation of Item Difficulty
Difficulty Index
Item Evaluation
.80 and higher
reject item
.71 - .79
accept item if index of discrimination is
acceptable, but revise if discrimination is
marginal
.30 - .70
accept item if index of discrimination is
acceptable
.20 - .29
accept item if index of discrimination is
acceptable, but revise if discrimination is
marginal
.19 and below
reject item
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-15
Item Difficulty
*Easy item has a high index and difficult item has low index.
*Typical norm-referenced test includes items with a range of
difficulty; average test item difficulty should be around
50%.
*Difficulty for criterion-referenced tests is established at the
minimum proficiency level; items on C-R tests are usually
constructed so that at least 80% to 85% of test takers are
expected to pass.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-16
Item Difficulty
*Interpretation of item difficulty not always an easy task.
*Item may be easy either because answer is obvious (poor
construction) or because test takers have learned the
material.
*May be difficult because it is poorly constructed or
because the test takers have not learned the material.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-17
Item Discrimination
Determines how well the item differentiates between
the good test taker and the poor test taker.
If item discriminates, more test takers with high scores
will answer the item correctly than will test takers with
low scores.
D = number correct in UG - number correct in LG
number in each group
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-18
Item Discrimination
Example
number of test takers = 150
number in UG = .27 x 150 = 40.5 = 41
number in LG = .27 x 150 = 40.5 = 41
number correct in UG = 33
number correct in LG = 17
D = 33 - 17 = 16 = .39
41
41
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-19
Item Discrimination
*Index of discrimination can range from +1.00 to -1.00.
*Negative index indicates that more individuals in LG
answered item correctly than individuals in UG.
*Negative index has no place in test.
*Generally, index of .40 or above on norm-referenced test
indicates that item discriminates well.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-20
Item Discrimination
*For criterion-referenced tests, administer test before
instruction (pretest) and administer test after instruction
(posttest); if large difference in proportion of correct
answers from pretest to posttest, item discriminates.
*Test item with difficulty index between .30 and .70 has
good chance of being a discriminating item, but should
not assume this always to be true.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-21
Item Discrimination
Table 6.4 Evaluation of Index of Discrimination
Index of Discrimination
Item Evaluation
.40 and above
item discriminates; accept item
if item difficulty acceptable
.30 -.39
.20 - 29
below .20
item provides reasonably good
discrimination; may need
improvement, particularly if
item difficulty is marginal
item provides marginal
discrimination; consider revision
if item difficulty acceptable
item does not discriminate; reject
item
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-22
Response Quality
*Choices of answers for multiple-choice items called
responses.
*Incorrect responses are referred to as distractors or foils.
*Each response should be selected by some of the students;
a response should be selected by at least 2% to 3% of the
test takers.
*Should consider pattern of incorrect responses by UG and
LG; if an incorrect response selected by many students in
the UG but few in LG, item may need revision.
*Item analysis of multiple-choice test should include a
record of the number of students who selected each
response as well as the item difficulty and index of
discrimination.
6-23
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
Item Revision
*After completing the item analysis, you should perform the
necessary revisions.
*Revision usually involves discarding or rewording some
items, changing responses, and changing items to different
types.
*After at least two administrations of the test to similar
groups, with the necessary revisions, you should have a
good test.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-24
Table 6.5 Example of Item Analysis for
Multiple-Choice Test
60 students completed the test; groups of 27% (16 test scores in
each group); correct responses in bold print
Item
Responses
p
D
A
B
C
D
1
UG
1
13
2
0
.50
.63
LG
4
3
5
4
3
UG
LG
A
0
1
B
8
8
C
0
0
D
8
7
.50
.00
Item 1. All responses considered; difficulty and discrimination good.
Retain item.
Item 3. Response C not considered and A considered only once; no
discrimination. Reject item.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-25
Objective Test Items
• Choices of answers provided for each test item
• True-false, multiple-choice, and matching
items
• With each item, test taker must select one of
choices provided
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-26
Objective Test Items
Objective test items can measure more than simple
recognition, rote memory, or association.
Can measure several different kinds of thought processes;
e.g., comprehension, analysis, and application.
Takes time to construct objective test items that measure
different kinds of thought processes.
Table 6.6 includes tasks often associated with each thought
process
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-27
True-False Items
*A declarative statement; test taker must decide if the
statement is correct or incorrect.
*Often limited to factual content, but can be used to test
applications, principles, and knowledge in the form of
propositions.
Examples
If the score is 15-30, the server is ahead in points.
If the score is 15-30, the serve should be to the receiver’s
left service court.
*Also can use T/F items to describe a situation and then ask
test-taker to respond to items about the situations; e.g.,
game situations and strategies.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-28
True-False Items
Advantages
1. Cover wide range of material in a single testing period.
2. Easy to score.
3. Generally, easy to construct. However, if thought
processes other simple knowledge are measure, test will
require some time to construct.
Disadvantages
1. Random guessing could produce a score of 50% correct.
2. With 50% chance of guessing correct answer, reliability
of test items tends to be lower.
3. Correct answer often depends on one word.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-29
True-False Items
Guidelines for Writing True-False Items
1. Avoid use of specific determiners. Words like all, always,
never, no, and none indicate question is probably false.
Words like sometimes, usually, and typically suggest that
item is probably true.
2. Include an equal number of true and false item, or include
more false items than true items.
3. Avoid the exact language of the textbook.
4. Avoid trick items.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-30
True-False Items
Guidelines for Writing True-False Items
5. Avoid negative and double negative terms. Underline
negative terms if you use them.
6. Avoid ambiguous statements.
7. All items should be of the same approximately length.
8. Limit each item to a single concept.
SEE EXAMPLES
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-31
Multiple-Choice Items
*Consist of two main components: the stem and three to five
responses (one correct response and incorrect responses
referred to as distractors or foils)
*Stem may be more than one sentence long; usually a direct
question or an incomplete statement.
*Should present problem in enough detail so that there is no
ambiguity about what is being asked.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-32
Multiple-Choice Items
Advantages
1. Can measure almost any understanding or ability if
designed to do so.
2. Can be used to test most types of material.
3. Chances of guessing correct answer are much less than
they are for true-false items.
4. Can be scored easily.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-33
Multiple-Choice Items
Disadvantages
1. More difficult to construct than other objective tests;
considerable time to develop good items that include at
least four responses.
2. Sometime encourage memorization of facts rather than
understanding of concepts.
3. Less material can be covered than with true-false test.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-34
Multiple-Choice Items
Guidelines for Writing Multiple-Choice Items
Stem Construction
1. Stem should be concise, easy to read and understand,
and contain the central issue of the item. Properly
constructed item has meaning by itself so that good
student knows correct answer before reading all
responses.
2. Avoid specific determiners such as always, never, all,
none, and so on.
3. Avoid negative wording; if necessary to use, underline.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-35
Multiple-Choice Items
4. Not mandatory, but sometimes helps test taker to begin
stem with w word such as which, why, where, what, when,
or who. Introduces main point of stem.
5. Do not work stem so that you are asking opinion of test
taker.
6. If testing definition, place word in stem and definitions in
responses
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-36
Multiple-Choice Items
Response Construction
1. All responses should be plausible, but only one correct
response.
2. Use at least four responses for each item.
3. All responses should be grammatically consistent,
homogeneous in content, and approximately the same
length. If some responses begin with a vowel but others in
the same item do not, us a(n) in the stem to introduce the
response.
4. If items numbered, use A, B, C, D, and E to designate the
responses. Also, unless limited by the number of pages,
place responses in vertical order.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-37
Multiple-Choice Items
Response Construction
5. Avoid patterns in positions of correct responses. Use A,
B, C, D, and E equally.
6. Use the response “none of the above” or “all of the
above” with care.
7. When possible, list the responses in logical or
sequential order. If arrange in sequence, correct
response occasionally should be first or last in
sequence.
SEE EXAMPLES
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-38
Matching Items
*Consists of a column of items (stimulus words of phrases) on
the left-hand side of a page and a column of options
(alternatives) on the right.
*Test taker selects the option that correctly associates with the
item.
*Similar to multiple-choice items; options serve as alternatives
for all items.
*Also similar to short-answer items; usually limited to specific
factual information (names, dates, labels)
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-39
Matching Items
Advantages
1. Easy to construct and score.
2. Provide many scoreable responses per test page or per unit
of test time.
3. Motive test taker to integrate their knowledge and consider
relations among items.
4. Odds of guessing the correct answer are low.
Disadvantages
1. Time consuming to complete.
2. Usually test on factual material.
3. Limited to association tasks.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-40
Matching Items
Guidelines for Writing Matching Items
1. Include only homogeneous material in each matching
exercise.
2. Clear directions - if an option be use more than once; if
each item has only one correct answer; how the marking
is to be done.
3. Keep the sets of items relatively short (five or six in lower
grades and ten to fifteen in upper grades).
4. Place all items and options for a matching exercise on one
page.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-41
Matching Items
5. Use an appropriate format. Usually best to the
homogeneous items on the left and the options on the
right. If permitted to write on test, leave a blank space
beside each numbered item for the letter of the
matched option.
6. Arrange the responses in alphabetical or logical order;
reduces time to find correct answer.
7. Develop more options than items; usually two or three
more options.
SEE EXAMPLES
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-42
Short-Answer and Completion Items
*Distinction between short-answer items and completion
items is primarily the length and format of the response.
*Short-answer item requires test taker to respond to a
question with a word, a phrase, or a sentence or two.
*In a completion item, the simplest short-answer form,
the test taker is asked to provide the missing
information.
*Both items are suited to measure factual knowledge,
comprehension of principles, ability to identify and
define concepts.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-43
Short-Answer and Completion Items
Advantages
1. Affected much less by guessing than are true-false or
multiple-choice items.
2. Come closer to assessing recall, as contrasted with
recognition
3. Are valuable when steps or procedures are to be learned.
4. Are easy to construct.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-44
Short-Answer and Completion Items
Disadvantages
1. Scoring takes longer than choice-type items, especially
if spelling considered.
2. Unless extreme care is taken in construction, a number of
answers might be wholly or partially correct; sometimes
means only test constructor is able to score tests.
3. Encourage rote learning; occasions when recall and
memorization are appropriate; e.g., CPR
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-45
Short-Answer and Completion Items
Guidelines for Writing Short-Answer and Completion
Items
1. Be sure item can be answered with a unique word, phrase,
or number and that only one answer is correct.
2. Be sure test taker knows what type of response is required;
indicate how precise response should be.
3. Think of the answer first. Then write item.
4. With completion items, place the blank near the end of the
sentence; intent of item clearer and avoids multiple
answers.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-46
Short-Answer and Completion Items
5. Use no more than two blanks in an item.
6. Avoid lifting items directly from textbook. One sentence
taken out of context from a paragraph may fail to
adequately present concept of paragraph.
7. Make the actual blanks for the responses the same
length.
SEE EXAMPLES
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-47
Essay Test Items
*Designed to measure ability to use higher mental
processes - identifying, interpreting, integrating,
organizing, and synthesizing.
Advantages
1. Easily and quickly constructed.
2. Can measure complex concepts, thinking ability, and
problem-solving skills.
3. Encourage test taker to learn how to effectively organize
and express their own ideas.
4. Minimize guessing.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-48
Essay Test Items
Disadvantages
1. Time consuming to score.
2. Scoring requires some decision making on the part of
scorer; reliability may be decreased.
3. Since they take longer to answer, only a few items can be
answered in test period.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-49
Essay Test Items
Guidelines for Writing Essay Items
1. Should require test taker to demonstrate a command of
essential knowledge (not reproduction of materials
presented in textbook).
2. Phrase each item so that only one answer is correct.
3. Indicate the scope and direction of the required answer;
vague phrasing leads to a wide variation of responses.
4. Require all test takers to answer the same questions; if
answer different items, basis for comparing is limited.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-50
Essay Test Items
Guidelines for Writing Essay Items
5. Indicate the approximate amount of time that should be
devoted to each answer; also state point value.
6. Generally, best to use a reasonable number of short essay
items rather than a few longer ones.
7. Write the ideal answer for each item; provides idea of
item’s reason and aids in scoring.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-51
Essay Test Items
Guidelines for Scoring Essay Items
1. Develop a method for scoring items. May identify
essential points that should be included; also may rank
each item according to the quality of response.
2. Evaluate the same item on all test taker’s papers before
going on to the next item. Also wise to occasionally
check your consistency by reviewing how you evaluated
an item on the first few papers you scored.
3. Conceal the name of the person whose test you are
evaluating.
SEE EXAMPLES
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved
6-52
Download