item analysis2
DESCRIPTION
TRANSCRIPT
Item Analysis
• When analyzing the test item, we have several questions about the performance of each item. Some of these questions include :•Are the item congruent with the test objectives?•Are the item valid? Do they measure what they supposed to measure?•Are the item reliable? Do they measure consistently?•How long does it take an examinee to complete each item?•What item are most difficult to answer correctly?•What item are easy?•Are they any poor performing items that need to be discarded?
Types Of Item Analysis for CTT• Three major types :
1. Assess quality of the distracters
2. Assess difficulty of the items
3. Assess how well an item differentiates between high and low performers
Purposes and Elements of item Analysis
To select the best available items for the final form of the test.
To identify structural or content defects in the items.
To detect learning difficulties of the class as a whole
To identify the areas of weakness of students in need of remediation
Three elements of item analysis
1. Examination of the difficulty level of the items.
2. Determination of the discriminating power of each item, and
3. Examination of the effectiveness of distractors in a multiple choice or matching items.
The difficulty level of an item is known as index of difficulty.Index of difficulty is the percentage of students answering
correctly each item in the testIndex of discrimination refer to the percentage of high-scoring
individuals responding correctly versus the number of low-scoring individuals responding responding correctly to an item.
This numeric index indicates how effectively an item differentiates between the students who did well and those who did poorly on the test.
Preparing Data for Item Analysis
1. Arrange test score from highest to lowest.
2. Ger one-third of the papers from the highest scores and the other third from the lowest scores.
3. Record separately the number of times each alternative was chosen by the students in both groups.
4. Add the number of correct answers to each item made by the combined upper and lower groups.
5. Compute the index of difficulty for each item, following formula :IDF = (NRC/TS)100
Where IDF = index of difficulty NRC = number of students responding correctly to an itemTS = total number of an students in the upper and lower groups.
6. Compute thee index of discrimination, based on the formula :IDN = (CU – CL)
NSGWhere IDN = index of discrimination
CU = number of correct responses of the upper groupCL = number of correct responses of the lower groupNSG = number of student per group
Using information about Index of Difficulty
The difficulty index of a test items tells a teacher about the comprehension of or performance on material or task contained in an item.
For an item to be considered a good item, its difficulty index should be 50%. An item with 50% difficulty index is neither easy nor difficult.
If an item has a difficulty index of 67.5%, this means that it is 67.5% easy and 32.5% difficult.
Information on the index of difficulty of an item can help a teacher decide whether a test should be revised, retained or modified.
Interpretation of the Difficulty Index
Range Difficulty Level
20 & below21-4041-6061-80
81 & above
Very difficultDifficultAverage
EasyVery easy
Using Information About Index Of Discrimination
• The Index Of Discrimination tells a teacher the degree to which a test item differentiates the high achievers from the low achievers in is class. A test item may have positive or negative discriminating power.
• An item has a positive discriminating power when more student from the upper group got the right answer than those from the lowest group.
• When more student from the upper group got the correct answer on an item than those from the upper group, the item has a negative discriminating power.
There are instance when an item has zero discriminating power – when equal number of students from upper and lower group got the right answer to a test item.
In the given example, item 5 has the highest discriminating power. This means that it can differentiate high and low achievers.
Interpretation of the Difficulty Index
Range Verbal Description
.40 & above.30 - .39.20 - .29.09 - .19
Very Good ItemGood ItemFair ItemPoor Item
When should a test item be rejected? Retained? Modified or revised
A test item can be retained when its level of difficulty is average and discriminating power is positive.
It has to rejected when it is either easy / very easy or difficult / very difficult and its discriminating power is negative or zero.
An item can be modified when its difficulty level is average and its discrimination index is negative.
Examining Distracter Effectiveness
An ideal item is one that all student in the upper group answer correctly and all students in the lower group answer wrongly. And the responses of the lower group have to be evenly distributed among the incorrect alternatives.
Developing an Item Data File
Encourage teachers to undertake an item analysis as often as practical
Allowing for accumulated data to be used to make item analysis more reliable
Providing for a wider choice of item format and objectives Facilitating the revision of items Accumulating a large pool of items as to allow for some items
to be shared with the students for study purposes.
Limitations Of Item Analysis
• It cannot be used for essay items.• Teacher must be cautious about what damage may
be due to the table of specifications when items not meeting the criteria are deleted from the test. These items are to be rewritten or replaced.
What is Item Discrimination?
• Generally, student who did well on the exam should select the correct answer to any given item on the exam.
• The Discrimination Index distinguishes for each item between the performance of students who did poorly.
How does it work?
• for each item, subtract the number in the lower group who answered correctly from the number of students in the upper group who answered correctly.
• Divide the result by the number of students in one group.
• The discrimination Index is listed in decimal format and ranges between -1 and 1.
What a “good” value?
Item Discrimination : Examples
1 90 20 0.7
2 80 70 0.1
3 100 0 1
4 100 100 0
5 50 50 0
6 20 60 -04
Item no.
Number of correct answers in group
Upper 1/4 Lower 1/4
Item Discrimination
Index
Quick Reference• Use the following table as a guideline to determine
whether an item ( or its corresponding instruction) should be considered for revision.
Item Discrimination (D)
D = < 0%
0 % < D < 30 %
D > = 30 %
High Medium Low
review review review
ok review ok
ok ok ok
Item Difficulty
Distracter analysis
First question of item analysis : how many people choose each response?
If there only one best response, then all other response options are distracters.
Example from in class assignment (N=35):
Which method has best internal consistensy ?a) Projective test 1b) Peer ratings 1c) Forced choice 21d) Differences n.s. 12
Distracter analysis (cont’d)
• A perfect test item would have 2 characteristics : 1. Everyone who knows the item gets it right 2. People who do know the item will have responses equality distributed
across the wrong answer.
• It is not desirable to have one of the distracters chosen more often then the correct answer.
• This result indicates a potential problem with the question. This distracters may be too similar to the correct answer and /or these maybe something in either the stem or the alternatives that is misleading.
Distracter analysis (cont’d)
• Calculate the # of people expected to choose each of the distracters. If random same expected number for each wrong response (Figure 10-1).
# of Persons N answering incorrectly 14 Exp. To Choose ___________________ = __ =4.7Distracter number of distracters 3
Distracter analysis (cont’d)
When the number of person choosing a distracter significantly exceeds the number expected, these are 2 possibilities:
1. It is possible that choice reflects partial knowledge2. The item is a poorly worded trick question
• Unpopular distracters may lower item and test difficulty because it is easily eliminated
• Extremely popular likely to lower the reliability and validity of the test
Distracter analysis : Definition
• Compare the performance of the highest and lowest scoring 25% of the student on the distracter option (i.e. the incorrect answers presented on the exam)
• Fewer of the top performers should choose each of the distracters as their answer compared to the bottom performers.
Distracter analysis : Examples
Item 1 A B C D E Omit
% of student in upper 1/4 20 5 0 0 0 0
% of student in middle 15 10 10 10 5 0
% of student in lower 1/4 5 5 5 10 0 0
Item 2 A B C D E Omit
% of student in upper ¼ 0 5 5 15 0 0
% of student in middle 0 10 15 5 20 0
% of student in lower 1/4 0 5 10 0 10 0
Distracter Analysis : Discussion
• What is the purpose of a good distracter?
• Which distracters should you consider throwing out?
Item analysis report
Exercise : Interpret Item Analysis
• Review the sample report.• Identify any exam items that may require revision.• For each identify item, list your observation and
hypothesis of the nature of the problem.
Knowledge Or Successful Guessing?
Multiple Choice Exam Strategies-improve odds by eliminating 1 or more infeasible or unlikely answer options
Description Exam Strategies-brain dumping-part marks-consideration for perfect answers to questions that were not asked
Possibility of a “Random Pass”
Depends on the numberof answer options per question and the number of questions!
1
2
4
6
10
20
50
Number of Questions 2 choice 3 choice 4 choice 5 choice
50 33 25 20
75 56 44 36
69 41 26 18
66 32 17 10
62 21 8 3
59 9.2 1.4 .3
56 1 .01 .0004
Percent Pass ( >50%) by Chance