Creating Assessments The three properties of good assessments

  • Slides: 25
Download presentation
Creating Assessments The three properties of good assessments

Creating Assessments The three properties of good assessments

Validity: A case study using The Shakespeare Test “To ensure that my lessons were

Validity: A case study using The Shakespeare Test “To ensure that my lessons were effective, I chose to implement a preand post-test on the three-class periods that would be participating in my Shakespeare unit. . . After administering the pre-test, I found that approximately 90% of the students scored an ‘F’ and the other 10% landed just above the “F” grade level. ”

Validity: A case study using The Shakespeare Test “At the end of the 20

Validity: A case study using The Shakespeare Test “At the end of the 20 -day unit, I administered the post-test. My assumption was that the majority of them would score high on the test. Fortunately, my assumption paid off and I found that there were zero “D” level students and nearly 80% of the students were in the “B” to “A” range. Much like the pre-test, I found this extremely satisfying since I felt I had a few complications here and there throughout the unit. ”

Creating Assessments n All good assessments have three key features: – Validity – Reliability

Creating Assessments n All good assessments have three key features: – Validity – Reliability – Usability

Validity n val·id adj. 1. Well grounded; just. 2. Producing the desired results; efficacious.

Validity n val·id adj. 1. Well grounded; just. 2. Producing the desired results; efficacious. 3. Having legal force; effective or binding. 4. Logic. a. Containing premises from which the conclusion may logically be derived. b. Correctly inferred or deduced from a premise. (American Heritage Dictionary)

Validity

Validity

Validity n Validity: an evaluation of the adequacy and appropriateness of the interpretations and

Validity n Validity: an evaluation of the adequacy and appropriateness of the interpretations and uses of assessment results. (Linn & Gronlund, 2000).

Validity n. Let us emphasize some important points: Validity refers to the appropriateness of

Validity n. Let us emphasize some important points: Validity refers to the appropriateness of the interpretation of the results of an assessment procedure, not to the procedure itself. n. Consider the following example:

Dilbert’s Medical Exam One day Dilbert was not feeling well. He went to the

Dilbert’s Medical Exam One day Dilbert was not feeling well. He went to the Doctor’s office for a check-up. While there he and the doctor had a discussion about who was smarter, doctors or engineers. Dilbert lost the argument when the doctor pointed out that doctors make more money; therefore they must be smarter. After this, the doctor informed Dilbert that his test results indicated he had a terminal condition and would die within a few weeks. This is what happened next.

Validity n Four Primary Categories of Validity: – Content – Construct – Criterion –

Validity n Four Primary Categories of Validity: – Content – Construct – Criterion – Consequential

Validity n Content Validity – How well does the sample of assessment tasks represent

Validity n Content Validity – How well does the sample of assessment tasks represent the domain of tasks to be measured? “Math Content Universe” Topology Numerical Analysis Multivariate Calculus Trigonometry Algebra Geometry Counting + - x ÷

Validity n Construct Validity – How well can performance on the assessment be explained

Validity n Construct Validity – How well can performance on the assessment be explained in terms of psychological characteristics? – How does one measure effort? Love? Hate? Indifference? Intelligence? Situational Awareness?

Validity n Criterion Validity – How accurately does performance on the assessment predict future

Validity n Criterion Validity – How accurately does performance on the assessment predict future performance or estimate the present performance on some other valued measure called a criterion?

Validity n Consequential Validity – How well did use of the assessment serve the

Validity n Consequential Validity – How well did use of the assessment serve the intended purpose and avoid adverse effects?

Validity n Cautions to keep in mind: – Validity is a unitary concept. In

Validity n Cautions to keep in mind: – Validity is a unitary concept. In the gestalt tradition, the “whole” of validity is greater than the sum of its various “parts. ” – Validity is always specific to some use or interpretation, and no assessment is valid for all purposes. An assessment may have high validity in some aspects, but low validity for others.

Validity n Cautions to keep in mind: – Though we often say “validity of

Validity n Cautions to keep in mind: – Though we often say “validity of the test”, it is important to recognize that what is really being validated is the interpretation and use of the test results. – Validity is a matter of degree. It exists along a continuum, the same way gray exists in the continuum between black and white. At best we can classify it as “high”, “moderate”, or “low”

Validity n Cautions to keep in mind: – Validity involves an overall evaluative judgment.

Validity n Cautions to keep in mind: – Validity involves an overall evaluative judgment. This judgment must be based on sound assessment results that include accurate measurement. Errors in the assessment process can radically affect the interpretation of the results and the resulting evaluation.

Validity n To help insure the validity of your assessments, items should be: –

Validity n To help insure the validity of your assessments, items should be: – Written based on the content defined in the objectives. – Assessed at the specified learning level. – Reviewed by competent colleagues.

Validity n The following is a list of common mistakes that will LOWER validity:

Validity n The following is a list of common mistakes that will LOWER validity: – Use too few items to assess the objective. – Provide tasks or write items that do not function as intended (e. g. ambiguous, clues that lead to answers, bias, too easy, too hard, etc. ). – Arrange items from hardest to easiest.

Validity n Mistakes that LOWER validity (cont): – Write items that do not match

Validity n Mistakes that LOWER validity (cont): – Write items that do not match the learning level specified in the objectives. – Impose unrealistic time constraints – Provide an atmosphere that is poorly suited to the purpose (e. g. noisy, messy, malfunctioning equipment). – Provide unclear or confusing directions. – Unfair or unsystematic scoring.

Validity – It is an evil for a well-taught and welltrained student to fail

Validity – It is an evil for a well-taught and welltrained student to fail in an examination. – It is an evil for an unqualified student, through some inefficiency of the test, to obtain credit in an examination. New York State Department of Education, 1906