Monday, January 31, 2011
Currently reading : Questionnaire Design
Second edition
Ian Brace
U 658.83 B796Q 2008
Questionnaire Design: introduction & Chapter 1
Sunday, January 30, 2011
Topic 6 : Coding & Cleaning Survey Data
Saturday, January 29, 2011
Modern Measurement: Modern Scaling with Item Response Theory : Chapter 10
Introductory Description of IRT
- What is IRT
o Psychologically based theory of mental measurement that specifies information about latent traits and the characteristics of stimuli used to represent them.
o IRT statistics are not non-parametric
- Relation of IRT to CTT
- Cautionary note on studying IRT
o IRT : theory about latencies and the way they can be estimated
IRT and Invariant Measurement for Items and Persons
- Problem of lack of an independent scale in CTT
o Difficulty in comparing "low esteem" in one test vs another test.
o No common zero point
o Converting scores to z scores does not solve problem – then you only have scores expressed in same metric
- Group Dependent Items and Item Dependent Groups
o Left with relative comparisons
o Eg, test on history – what measurement best represents difficulty of test à depends on group who takes test à eg, primary school vs college
o Difficulty value is group dependent
o Reference group
o Measuring examinee ability is item dependent
- IRT as item and person invariant measurement
- Notion of invariant measurement
o Invariance is an estimable concept [GW – not sure I understood or agreed with this section]
Introduction to IRT Models
- Some commonly used IRT Models
- Models are usually identified by number of characteristics they estimate about a test's stimuli
o One – parameter
o Two / more
- Most popular models
o One-parameter
§ Only item difficulty is estimated
o Two parameter
§ Estimates separate difficulty and discrimination parameters for each item
o Three parameter
§ Includes examinee's probability of guessing or pseudochance
Assumptions
- Centrality of assumptions to IRT
- Unidimensionalty of items and tests
o Given test item or exercise is directly targeted at single cognitive process, and in theory it fills that latent space completely
- Local independence
o Examinees response to a given specific measure reflects an independent and autonomous reference to a latent trait in cognition.
o Examinee responds to stimulus of test item or stimulus, also approaches the stimulus without also thinking about other items or exercises.
o Degree of learning as more items are encountered
- Item characteristic curve
o Defined characteristics of test stimuli are reliably estimable functions
- Certainty of response
o Optimal performance
ICC and IRC
- Specifying ICCs generally
- Inflection point
- Scales allow a trace line to describe functional relationship between characteristics of an item and the trait level of examinee
IRT Models
- Likelihood function
o Examinee of particular ability level has certain probability of getting an item correct
o .likelihood function is joint probability of getting several items correct or incorrect
o More on working with the log scale
o The two parameter model
The one-parameter IRT model and Rasch
- The Rasch Model
Other IRT Models
- Nominal and graded response models for polytomous items
- Richly cognitive models
Estimating Item and Ability Parameters
- Iterative estimation procedures
- Developing priors
- Test information function
- Some estimation procedures
Computer Programs Available for parameter estimation
Brief History and major Contributors to IRT
Friday, January 28, 2011
Modern Measurement: Chapter 9 : Performance Related Measures
A Psychometric Perspective on Performance Assessment
- Terminology of Performance Assessment
- Defining Performance Assessment
- Examples of PA
Characteristics of Performance Assessments
- Three common characteristics
o Require examinees to respond to stimulus in a manner other than selecting one choice from presented alternatives
o Intended to gauge sophisticated thinking skills
o To score them, someone must inspect response and apply evaluative judgment
Organizing Performance Assessments
- Two rough categories of performance assessment
o Task centered
o Construct centered
- A PA taxonomy
o Type of reasoning competency employed
o Nature of cognitive continuum employed
o Kind of response yielded.
Special Types of Performance Assessment
- Authentic assessment as performance assessment
- Rather than use proxy format
- Complex performance assessments
Validity Issues
- Importance and difficulty in amassing validity evidence for Pas
- PA Validity evaluation as interpretative argument
- Construct evidence for Pas
Essays as Performance Assessments
GW Comment / Question
- Is there a survey equivalent to Performance Assessment – focus group ?
Thursday, January 27, 2011
Modern Measurement – Chapter 8 : Constructing Items and Exercises for Tests
Wednesday, January 26, 2011
Topic 4 : Developing a Questionnaire
General Samples and Population Means: Chapter 10: Statistics : A Bayesian Perspective
n | 100 | ||||
x | 30 | ||||
std dev | 10 | ||||
e | 2.718282 | ||||
Model | z-score | Likelihood | P (Model) | Product | P (Model | D) |
30.00 | 0.00 | 1.00000000 | 0.000999 | 0.00099900 | 0.03989423 |
30.10 | -0.10 | 0.99501248 | 0.000999 | 0.00099402 | 0.03969525 |
30.20 | -0.20 | 0.98019867 | 0.000999 | 0.00097922 | 0.03910427 |
30.30 | -0.30 | 0.95599748 | 0.000999 | 0.00095504 | 0.03813878 |
30.40 | -0.40 | 0.92311635 | 0.000999 | 0.00092219 | 0.03682701 |
30.50 | -0.50 | 0.88249690 | 0.000999 | 0.00088162 | 0.03520653 |
30.60 | -0.60 | 0.83527021 | 0.000999 | 0.00083444 | 0.03332246 |
30.70 | -0.70 | 0.78270454 | 0.000999 | 0.00078192 | 0.03122539 |
30.80 | -0.80 | 0.72614904 | 0.000999 | 0.00072542 | 0.02896916 |
30.90 | -0.90 | 0.66697681 | 0.000999 | 0.00066631 | 0.02660852 |
31.00 | -1.00 | 0.60653066 | 0.000999 | 0.00060592 | 0.02419707 |