Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study

Artikelen

Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study

Informatie

Auteurs

André-Sébastien Aubin

Christina St-Onge

Jean-Sébastien Renaud

Soort article

Original Research

Categorie

Detection rates,

Person-fit statistics,

Rater-based assessment,

Verscheen in

PME Volume 7, Issue 2

Lees het hele artikel

Introduction With the Standards voicing concern for the appropriateness of response processes, we need to explore strategies that would allow us to identify inappropriate rater response processes. Although certain statistics can be used to help detect rater bias, their use is complicated by either a lack of data about their actual power to detect rater bias or the difficulty related to their application in the context of health professions education. This exploratory study aimed to establish the worthiness of pursuing the use of l _zto detect rater bias.

Methods We conducted a Monte Carlo simulation study to investigate the power of a specific detection statistic, that is: the standardized likelihood l _zperson-fit statistics (PFS). Our primary outcome was the detection rate of biased raters, namely: raters whom we manipulated into being either stringent (giving lower scores) or lenient (giving higher scores), using the l _zstatistic while controlling for the number of biased raters in a sample (6 levels) and the rate of bias per rater (6 levels).

Results Overall, stringent raters (M = 0.84, SD = 0.23) were easier to detect than lenient raters (M = 0.31, SD = 0.28). More biased raters were easier to detect then less biased raters (60% bias: 62, SD = 0.37; 10% bias: 43, SD = 0.36).

Lees het hele artikel