摘要
采用锚测验非等组设计的数据收集方案,对4种基于经典测量理论的等值方法进行了比较研究。研究数据取自TIMSS1999数据库,兼用等值标准误和交叉验证方法作为各等值方法比较的检验标准,利用CIPE程序对实验数据进行分析。研究结果表明,针对本研究所设置的等值情境,线性等值优于等百分位等值,其中Tucker线性方法比Levine观察分数线性方法更好一些,Braun-Holland线性方法不宜采用,频数估计等百分位方法等值误差较大,亦不足取。
Four equating methods in common-item non-equivalent group design based on the classical test theory were compared. "The research data was selected from the TIMSS1999 database. Both the equating Standard Error and a crossvalidation analysis were used as the evaluation criteria. And the CIPE program was used to compute the data. We came to a conclusion that, in our research situation, the linear equating procedures were superior to the equipercentile method. Of the linear equating methods, the Tucker linear method was better than the Levine Observed Score equating method, and the Braun-Holland linear method was not appropriate. The Frequency Estimation equipercentile method was with large equating error, and should not be used.
出处
《心理发展与教育》
CSSCI
北大核心
2006年第1期97-102,共6页
Psychological Development and Education
基金
国家自然科学基金项目(30440081)
北京市重点实验室建设规划项目(jd100270541)资助
关键词
测验等值
经典测量理论
锚测验设计
test equating
classical test theory
common-item design