期刊文献+

改进的认知诊断模型项目功能差异检验方法——基于观察信息矩阵的Wald统计量 被引量:14

An improved method for differential item functioning detection in cognitive diagnosis models: An application of Wald statistic based on observed information matrix
在线阅读 下载PDF
导出
摘要 Hou,de la Torre和Nandakumar(2014)提出可以使用Wald统计量检验DIF,但其结果的一类错误率存在过度膨胀的问题。本研究中提出了一个使用观察信息矩阵进行计算的改进后的Wald统计量。结果表明:(1)使用观察信息矩阵计算的这一改进后的Wald统计量在DIF检验中具有良好的一类错误控制率,尤其是在项目具有较高区分能力的时候,解决了以往研究中一类错误率过度膨胀的问题。(2)随着样本量的增加以及DIF量的增大,使用观察信息矩阵计算Wald统计量的统计检验力也在增加。 In cognitive diagnostic models(CDMs), differential item functioning(DIF) refers to the probabilities of success of an item being different for examinees with the same attribute mastery pattern in the groups. The detection of DIF is an important step to ensure the fairness and validity of results from CDMs for all groups. Hou et al.(2014) proposed that the Wald statistic can be used to detect DIF in CDMs. Unfortunately, their results revealed that the Wald statistic based on the information matrix estimation method developed by de la Torre(2009, 2011) yielded inflated Type I error rates. However, Li and Wang(2015) found that the Type I error rates of the Wald statistic in which MCMC algorithms were implemented were slightly inflated in their study under the same conditions. In this study, we proposed an improved Wald statistic based on the observed information matrix for DIF assessment. As a general demonstration, we took the log-linear cognitive diagnosis model(LCDM; Henson et al., 2009) as an example. In this simulation study, in order to compare the results with previous studies(e.g., Hou et al.,2014; Li Wang, 2015), we followed the simulation design used by Hou et al.(2014), except that we implemented the observed or cross-product(XPD) information matrix in the Wald statistic computation. Parameters set in the studies were: the test length at 30, the number of attributes at 5, and the maximum number of required attributes for an item at 3. Binary item response data were generated from the DINA model. Three sets of true item parameter values were considered( g j ?s j?.1,.2, or.3) for the reference group. Two DIF sizes:.05 and.10, and two types of DIF: uniform and nonuniform, were manipulated. Two sample sizes were considered, 500 and 1,000. Each condition was replicated 1000 times, and the estimation code was written in R(R Core Team, 2015). The simulation results showed that:(1) for the relatively discriminating items, Wald statistic had accurate Type I error control when the observed information matrix was used in its computation. However, when the slip and guessing parameters were large( s j ?g j? 0.3), the Type I error control was slightly conservative.(2) When the XPD information matrix was used for the computation of the Wald statistic, the Type I error control was conservative; that is, the performance of the observed information matrix was better than the XPD information matrix.(3) The number of attributes required for success on the item did not have a notable impact on the Type I error control of Wald statistic, irrespective of whether the observed or the XPD information matrix was used for the statistic.(4) The power rates of Wald statistic for detecting DIF increased as the sample size increased. We conclude that our improved Wald statistic provided follows asymptotically a chi-square distribution with degrees of freedom equal to 2, for DINA model. The improved Wald statistic is a useful and powerful tool for DIF detection in CDMs.
出处 《心理学报》 CSSCI CSCD 北大核心 2016年第5期588-598,共11页 Acta Psychologica Sinica
基金 国家自然科学基金面上项目(31371047) 中央高校基本科研业务费专项资金资助(SKZZX2013028)
关键词 Wald统计量 项目功能差异 认知诊断模型 观察信息矩阵 经验交叉相乘信息矩阵 Wald statistic differential item functioning cognitive diagnosis model observed information matrix cross-product information matrix
  • 相关文献

参考文献2

二级参考文献23

  • 1陈平,辛涛.(2011).认知诊断计算杌化自适应测验中的项目增补-以DINA模型为例.博士论文.北京师范大学.
  • 2Brennan, R. L. (2006). Education measurement (4 ed. ). West- port, CT: American Council on Education and Pmeger Pub- lishers.
  • 3de la Torre, J. (2009). DINA model and parameter estimation :A didactic. Journal of Educational and Behavioral Statistics, 34 (1),115-130.
  • 4de la Torte, J. , & Douglas, J. A. (2004). Higher - order latent trait models for cognitive diagnosis. Psychometrika, 69 ( 3 ), 333 - 353.
  • 5de la Torre, J. (2011). The Generalized DINA Model Frame- work. Psychometaka, 76( 2 ), 179 - 199.
  • 6Flier,H. ,Mellenbergh,G. J. ,Ader,H. J. ,& Wijn,M. (1984). An iterative item bias detection method. Journal of Education- al Measurement,21 (2) ,131 - 145.
  • 7Haertel,E. H. (1989). Using restricted latent class models to map the skill structure of achievement items. Journal of Edu- cational Measurement ,26(4 ) ,301 - 321.
  • 8Holland,P. W. , & Thayer, D. T. (1988). Differential item per- formance and the Mantel -Haenszel procedure. Test Validity, 129 -145.
  • 9Hou, L. (2013). Differential item functioning assessment in cog- nitive diagnostic modeling:Applying the wald test to investi-gate DIF in the Generalized DINA Model Framework../ournal of Educational Measurement,51 ( 1 ) ,98 - 125.
  • 10Hou, L. , de la Torte, J. , & Nandakumar, R. ( 2014). Differential item functioning assessment in cognitive diagnostic modeling: Application of the wald test to investigate DIF in the DINA Model. Journal of Educational Measurement, 51 ( 1 ), 98 - 125.

共引文献8

同被引文献46

引证文献14

二级引证文献38

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部