Item calibration in incomplete testing designs

نویسندگان

  • Theo J.H.M. Eggen
  • Norman D. Verhelst
چکیده

This study discusses the justifiability of item parameter estimation in incomplete testing designs in item response theory. Marginal maximum likelihood (MML) as well as conditional maximum likelihood (CML) procedures are considered in three commonly used incomplete designs: random incomplete, multistage testing and targeted testing designs. Mislevy and Sheenan (1989) have shown that in incomplete designs the justifiability of MML can be deduced from Rubin's (1976) general theory on inference in the presence of missing data. Their results are recapitulated and extended for more situations. In this study it is shown that for CML estimation the justification must be established in an alternative way, by considering the neglected part of the complete likelihood. The problems with incomplete designs are not generally recognized in practical situations. This is due to the stochastic nature of the incomplete designs which is not taken into account in standard computer algorithms. For that reason, incorrect uses of standard MMLand CML-algorithms are discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Optimal Designs to Item Calibration

In computerized adaptive testing (CAT), examinees are presented with various sets of items chosen from a precalibrated item pool. Consequently, the attrition speed of the items is extremely fast, and replenishing the item pool is essential. Therefore, item calibration has become a crucial concern in maintaining item banks. In this study, a two-parameter logistic model is used. We applied optima...

متن کامل

Reducing the length of mental health instruments through structurally incomplete designs.

This paper presents structurally incomplete designs as an approach to reduce the length of mental health tests. In structurally incomplete test designs, respondents only fill out a subset of the total item set. The scores on the unadministered items are estimated using methods for missing data. As an illustration, structurally incomplete test designs recording, respectively, two thirds, one hal...

متن کامل

An Automatic Online Calibration Design in Adaptive Testing

An accurately calibrated item bank is essential for a valid computerized adaptive test. However, in some settings, such as occupational testing, there is limited access to examinees for calibration. As a result of the limited access to possible examinees, collecting data to accurately calibrate an item bank in an occupational setting is usually difficult. In such a setting, the item bank can be...

متن کامل

Optimal Design for Count Data with Binary Predictors in Item Response Theory

The Rasch Poisson counts model (RPCM) allows for the analysis of mental speed which represents a basic component of human intelligence. An extended version of the RPCM, which incorporates covariates in order to explain the difficulty, provides a means for modern rule-based item generation. After a short introduction into the extended RPCM we will develop locally D-optimal calibration designs fo...

متن کامل

On the complementarity of classical test theory and item response models: item difficulty estimates and computerized adaptive testing

This study aims to provide statistical evidence of the complementarity between classical test theory and item response models for certain educational assessment purposes. Such complementarity might support, at a reduced cost, future development of innovative procedures for item calibration in adaptive testing. Classical test theory and the generalized partial credit model are applied to tests c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010