%0 Journal Article
%J Journal of Computerized Adaptive Testing
%D 2023
%T An Extended Taxonomy of Variants of Computerized Adaptive Testing
%A Roy Levy
%A John T. Behrens
%A Robert J. Mislevy
%K Adaptive Testing
%K evidence-centered design
%K Item Response Theory
%K knowledge-based model construction
%K missingness
%B Journal of Computerized Adaptive Testing
%V 10
%G English
%N 1
%R 10.7333/2302-100101

%0 Conference Paper
%B IACAT 2017 Conference
%D 2017
%T From Blueprints to Systems: An Integrated Approach to Adaptive Testing
%A Gage Kingsbury
%A Tony Zara
%K CAT
%K integrated approach
%K Keynote
%X

For years, test blueprints have told test developers how many items and what types of items will be included in a test. Adaptive testing adopted this approach from paper testing, and it is reasonably useful. Unfortunately, 'how many items and what types of items' are not all the elements one should consider when choosing items for an adaptive test. To fill in gaps, practitioners have developed tools to allow an adaptive test to behave appropriately (e.g., exposure control, content balancing, and item drift procedures). Each of these tools involves the use of a separate process external to the primary item selection process.

The use of these subsidiary processes makes item selection less optimal and makes it difficult to prioritize aspects of selection. This discussion describes systems-based adaptive testing. This approach uses metadata concerning items, test takers and test elements to select items. These elements are weighted by the stakeholders to shape an expanded blueprint designed for adaptive testing. 

%B IACAT 2017 Conference
%I Niigata Seiryo University
%C Niigata, Japan
%8 08/2017
%G eng
%U https://drive.google.com/open?id=1CBaAfH4ES7XivmvrMjPeKyFCsFZOpQMJ

%0 Journal Article
%J Journal of Computerized Adaptive Testing
%D 2017
%T Latent-Class-Based Item Selection for Computerized Adaptive Progress Tests
%A van Buuren, Nikky
%A Eggen, Theo J. H. M.
%K computerized adaptive progress test
%K item selection method
%K Kullback-Leibler information
%K Latent class analysis
%K log-odds scoring
%B Journal of Computerized Adaptive Testing
%V 5
%P 22-43
%U http://iacat.org/jcat/index.php/jcat/article/view/62/29
%N 2
%R 10.7333/1704-0502022

%0 Journal Article
%J Journal of Computerized Adaptive Testing
%D 2014
%T Detecting Item Preknowledge in Computerized Adaptive Testing Using Information Theory and Combinatorial Optimization
%A Belov, D. I.
%K combinatorial optimization
%K hypothesis testing
%K item preknowledge
%K Kullback-Leibler divergence
%K simulated annealing
%K test security
%B Journal of Computerized Adaptive Testing
%V 2
%P 37-58
%G English
%U http://www.iacat.org/jcat/index.php/jcat/article/view/36/18
%N 3
%R 10.7333/1410-0203037

%0 Journal Article
%J Applied Psychological Measurement
%D 2010
%T Item Selection and Hypothesis Testing for the Adaptive Measurement of Change
%A Finkelman, M. D.
%A Weiss, D. J.
%A Kim-Kang, G.
%K change
%K computerized adaptive testing
%K individual change
%K Kullback–Leibler information
%K likelihood ratio
%K measuring change
%X

Assessing individual change is an important topic in both psychological and educational measurement. An adaptive measurement of change (AMC) method had previously been shown to exhibit greater efficiency in detecting change than conventional nonadaptive methods. However, little work had been done to compare different procedures within the AMC framework. This study introduced a new item selection criterion and two new test statistics for detecting change with AMC that were specifically designed for the paradigm of hypothesis testing. In two simulation sets, the new methods for detecting significant change improved on existing procedures by demonstrating better adherence to Type I error rates and substantially better power for detecting relatively small change. 

%B Applied Psychological Measurement
%V 34
%P 238-254
%G eng
%N 4
%R 10.1177/0146621609344844

%0 Journal Article
%J Journal of Pediatric Orthopedics
%D 2008
%T Measuring physical functioning in children with spinal impairments with computerized adaptive testing
%A Mulcahey, M. J.
%A Haley, S. M.
%A Duffy, T.
%A Pengsheng, N.
%A Betz, R. R.
%K *Disability Evaluation
%K Adolescent
%K Child
%K Child, Preschool
%K Computer Simulation
%K Cross-Sectional Studies
%K Disabled Children/*rehabilitation
%K Female
%K Humans
%K Infant
%K Kyphosis/*diagnosis/rehabilitation
%K Male
%K Prospective Studies
%K Reproducibility of Results
%K Scoliosis/*diagnosis/rehabilitation
%X BACKGROUND: The purpose of this study was to assess the utility of measuring current physical functioning status of children with scoliosis and kyphosis by applying computerized adaptive testing (CAT) methods. Computerized adaptive testing uses a computer interface to administer the optimal items based on previous responses, reducing the number of items needed to obtain a scoring estimate. METHODS: This was a prospective study of 77 subjects (0.6-19.8 years) who were seen by a spine surgeon during a routine clinic visit for progressive spine deformity. Using a multidimensional version of the Pediatric Evaluation of Disability Inventory CAT program (PEDI-MCAT), we evaluated content range, accuracy and efficiency, known-group validity, concurrent validity with the Pediatric Outcomes Data Collection Instrument, and test-retest reliability in a subsample (n = 16) within a 2-week interval. RESULTS: We found the PEDI-MCAT to have sufficient item coverage in both self-care and mobility content for this sample, although most patients tended to score at the higher ends of both scales.
Both the accuracy of PEDI-MCAT scores as compared with a fixed format of the PEDI (r = 0.98 for both mobility and self-care) and test-retest reliability were very high [self-care: intraclass correlation (3,1) = 0.98, mobility: intraclass correlation (3,1) = 0.99]. The PEDI-MCAT took an average of 2.9 minutes for the parents to complete. The PEDI-MCAT detected expected differences between patient groups, and scores on the PEDI-MCAT correlated in expected directions with scores from the Pediatric Outcomes Data Collection Instrument domains. CONCLUSIONS: Use of the PEDI-MCAT to assess the physical functioning status, as perceived by parents of children with complex spinal impairments, seems to be feasible and achieves accurate and efficient estimates of self-care and mobility function. Additional item development will be needed at the higher functioning end of the scale to avoid ceiling effects for older children. LEVEL OF EVIDENCE: This is a level II prospective study designed to establish the utility of computer adaptive testing as an evaluation method in a busy pediatric spine practice.
%B Journal of Pediatric Orthopedics
%7 2008/03/26
%V 28
%P 330-5
%8 Apr-May
%@ 0271-6798 (Print) 0271-6798 (Linking)
%G eng
%M 18362799
%2 2696932

%0 Journal Article
%J Journal of Clinical Epidemiology
%D 2005
%T Simulated computerized adaptive tests for measuring functional status were efficient with good discriminant validity in patients with hip, knee, or foot/ankle impairments
%A Hart, D. L.
%A Mioduski, J. E.
%A Stratford, P. W.
%K *Health Status Indicators
%K Activities of Daily Living
%K Adolescent
%K Adult
%K Aged
%K Aged, 80 and over
%K Ankle Joint/physiopathology
%K Diagnosis, Computer-Assisted/*methods
%K Female
%K Hip Joint/physiopathology
%K Humans
%K Joint Diseases/physiopathology/*rehabilitation
%K Knee Joint/physiopathology
%K Lower Extremity/*physiopathology
%K Male
%K Middle Aged
%K Research Support, N.I.H., Extramural
%K Research Support, U.S. Gov't, P.H.S.
%K Retrospective Studies
%X BACKGROUND AND OBJECTIVE: To develop computerized adaptive tests (CATs) designed to assess lower extremity functional status (FS) in people with lower extremity impairments using items from the Lower Extremity Functional Scale and compare discriminant validity of FS measures generated using all items analyzed with a rating scale Item Response Theory model (theta(IRT)) and measures generated using the simulated CATs (theta(CAT)). METHODS: Secondary analysis of retrospective intake rehabilitation data. RESULTS: Unidimensionality of items was strong, and local independence of items was adequate. Differential item functioning (DIF) affected item calibration related to body part, that is, hip, knee, or foot/ankle, but DIF did not affect item calibration for symptom acuity, gender, age, or surgical history. Therefore, patients were separated into three body part specific groups. The rating scale model fit all three data sets well. Three body part specific CATs were developed: each was 70% more efficient than using all LEFS items to estimate FS measures. theta(IRT) and theta(CAT) measures discriminated patients by symptom acuity, age, and surgical history in similar ways. theta(CAT) measures were as precise as theta(IRT) measures. CONCLUSION: Body part-specific simulated CATs were efficient and produced precise measures of FS with good discriminant validity.
%B Journal of Clinical Epidemiology
%V 58
%P 629-38
%G eng
%M 15878477