Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests

DEMİR AYÇİÇEK, SEDA; Atar, Burcu

doi:10.21031/epod.787865

Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests

Yazarlar (2)

Doç. Dr. Seda DEMİR AYÇİÇEK Tokat Gaziosmanpaşa Üniversitesi, Türkiye

Burcu Atar
Hacettepe Üniversitesi, Türkiye

Makale Türü	Özgün Makale
Makale Alt Türü	Ulusal alan endekslerinde (TR Dizin, ULAKBİM) yayınlanan tam makale
Dergi Adı	Journal of Measurement and Evaluation in Education and Psychology
Dergi ISSN	1309-6575 Wos Dergi Scopus Dergi
Dergi Tarandığı Indeksler	TR DİZİN
Makale Dili	İngilizce
Basım Tarihi	03-2021
Cilt No	12
Sayı	1
Sayfalar	15 / 27
DOI Numarası	10.21031/epod.787865
Makale Linki	https://dergipark.org.tr/tr/download/article-file/1264328

Özet

This study aims to compare Sequential Probability Ratio Test (SPRT) and Confidence Interval (CI) classification criteria, Maximum Fisher Information method on the basis of estimated-ability (MFI-EB) and Cut-Point (MFI-CB) item selection methods while ability estimation method is Weighted Likelihood Estimation (WLE) in Computerized Adaptive Classification Testing (CACT), according to the Average Classification Accuracy (ACA), Average Test Length (ATL), and measurement precision under content balancing (Constrained Computerized Adaptive Testing: CCAT and Modified Multinomial Model: MMM) and item exposure control (Sympson-Hetter Method: SH and Item Eligibility Method: IE) when the classification is done based on two, three, or four categories for a unidimensional pool of dichotomous items. Forty-eight conditions are created in Monte Carlo (MC) simulation for the data, generated in R software, including 500 items and 5000 examinees, and the results are calculated over 30 replications. As a result of the study, it was observed that CI performs better in terms of ATL, and SPRT performs better in ACA and correlation, bias, Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) values, sequentially; MFI-EB is more useful than MFI-CB. It was also seen that MMM is more successful in content balancing, whereas CCAT is better in terms of test efficiency (ATL and ACA), and IE is superior in terms of item exposure control though SH is more beneficial in test efficiency. Besides, increasing the number of classification categories increases ATL but decreases ACA, and it gives better results in terms of the correlation, bias, RMSE, and MAE values.

Anahtar Kelimeler

Classification criteria | Computerized adaptive classification testing | Content balancing | Item exposure control | Item selection methods

BM Sürdürülebilir Kalkınma Amaçları

Atıf Sayıları
WoS	1
SCOPUS	1
Google Scholar	3

Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests

Dergi Adı	Journal of Measurement and Evaluation in Education and Psychology-EPOD
Yayıncı	Association of Measurement and Evaluation in Education and Psychology (EPODDER)
Açık Erişim	Hayır
ISSN	1309-6575
E-ISSN	1309-6575
CiteScore	0,9
SJR	0,178
SNIP	0,352

Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests

Paylaş