Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Investigating level descriptors and their impact on validity

View through CrossRef
This article presents a validation study which investigates the relationship between mastery level descriptors and item difficulty in the National Tests of English (NTE) in Norway. The aim is to establish the extent to which the descriptors indicate item difficulty and thus support the argument that the mastery levels are a reflection of the framework of the NTE. This argument strength has direct implications for validity and for the defensibility of moving from criterion-based descriptors to norm-based national data. The study involved a panel of 10 raters assigning level descriptors from 7 content categories to 80 test items, giving a total of 5,600 individual judgements. These judgements are compared to the real test scores of around 46,000 pupils to establish if the level descriptors assigned by raters can predict pupil performance on the tests. The results show a strong correlation between rater judgements and real test scores, meaning that the descriptors offer an indication of item difficulty. However, some individual descriptor categories contain deviations from the expected order. We conclude that, while the level descriptors reflect the test’s framework, and thus support the validity argument, the argument could be strengthened with the revision of some individual descriptors.
The Association for Language Testing and Assessment of Australia and New Zealand
Title: Investigating level descriptors and their impact on validity
Description:
This article presents a validation study which investigates the relationship between mastery level descriptors and item difficulty in the National Tests of English (NTE) in Norway.
The aim is to establish the extent to which the descriptors indicate item difficulty and thus support the argument that the mastery levels are a reflection of the framework of the NTE.
This argument strength has direct implications for validity and for the defensibility of moving from criterion-based descriptors to norm-based national data.
The study involved a panel of 10 raters assigning level descriptors from 7 content categories to 80 test items, giving a total of 5,600 individual judgements.
These judgements are compared to the real test scores of around 46,000 pupils to establish if the level descriptors assigned by raters can predict pupil performance on the tests.
The results show a strong correlation between rater judgements and real test scores, meaning that the descriptors offer an indication of item difficulty.
However, some individual descriptor categories contain deviations from the expected order.
We conclude that, while the level descriptors reflect the test’s framework, and thus support the validity argument, the argument could be strengthened with the revision of some individual descriptors.

Related Results

Chemical bond overlap descriptors from multiconfiguration wavefunctions
Chemical bond overlap descriptors from multiconfiguration wavefunctions
Chemical bonds are fundamental in chemistry, serving as the foundation for understanding molecular properties. Over time, various theories and descriptors have evolved to character...
A Sampling of the Phenetic Diversity of Cacao in the International Cocoa Gene Bank of Trinidad
A Sampling of the Phenetic Diversity of Cacao in the International Cocoa Gene Bank of Trinidad
The International Cocoa Gene bank, Trinidad, is an international cacao (Theobroma cacao L.) germplasm depository that conserves nearly 2500 accessions in its field collection. A po...
Exploring the Utilization of Gradient Information in SIFT Based Local Image Descriptors
Exploring the Utilization of Gradient Information in SIFT Based Local Image Descriptors
The utilization of gradient information is a key issue in building Scale Invariant Feature Transform (SIFT)-like descriptors. In the literature, two types of gradient information, ...
A Spatial-Spectral Feature Descriptor for Hyperspectral Image Matching
A Spatial-Spectral Feature Descriptor for Hyperspectral Image Matching
Hyperspectral Images (HSIs) have been utilized in many fields which contain spatial and spectral features of objects simultaneously. Hyperspectral image matching is a fundamental a...
Test Validity and Reliability in Learning Evaluation
Test Validity and Reliability in Learning Evaluation
The evaluation aimed to get achievement in assessing an educational target by students. In terms of assessment, a quality assessment tool is needed to fulfil two things: validity a...
Distitive Accuracy Measurement of Binary Descriptor v2
Distitive Accuracy Measurement of Binary Descriptor v2
Mobile Augmented Reality (MAR) requires a descriptor that is robust to changes in viewing conditions in real time application. Many different descriptors had been proposed in the l...

Back to Top