Rubric development for AI-enabled scoring of three-dimensional constructed-response assessment aligned to NGSS learning progression
Introduction
The Framework for K-12 Science Education (the Framework) and the Next Generation Science Standards (NGSS) define three dimensions of science (disciplinary core ideas, scientific and engineering practices, and crosscutting concepts) and emphasize integrating the three dimensions (3D) to reflect deep science understanding. The Framework also emphasizes the importance of using learning progressions (LPs) as roadmaps to guide assessment development. Assessments capable of measuring the integration of NGSS dimensions should probe the ability to explain phenomena and solve problems. This calls for the development of constructed-response (CR), or open-ended, assessments, even though they are expensive to score. Artificial intelligence (AI) technology, such as machine learning (ML)-based approaches, has been used to score and provide feedback on open-ended NGSS assessments aligned to LPs. ML approaches can use classifications resulting from holistic and analytic coding schemes for scoring short CR assessments. Analytic rubrics have been shown to make it easier to evaluate the validity of ML-based scores with respect to LP levels. However, a possible drawback of using analytic rubrics for NGSS-aligned CR assessments is the potential for oversimplification of integrated ideas. Here we describe how to deconstruct a 3D holistic rubric for CR assessments probing the levels of an NGSS-aligned LP for high school physical sciences.

Methods
We deconstruct this rubric into seven analytic categories in a way that preserves the 3D nature of the rubric and its resulting scores, and we then map combinations of categories to LP levels.

Results
The resulting analytic rubric had excellent human-human inter-rater reliability across the seven categories (Cohen’s kappa range 0.82–0.97). Overall scores obtained by combining analytic rubric categories agreed very closely with scores assigned using the holistic rubric (99% agreement), suggesting that the 3D nature of the rubric and scores was maintained. Agreement between ML models using analytic rubric scores and human-assigned scores differed across categories; models for categories with a low number of positive cases displayed the lowest agreement.

Discussion
We discuss these differences in performance across analytic categories (bins), along with the implications and further applications of this rubric deconstruction approach.
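The agreement statistics mentioned in the abstract can be illustrated with a minimal sketch. The code below is not the authors' pipeline: the score lists are made-up example data, and the category names (`cat_a`, `cat_b`, `cat_c`) and the level rules in `lp_level` are hypothetical stand-ins, since the abstract does not list the seven categories or how they combine. Only the formula for Cohen's kappa, (p_o - p_e) / (1 - p_e), is standard.

```python
from collections import Counter
from typing import Dict, Hashable, Sequence


def percent_agreement(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Fraction of responses on which two score lists match exactly."""
    if len(a) != len(b) or not a:
        raise ValueError("score lists must be non-empty and equal in length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohen_kappa(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Cohen's kappa for two raters: (p_o - p_e) / (1 - p_e)."""
    n = len(a)
    p_o = percent_agreement(a, b)  # observed agreement
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement from each rater's marginal label frequencies.
    p_e = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


def lp_level(categories: Dict[str, bool]) -> int:
    """Map binary analytic categories to a level.

    HYPOTHETICAL rules for illustration only; the real rubric uses seven
    categories whose combinations are not given in the abstract.
    """
    if categories["cat_a"] and categories["cat_b"] and categories["cat_c"]:
        return 3
    if categories["cat_a"] and categories["cat_b"]:
        return 2
    if categories["cat_a"]:
        return 1
    return 0


# Made-up binary scores from two raters for one analytic category.
rater_1 = [1, 1, 0, 0, 1, 0, 1, 0]
rater_2 = [1, 1, 0, 0, 1, 0, 0, 0]

print(percent_agreement(rater_1, rater_2))  # 0.875
print(cohen_kappa(rater_1, rater_2))        # 0.75
print(lp_level({"cat_a": True, "cat_b": True, "cat_c": False}))  # 2
```

In this toy data the raters agree on 7 of 8 responses, yet kappa is lower than raw agreement because half of that agreement would be expected by chance. This is one reason kappa is often reported alongside percent agreement, particularly for categories with few positive cases.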

