Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Improving human and machine classification through cognitive-inspired data engineering

View through CrossRef
Crowdsourcing offers a fast and cost-efficient approach to obtaining human labeled datasets. However, crowdsourced datasets and the models trained on them can inherit the cognitive constraints and biases of their annotators. In a process we refer to as cognitive-inspired data engineering, we investigate whether ideas from cognitive science can be applied to mitigate the presence of cognitive constraints and cognitive biases in crowdsourced datasets and, as a result, improve the performance of models trained on these datasets. We evaluate our approach by crowdsourcing labels for medical image diagnostic tasks using two different crowdsourcing platforms across two experiments. In Experiment 1, we collect subjective probability judgments from novice annotators through Amazon Mechanical Turk and, in Experiment 2, we collect subjective probability judgments and binary classifications from skilled annotators though DiagnosUs, a crowdsourcing platform specializing in medical and scientific data annotation. In both experiments, we find that de-biasing subjective probability judgments via recalibration leads to more accurate crowdsourced datasets and more accurate models trained on these datasets. Our results suggest that cognitive-inspired data engineering offers a promising avenue to improve the quality of crowdsourced datasets.
Title: Improving human and machine classification through cognitive-inspired data engineering
Description:
Crowdsourcing offers a fast and cost-efficient approach to obtaining human labeled datasets.
However, crowdsourced datasets and the models trained on them can inherit the cognitive constraints and biases of their annotators.
In a process we refer to as cognitive-inspired data engineering, we investigate whether ideas from cognitive science can be applied to mitigate the presence of cognitive constraints and cognitive biases in crowdsourced datasets and, as a result, improve the performance of models trained on these datasets.
We evaluate our approach by crowdsourcing labels for medical image diagnostic tasks using two different crowdsourcing platforms across two experiments.
In Experiment 1, we collect subjective probability judgments from novice annotators through Amazon Mechanical Turk and, in Experiment 2, we collect subjective probability judgments and binary classifications from skilled annotators though DiagnosUs, a crowdsourcing platform specializing in medical and scientific data annotation.
In both experiments, we find that de-biasing subjective probability judgments via recalibration leads to more accurate crowdsourced datasets and more accurate models trained on these datasets.
Our results suggest that cognitive-inspired data engineering offers a promising avenue to improve the quality of crowdsourced datasets.

Related Results

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Midlife Marital Status and Subsequent Cognitive Decline over 20 Years: Discovery from ARIC
Midlife Marital Status and Subsequent Cognitive Decline over 20 Years: Discovery from ARIC
Background — Recent studies show that marriage is associated with a protective effect against cognitive decline among older adults. However, definite evidence from large prospectiv...
Cognitive Science Approaches in Biblical Studies
Cognitive Science Approaches in Biblical Studies
Since the mid-2000s, cognitive science approaches have been used in biblical studies. Cognitive science came into existence in the 1950s as a reaction to the psychological behavior...
Improving Medical Document Classification via Feature Engineering
Improving Medical Document Classification via Feature Engineering
<p dir="ltr">Document classification (DC) is the task of assigning the predefined labels to unseen documents by utilizing the model trained on the available labeled documents...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
Impact of Tinnitus on Quality of Life and Cognitive Function in Adults: A Systematic Review
Impact of Tinnitus on Quality of Life and Cognitive Function in Adults: A Systematic Review
Background: Tinnitus is often associated with cognitive difficulties, especially in attention and executive functioning. However, it remains unclear how much tinnitus itself contri...
Evolutionary Cognitive Archaeology
Evolutionary Cognitive Archaeology
Cognitive archaeology may be divided into two branches. Evolutionary cognitive archaeology (ECA) is the discipline of prehistoric archaeology that studies the evolution of human co...
Engineering Cementitious Composite with Nature-Inspired Architected Polymeric Reinforcing Elements Using Additive Manufacturing Method
Engineering Cementitious Composite with Nature-Inspired Architected Polymeric Reinforcing Elements Using Additive Manufacturing Method
Concrete, known for its excellent compression strength, faces challenges in tensile strength, requiring additional steel or polymers reinforcements. Incorporating nature-inspired p...

Back to Top