Javascript must be enabled to continue!
Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error
View through CrossRef
Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population. As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators. This raises the question of whether estimates based on online non-probability samples are consistent across sample vendors and concerning estimates based on probability samples. To address this question, we analyze data collected from eight different online non-probability sample vendors and one online probability-based sample. We find that estimates from the different non-probability samples can be very inconsistent. We suggest averaging estimates across multiple vendor samples to avoid the risk of a maximum estimation error. We evaluate several averaging approaches, including a LASSO regression procedure which identifies a subset of vendors that, when averaged, produce estimates that are more consistent with the reference probability-based estimates, compared to any single vendor. Our results show that estimates based on different vendors’ samples display different selection biases, but there is also some commonality among some vendor-specific estimates, thus there could be strong gains in estimation precision by averaging across a selection of multiple non-probability sample vendors.
Title: Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error
Description:
Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population.
As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators.
This raises the question of whether estimates based on online non-probability samples are consistent across sample vendors and concerning estimates based on probability samples.
To address this question, we analyze data collected from eight different online non-probability sample vendors and one online probability-based sample.
We find that estimates from the different non-probability samples can be very inconsistent.
We suggest averaging estimates across multiple vendor samples to avoid the risk of a maximum estimation error.
We evaluate several averaging approaches, including a LASSO regression procedure which identifies a subset of vendors that, when averaged, produce estimates that are more consistent with the reference probability-based estimates, compared to any single vendor.
Our results show that estimates based on different vendors’ samples display different selection biases, but there is also some commonality among some vendor-specific estimates, thus there could be strong gains in estimation precision by averaging across a selection of multiple non-probability sample vendors.
Related Results
Estimation of error followed by knowledge of results in a moderate or high frequency enhances motor learning and may avoid extrinsic feedback dependence
Estimation of error followed by knowledge of results in a moderate or high frequency enhances motor learning and may avoid extrinsic feedback dependence
The present study aimed to analyze how different schedules combining error estimation and relative frequency of extrinsic feedback affects motor learning. Fifty-two undergraduate s...
NICU Medication Errors: Describing the Cause and Nature of Medication Errors in a NICU in Qatar
NICU Medication Errors: Describing the Cause and Nature of Medication Errors in a NICU in Qatar
IntroductionA medication error can be defined as “any error occurring in the medication use process” and focuses on problems with the delivery of medication to a patient [1]. Medic...
Robust Averaging Protects Decisions from Noise in Neural Computations
Robust Averaging Protects Decisions from Noise in Neural Computations
Abstract
An ideal observer will give equivalent weight to sources of information that are equally reliable. However, when averaging visual information, human observ...
Pure Maximal Submodules and Related Concepts
Pure Maximal Submodules and Related Concepts
In this work we discuss the concept of pure-maximal denoted by (Pr-maximal) submodules as a generalization to the type of R- maximal submodule, where a proper submodule of a...
Islet β-Cell Function Following 4.4-Year Insulin Injection in Chinese Elderly Patients with Type 2 Diabetes Mellitus
Islet β-Cell Function Following 4.4-Year Insulin Injection in Chinese Elderly Patients with Type 2 Diabetes Mellitus
Abstract
Background The aim of this study was to scrutinize changes of islet β-cell function in Chinese elderly patients with type 2 diabetes mellitus (T2DM) after insulin ...
Masticatory muscle activation patterns manifested by changes in index values
Masticatory muscle activation patterns manifested by changes in index values
Relevance. Surface electromyography (sEMG) is a method used to record the bioelectrical activity of masticatory muscles both at rest and during movement. This method generates rela...
Averaging pre-Lie bialgebras and the related admissible classical Yang-Baxter equations
Averaging pre-Lie bialgebras and the related admissible classical Yang-Baxter equations
In this paper, we initiate the representation theory for averaging pre-Lie algebras, and establish the intrinsic equivalence among matched pairs, Manin triples, and bialgebra struc...

