Javascript must be enabled to continue!

Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error

Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population. As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators. This raises the question of whether estimates based on online non-probability samples are consistent across sample vendors and concerning estimates based on probability samples. To address this question, we analyze data collected from eight different online non-probability sample vendors and one online probability-based sample. We find that estimates from the different non-probability samples can be very inconsistent. We suggest averaging estimates across multiple vendor samples to avoid the risk of a maximum estimation error. We evaluate several averaging approaches, including a LASSO regression procedure which identifies a subset of vendors that, when averaged, produce estimates that are more consistent with the reference probability-based estimates, compared to any single vendor. Our results show that estimates based on different vendors’ samples display different selection biases, but there is also some commonality among some vendor-specific estimates, thus there could be strong gains in estimation precision by averaging across a selection of multiple non-probability sample vendors.

SAGE Publications

Alexander Murray-Watters Stefan Zins Joseph W. Sakshaug Carina Cornesse

Journal of Official Statistics

2025

Title: Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error

Description:

Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population.

As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators.

This raises the question of whether estimates based on online non-probability samples are consistent across sample vendors and concerning estimates based on probability samples.

To address this question, we analyze data collected from eight different online non-probability sample vendors and one online probability-based sample.

We find that estimates from the different non-probability samples can be very inconsistent.

We suggest averaging estimates across multiple vendor samples to avoid the risk of a maximum estimation error.

We evaluate several averaging approaches, including a LASSO regression procedure which identifies a subset of vendors that, when averaged, produce estimates that are more consistent with the reference probability-based estimates, compared to any single vendor.

Our results show that estimates based on different vendors’ samples display different selection biases, but there is also some commonality among some vendor-specific estimates, thus there could be strong gains in estimation precision by averaging across a selection of multiple non-probability sample vendors.

Back

The present study aimed to analyze how different schedules combining error estimation and relative frequency of extrinsic feedback affects motor learning. Fifty-two undergraduate s...

NICU Medication Errors: Describing the Cause and Nature of Medication Errors in a NICU in Qatar

IntroductionA medication error can be defined as “any error occurring in the medication use process” and focuses on problems with the delivery of medication to a patient [1]. Medic...

Robust Averaging Protects Decisions from Noise in Neural Computations

Abstract An ideal observer will give equivalent weight to sources of information that are equally reliable. However, when averaging visual information, human observ...

Pure Maximal Submodules and Related Concepts

In this work we discuss the concept of pure-maximal denoted by (Pr-maximal) submodules as a generalization to the type of R- maximal submodule, where a proper submodule of a...

Islet β-Cell Function Following 4.4-Year Insulin Injection in Chinese Elderly Patients with Type 2 Diabetes Mellitus

Abstract Background The aim of this study was to scrutinize changes of islet β-cell function in Chinese elderly patients with type 2 diabetes mellitus (T2DM) after insulin ...

Masticatory muscle activation patterns manifested by changes in index values

Relevance. Surface electromyography (sEMG) is a method used to record the bioelectrical activity of masticatory muscles both at rest and during movement. This method generates rela...

Errors

When we compare study group/s with a control group in a research, there can be ‘errors’. Error is the difference between the ‘fact’ and our ‘finding’. In other words, error is the ...

Averaging pre-Lie bialgebras and the related admissible classical Yang-Baxter equations

In this paper, we initiate the representation theory for averaging pre-Lie algebras, and establish the intrinsic equivalence among matched pairs, Manin triples, and bialgebra struc...

Email:
Password:

Email:

Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error

Related Results