Javascript must be enabled to continue!
Dealing with overdispersed count data in applied ecology
View through CrossRef
Summary
The ability to identify key ecological processes is important when solving applied problems. Increasingly, ecologists are adopting Akaike's information criterion (AIC) as a metric to help them assess and select among multiple process‐based ecological models. Surprisingly, however, it is still unclear how best to incorporate AIC into the selection process in order to address the trade‐off between maximizing the probability of retaining the most parsimonious model while minimizing the number of models retained.
Ecological count data are often observed to be overdispersed with respect to best‐fitting models. Overdispersion is problematic when performing an AIC analysis, as it can result in selection of overly complex models which can lead to poor ecological inference. This paper describes and illustrates two approaches that deal effectively with overdispersion. The first approach involves modelling the causes of overdispersion implicitly using compound probability distributions. The second approach ignores the causes of overdispersion and uses quasi‐AIC (QAIC) as a metric for model parsimony.
Simulations and a novel method that identifies the most parsimonious model are used to demonstrate the utility of the two overdispersion approaches within the context of two ecological examples. The first example addresses binomial data obtained from a study of fish survival (as related to habitat structure) and the second example addresses Poisson data obtained from a study of flower visitation by nectarivores.
Applying either overdispersion approach reduces the chance of selecting overly complex models, and both approaches result in very similar ecological inference. In addition, inference can be made more reliable by incorporating model nesting into the selection process (i.e. identifying which models are special cases of others), as it reduces the number of models selected without significantly reducing the probability of retaining the most parsimonious models.
Synthesis and applications. When data are overdispersed, inference can be improved by either modelling the causes of overdispersion or applying QAIC as a metric for model parsimony. Inference can also be improved by adopting a model filtering procedure based on how models are nested. The general simulation approach presented in this paper for identifying the most parsimonious model, as defined by information theory, should help to improve our understanding of the reliability of model selection when using AIC, and help the development of better selection rules.
Title: Dealing with overdispersed count data in applied ecology
Description:
Summary
The ability to identify key ecological processes is important when solving applied problems.
Increasingly, ecologists are adopting Akaike's information criterion (AIC) as a metric to help them assess and select among multiple process‐based ecological models.
Surprisingly, however, it is still unclear how best to incorporate AIC into the selection process in order to address the trade‐off between maximizing the probability of retaining the most parsimonious model while minimizing the number of models retained.
Ecological count data are often observed to be overdispersed with respect to best‐fitting models.
Overdispersion is problematic when performing an AIC analysis, as it can result in selection of overly complex models which can lead to poor ecological inference.
This paper describes and illustrates two approaches that deal effectively with overdispersion.
The first approach involves modelling the causes of overdispersion implicitly using compound probability distributions.
The second approach ignores the causes of overdispersion and uses quasi‐AIC (QAIC) as a metric for model parsimony.
Simulations and a novel method that identifies the most parsimonious model are used to demonstrate the utility of the two overdispersion approaches within the context of two ecological examples.
The first example addresses binomial data obtained from a study of fish survival (as related to habitat structure) and the second example addresses Poisson data obtained from a study of flower visitation by nectarivores.
Applying either overdispersion approach reduces the chance of selecting overly complex models, and both approaches result in very similar ecological inference.
In addition, inference can be made more reliable by incorporating model nesting into the selection process (i.
e.
identifying which models are special cases of others), as it reduces the number of models selected without significantly reducing the probability of retaining the most parsimonious models.
Synthesis and applications.
When data are overdispersed, inference can be improved by either modelling the causes of overdispersion or applying QAIC as a metric for model parsimony.
Inference can also be improved by adopting a model filtering procedure based on how models are nested.
The general simulation approach presented in this paper for identifying the most parsimonious model, as defined by information theory, should help to improve our understanding of the reliability of model selection when using AIC, and help the development of better selection rules.
Related Results
Tracing Hematological Shifts in Pregnancy: How Anemia and Thrombocytopenia Evolve Across Trimesters
Tracing Hematological Shifts in Pregnancy: How Anemia and Thrombocytopenia Evolve Across Trimesters
Abstract
Introduction
Given pregnancy's significant impact on hematological parameters, monitoring these changes across trimesters is crucial. This study aims to evaluate hematolog...
Language Ecology
Language Ecology
Norwegian scholar Einar Haugen proposed language ecology to study how languages historically present in a land or social setting interact with languages that arrive in this setting...
Grounding trait‐based root functional ecology
Grounding trait‐based root functional ecology
Abstract
A growing need exists to consider effects of biodiversity dynamics on the functioning of natural and anthropised ecosystems. This requires including the concepts of func...
Semiparametric methods for regression analysis of panel count data and mixed panel count data
Semiparametric methods for regression analysis of panel count data and mixed panel count data
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] Recurrent event data and panel count data are two common types of data that have been studied extensively in ...
Platelet count patterns and patient outcomes in sepsis at a tertiary care center
Platelet count patterns and patient outcomes in sepsis at a tertiary care center
Abstract
Acute physiology and chronic health evaluation II (APACHE-II) scoring system is used to classify disease severity of patients in the intensive care unit. Howev...
Reticulocyte Count and Platelet Count as Predictors of Morphological Remission/Hemopoitic Recovery in Acute Lymphoblastic Leukemia (ALL) after Induction Chemotherapy
Reticulocyte Count and Platelet Count as Predictors of Morphological Remission/Hemopoitic Recovery in Acute Lymphoblastic Leukemia (ALL) after Induction Chemotherapy
Objectives: To determine the predictive values of reticulocyte and platelet count for remission in cases of acute lymphoblastic leukemia after induction therapy.
Materials and Me...
Abstract 2525: Neutrophil count, cancer incidence and cancer mortality: disparate relationships by race.
Abstract 2525: Neutrophil count, cancer incidence and cancer mortality: disparate relationships by race.
Abstract
Studies have demonstrated an association between elevated levels of total white blood cell (WBC) count, within the clinically normal range, and an increased...
Dynamic curve analysis of indicators related to lumbar cistern drainage for postoperative meningitis
Dynamic curve analysis of indicators related to lumbar cistern drainage for postoperative meningitis
Abstract
Background
Lumbar cisterna drainage (LCD) is widely used in neurosurgery as an effective treatment for postoperative meningitis, vasospasm of subarachnoid hemorrh...

