Javascript must be enabled to continue!
BNPdensity: Bayesian nonparametric mixture modelling in R
View through CrossRef
SummaryRobust statistical data modelling under potential model mis‐specification often requires leaving the parametric world for the nonparametric. In the latter, parameters are infinite dimensional objects such as functions, probability distributions or infinite vectors. In the Bayesian nonparametric approach, prior distributions are designed for these parameters, which provide a handle to manage the complexity of nonparametric models in practice. However, most modern Bayesian nonparametric models seem often out of reach to practitioners, as inference algorithms need careful design to deal with the infinite number of parameters. The aim of this work is to facilitate the journey by providing computational tools for Bayesian nonparametric inference. The article describes a set of functions available in theRpackageBNPdensityin order to carry out density estimation with an infinite mixture model, including all types of censored data. The package provides access to a large class of such models based on normalised random measures, which represent a generalisation of the popular Dirichlet process mixture. One striking advantage of this generalisation is that it offers much more robust priors on the number of clusters than the Dirichlet. Another crucial advantage is the complete flexibility in specifying the prior for the scale and location parameters of the clusters, because conjugacy is not required. Inference is performed using a theoretically grounded approximate sampling methodology known as the Ferguson & Klass algorithm. The package also offers several goodness‐of‐fit diagnostics such as QQ plots, including a cross‐validation criterion, the conditional predictive ordinate. The proposed methodology is illustrated on a classical ecological risk assessment method called the species sensitivity distribution problem, showcasing the benefits of the Bayesian nonparametric framework.
Title: BNPdensity: Bayesian nonparametric mixture modelling in R
Description:
SummaryRobust statistical data modelling under potential model mis‐specification often requires leaving the parametric world for the nonparametric.
In the latter, parameters are infinite dimensional objects such as functions, probability distributions or infinite vectors.
In the Bayesian nonparametric approach, prior distributions are designed for these parameters, which provide a handle to manage the complexity of nonparametric models in practice.
However, most modern Bayesian nonparametric models seem often out of reach to practitioners, as inference algorithms need careful design to deal with the infinite number of parameters.
The aim of this work is to facilitate the journey by providing computational tools for Bayesian nonparametric inference.
The article describes a set of functions available in theRpackageBNPdensityin order to carry out density estimation with an infinite mixture model, including all types of censored data.
The package provides access to a large class of such models based on normalised random measures, which represent a generalisation of the popular Dirichlet process mixture.
One striking advantage of this generalisation is that it offers much more robust priors on the number of clusters than the Dirichlet.
Another crucial advantage is the complete flexibility in specifying the prior for the scale and location parameters of the clusters, because conjugacy is not required.
Inference is performed using a theoretically grounded approximate sampling methodology known as the Ferguson & Klass algorithm.
The package also offers several goodness‐of‐fit diagnostics such as QQ plots, including a cross‐validation criterion, the conditional predictive ordinate.
The proposed methodology is illustrated on a classical ecological risk assessment method called the species sensitivity distribution problem, showcasing the benefits of the Bayesian nonparametric framework.
Related Results
Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Figs S1-S9
Figs S1-S9
Fig. S1. Consensus phylogram (50 % majority rule) resulting from a Bayesian analysis of the ITS sequence alignment of sequences generated in this study and reference sequences from...
The Oxford Handbook of Bayesian Econometrics
The Oxford Handbook of Bayesian Econometrics
Bayesian econometric methods have enjoyed an increase in popularity in recent years. Econometricians, empirical economists, and policymakers are increasingly making use of Bayesian...
Nonparametric Segment Detection
Nonparametric Segment Detection
In computer and robotic vision point clouds from depth sensors have to be processed to form higher-level concepts such as lines, planes, and objects. Bayesian methods formulate pre...
Efficient Approaches to the Mixture Distance Problem
Efficient Approaches to the Mixture Distance Problem
The ancestral mixture model, an important model building a hierarchical tree from high dimensional binary sequences, was proposed by Chen and Lindsay in 2006. As a phylogenetic tre...
Multinomial Naïve Bayes Classifier: Bayesian versus Nonparametric Classifier Approach
Multinomial Naïve Bayes Classifier: Bayesian versus Nonparametric Classifier Approach
This paper proposes a Naïve Bayes Classifier for Bayesian and nonparametric methods of analyzing multinomial regression. The Naïve Bayes classifier adopted Bayes’ rule for solving ...
Full Bayesian models for paired RNA-seq data and Bayesian equivalence test
Full Bayesian models for paired RNA-seq data and Bayesian equivalence test
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] "In my doctorate research, I have developed Bayesian models to analyze the paired RNAseq data for different t...
From p-values to Bayes Factor: A Meta-Analytic Comparison in Colorectal Research
From p-values to Bayes Factor: A Meta-Analytic Comparison in Colorectal Research
Abstract
The prevalent method for synthesizing evidence from multiple studies is the frequentist meta-analysis, which relies on assumptions of long-term frequencies and d...

