Javascript must be enabled to continue!
An interpretable ensemble method for deep representation learning
View through CrossRef
Model ensemble is widely used in deep learning since it can balance the
variance and bias of complex models. The mainstream model ensemble
methods can be divided into “implicit” and “explicit”. The
“implicit” method obtains different models by randomly inactivating the
internal parameters in the complex structure of the deep learning model,
and these models are integrated by sharing parameters. However, these
methods lack flexibility because they can only ensemble homogeneous
models with the similar structure. While the “explicit” ensemble
method can fuse completely different heterogeneous model structures,
which significantly enhances the flexibility of model selection and makes
it possible to integrate more models with entirely different
perspectives. However, the explicit ensemble will face the challenge of
averaging the outputs, leading to a chaotic result. To this end,
researchers further proposed using knowledge distillation and
adversarial learning technologies to perform a nonlinear combination of
multiple heterogeneous models to obtain better ensemble performance,
however these require significant modifications to the training or testing
procedure and are computationally expensive compared to simply
averaging. In this paper, based on the linear combination assumption, we
propose an interpretable ensemble method for averaging model results
which is simple to implement, and conducting experiments on the
representation learning tasks of Computer Vision(CV) and Natural
Language Processing(NLP). The results show that our method is superior
to direct averaging results while retaining the practicality of direct
averaging.
Title: An interpretable ensemble method for deep representation learning
Description:
Model ensemble is widely used in deep learning since it can balance the
variance and bias of complex models.
The mainstream model ensemble
methods can be divided into “implicit” and “explicit”.
The
“implicit” method obtains different models by randomly inactivating the
internal parameters in the complex structure of the deep learning model,
and these models are integrated by sharing parameters.
However, these
methods lack flexibility because they can only ensemble homogeneous
models with the similar structure.
While the “explicit” ensemble
method can fuse completely different heterogeneous model structures,
which significantly enhances the flexibility of model selection and makes
it possible to integrate more models with entirely different
perspectives.
However, the explicit ensemble will face the challenge of
averaging the outputs, leading to a chaotic result.
To this end,
researchers further proposed using knowledge distillation and
adversarial learning technologies to perform a nonlinear combination of
multiple heterogeneous models to obtain better ensemble performance,
however these require significant modifications to the training or testing
procedure and are computationally expensive compared to simply
averaging.
In this paper, based on the linear combination assumption, we
propose an interpretable ensemble method for averaging model results
which is simple to implement, and conducting experiments on the
representation learning tasks of Computer Vision(CV) and Natural
Language Processing(NLP).
The results show that our method is superior
to direct averaging results while retaining the practicality of direct
averaging.
Related Results
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background
Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find ...
Improvement of river flow estimation accuracy using ensemble learning stacking
Improvement of river flow estimation accuracy using ensemble learning stacking
<p>In recent years, disasters are more frequent and enormous due to global warming. In the field of hydrology, high-precision rainfall-runoff modeling is required. Re...
Deep Neural Ensemble Classification for COVID-19 Dataset
Deep Neural Ensemble Classification for COVID-19 Dataset
The COVID-19 pandemic has necessitated the development of accurate and efficient classification models for diagnosis and prognosis. While deep learning has shown promising results ...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Preliminary study of a new-style terrain disturbance method based on gradient inhomogeneity in convection-allowing scale ensemble prediction system
Preliminary study of a new-style terrain disturbance method based on gradient inhomogeneity in convection-allowing scale ensemble prediction system
<p>Terrain with different shapes and ground surface properties has extremely complex impacts on atmospheric motion, and the forecast uncertainty and complexity caused...
Deep Learning for Latent Space Data Assimilation LSDA in Subsurface Flow Systems
Deep Learning for Latent Space Data Assimilation LSDA in Subsurface Flow Systems
Abstract
We present a deep learning architecture for efficient reduced-order implementation of ensemble data assimilation. Specifically, deep learning is used to imp...
Experiential Learning and Education in Management
Experiential Learning and Education in Management
Experiential learning describes the process of learning that results from gathering and processing information through direct engagement with the world. In contrast to behavioral a...

