Domain Adaptation and Domain Generalization with Representation Learning
<p>Machine learning has achieved great success in computer vision, especially in object recognition and classification. A core factor in this success is the availability of massive labeled image and video datasets for training, collected manually by humans. Labeling source training data, however, can be expensive and time-consuming. Furthermore, a large amount of labeled source data does not guarantee that traditional machine learning techniques generalize well: the data may carry a bias or mismatch, i.e., the training data do not represent the target environment. To mitigate this dataset bias or mismatch, one can consider domain adaptation: using labeled training data together with unlabeled target data to develop a classifier that performs well in the target environment. In some cases, however, no unlabeled target data are available, but multiple labeled sources of data exist. Such situations can be addressed by domain generalization: using multiple source training sets to produce a classifier that generalizes to the unseen target domain. Although several domain adaptation and generalization approaches have been proposed, domain mismatch in object recognition remains a challenging, open problem; model performance has yet to reach a satisfactory level in real-world applications.</p>
<p>The overall goal of this thesis is to make progress towards solving dataset bias in visual object recognition through representation learning in the context of domain adaptation and domain generalization. Representation learning is concerned with finding suitable data representations or features via learning rather than via engineering by human experts. This thesis proposes several representation learning solutions based on deep learning and kernel methods.</p>
<p>This thesis introduces a noise-robust deep neural network for handwritten digit classification trained on “clean” images only, which we name the Deep Hybrid Network (DHN). DHNs are based on a particular combination of sparse autoencoders and restricted Boltzmann machines. The results show that DHN performs better than a standard deep neural network in recognizing digits corrupted by Gaussian noise, impulse noise, block occlusions, and border occlusions.</p>
<p>This thesis proposes the Domain Adaptive Neural Network (DaNN), a neural-network-based domain adaptation algorithm that minimizes both the classification error and the domain discrepancy between the source and target data representations. The experiments show that DaNN is competitive with several state-of-the-art methods on a benchmark object dataset.</p>
<p>This thesis develops the Multi-task Autoencoder (MTAE), a domain generalization algorithm based on autoencoders trained via multi-task learning. MTAE learns to transform the original image into its analogs in multiple related domains simultaneously. The results show that MTAE's representations provide better classification performance than some alternative autoencoder-based models as well as the current state-of-the-art domain generalization algorithms.</p>
<p>This thesis proposes Scatter Component Analysis (SCA), a fast kernel-based representation learning algorithm for both domain adaptation and domain generalization. SCA finds a data representation that trades off maximizing the separability of classes, minimizing the mismatch between domains, and maximizing the separability of the data points as a whole. The results show that SCA runs much faster than some competing algorithms while providing state-of-the-art accuracy in both domain adaptation and domain generalization.</p>
<p>Finally, this thesis presents the Deep Reconstruction-Classification Network (DRCN), a deep convolutional network for domain adaptation. DRCN learns to classify labeled source data and also to reconstruct unlabeled target data via a shared encoding representation. The results show that DRCN provides competitive or better performance than the prior state-of-the-art model on several cross-domain object datasets.</p>
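The abstract describes DaNN as minimizing the classification error together with the domain discrepancy between source and target representations. A joint objective of that shape might be sketched in Python as follows. This is an illustration under stated assumptions, not the thesis implementation: the discrepancy is assumed here to be a biased squared maximum mean discrepancy (MMD) estimate with an RBF kernel, the network is assumed to have a single tanh hidden layer, and every name (rbf_mmd2, cross_entropy, adaptation_objective, gamma, weight) is hypothetical.

```python
import numpy as np


def rbf_mmd2(xs, xt, gamma=1.0):
    """Biased estimate of the squared MMD between two samples (RBF kernel).

    Assumption: the abstract only says "domain discrepancy"; MMD is one
    common choice and is used here purely for illustration.
    """
    def kernel(a, b):
        # Pairwise squared Euclidean distances via broadcasting.
        sq = (np.sum(a ** 2, axis=1)[:, None]
              + np.sum(b ** 2, axis=1)[None, :]
              - 2.0 * a @ b.T)
        return np.exp(-gamma * np.maximum(sq, 0.0))

    return (kernel(xs, xs).mean()
            + kernel(xt, xt).mean()
            - 2.0 * kernel(xs, xt).mean())


def cross_entropy(logits, labels):
    """Mean softmax cross-entropy for integer class labels."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def adaptation_objective(params, xs, ys, xt, weight=1.0):
    """Source classification loss plus weighted source/target discrepancy.

    params = (w1, b1, w2, b2) for a hypothetical one-hidden-layer network.
    Only the source samples xs carry labels ys; the target samples xt
    contribute through the discrepancy term alone.
    """
    w1, b1, w2, b2 = params
    hs = np.tanh(xs @ w1 + b1)   # source hidden representation
    ht = np.tanh(xt @ w1 + b1)   # target hidden representation
    logits = hs @ w2 + b2
    return cross_entropy(logits, ys) + weight * rbf_mmd2(hs, ht)
```

In this sketch, setting weight to 0 reduces the objective to ordinary supervised training on the source data, while larger values push the hidden representations of the two domains closer together.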
Related Results
Adaptive Planning for Resilient Coastal Waterfronts
Many delta and coastal cities worldwide face increasing flood risk due to changing climate conditions and sea level rise. The question is how to develop measures and strategies for...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The Covid-19 pandemic currently demands that teachers be able to use technology in the teaching and learning process. But in reality there are still many teachers who have not been able ...
Successful coastal adaptation projects? The role of multi-lateral climate funding.
This thesis investigates the evaluation of climate change adaptation success of projects in coastal zones of developing countries, specifically focusing on t...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Bidirectional-Feature-Learning-Based Adversarial Domain Adaptation with Generative Network
Studying domain adaptation is a recent research trend. Generally, many generative models that researchers have studied perform well on training data from a specific domain. However...
Kernels of Motor Memory Formation: Temporal Generalization in Bimanual Adaptation
Abstract
In daily life, we coordinate both simultaneous and sequential bimanual movements to manipulate objects. Our ability to rapidly account for different object...
Drivers and barriers of drought risk adaptation decisions by agro-pastoralists in Kenya
The Horn of Africa Drylands are increasingly experiencing severe droughts, which pose a threat to traditional livelihood strategies of pastoralist communities. Understanding ada...
Self-Training Cross-Domain Image Classification Model Via Label Adaptation
Unsupervised domain adaptation focuses on transferring knowledge from a labeled source domain to a completely unlabeled target domain, with the goal of enhancing classification per...

