Javascript must be enabled to continue!
Developing a Phylogeny Based Machine Learning Algorithm for Metagenomics
View through CrossRef
Metagenomics is the study of the totality of the complete genetic elements discovered from a defined environment. Different from traditional microbiology study, which only analyzes a small percent of microbes that could survive in laboratory, metagenomics allows researchers to get entire genetic information from all the samples in the communities. So metagenomics enables understanding of the target environments and the hidden relationships between bacteria and diseases. In order to efficiently analyze the metagenomics data, cutting-edge technologies for analyzing the relationships among microbes and communities are required. To overcome the challenges brought by rapid growth in metagenomics datasets, advances in novel methodologies for interpreting metagenomics data are clearly needed. The first two chapters of this dissertation summarize and compare the widely-used methods in metagenomics and integrate these methods into pipelines. Properly analyzing metagenomics data requires a variety of bioinformatcis and statistical approaches to deal with different situations. The raw reads from sequencing centers need to be processed and denoised by several steps and then be further interpreted by ecological and statistical analysis. So understanding these algorithms and combining different approaches could potentially reduce the influence of noises and biases at different steps. And an efficient and accurate pipeline is important to robustly decipher the differences and functionality of bacteria in communities. Traditional statistical analysis and machine learning algorithms have their limitations on analyzing metagenomics data. Thus, rest three chapters describe a new phylogeny based machine learning and feature selection algorithm to overcome these problems. The new method outperforms traditional algorithms and can provide more robust candidate microbes for further analysis. With the frowing sample size, deep neural network could potentially describe more complicated characteristic of data and thus improve model accuracy. So a deep learning framework is designed on top of the shallow learning algorithm stated above in order to further improve the prediction and selection accuracy. The present dissertation work provides a powerful tool that utilizes machine learning techniques to identify signature bacteria and key information from huge amount of metagenomics data.
Title: Developing a Phylogeny Based Machine Learning Algorithm for Metagenomics
Description:
Metagenomics is the study of the totality of the complete genetic elements discovered from a defined environment.
Different from traditional microbiology study, which only analyzes a small percent of microbes that could survive in laboratory, metagenomics allows researchers to get entire genetic information from all the samples in the communities.
So metagenomics enables understanding of the target environments and the hidden relationships between bacteria and diseases.
In order to efficiently analyze the metagenomics data, cutting-edge technologies for analyzing the relationships among microbes and communities are required.
To overcome the challenges brought by rapid growth in metagenomics datasets, advances in novel methodologies for interpreting metagenomics data are clearly needed.
The first two chapters of this dissertation summarize and compare the widely-used methods in metagenomics and integrate these methods into pipelines.
Properly analyzing metagenomics data requires a variety of bioinformatcis and statistical approaches to deal with different situations.
The raw reads from sequencing centers need to be processed and denoised by several steps and then be further interpreted by ecological and statistical analysis.
So understanding these algorithms and combining different approaches could potentially reduce the influence of noises and biases at different steps.
And an efficient and accurate pipeline is important to robustly decipher the differences and functionality of bacteria in communities.
Traditional statistical analysis and machine learning algorithms have their limitations on analyzing metagenomics data.
Thus, rest three chapters describe a new phylogeny based machine learning and feature selection algorithm to overcome these problems.
The new method outperforms traditional algorithms and can provide more robust candidate microbes for further analysis.
With the frowing sample size, deep neural network could potentially describe more complicated characteristic of data and thus improve model accuracy.
So a deep learning framework is designed on top of the shallow learning algorithm stated above in order to further improve the prediction and selection accuracy.
The present dissertation work provides a powerful tool that utilizes machine learning techniques to identify signature bacteria and key information from huge amount of metagenomics data.
Related Results
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Comparison of targeted metagenomics and IS-Pro methods for analysing the lung microbiome
Comparison of targeted metagenomics and IS-Pro methods for analysing the lung microbiome
AbstractBackgroundTargeted metagenomics and IS-Pro method are two of the many methods that have been used to study the microbiome. The two methods target different regions of the 1...
METAGENOMICS CURRENT RESEARCH, APPLICATION AND COMPUTATIONAL ANALYSIS
METAGENOMICS CURRENT RESEARCH, APPLICATION AND COMPUTATIONAL ANALYSIS
Metagenomics is the combination of genomics branch and meta that means huge set of genomes from different organisms. Metagenomics is also called as environmental genomics or commun...
Modelling the effectiveness of surveillance based on metagenomics in detecting, monitoring, and forecasting antimicrobial resistance in livestock production under economic constraints
Modelling the effectiveness of surveillance based on metagenomics in detecting, monitoring, and forecasting antimicrobial resistance in livestock production under economic constraints
Abstract
Current surveillance of antimicrobial resistance (AMR) is mostly based on testing indicator bacteria using minimum inhibitory concen...
An Approach to Machine Learning
An Approach to Machine Learning
The process of automatically recognising significant patterns within large amounts of data is called "machine learning." Throughout the last couple of decades, it has evolved into ...
Machine Learning for Enhancing Mortgage Origination Processes: Streamlining and Improving Efficiency
Machine Learning for Enhancing Mortgage Origination Processes: Streamlining and Improving Efficiency
The mortgage industry, historically characterized by manual processes, paperwork, and complex decision-making, is on the brink of a digital revolution driven by machine learning (M...
Clinical metagenomics assessments improve diagnosis and outcomes in community-acquired pneumonia
Clinical metagenomics assessments improve diagnosis and outcomes in community-acquired pneumonia
Abstract
Background
Identifying the causes of community-acquired pneumonia (CAP) is challenging due to the disease’s complex etiology and the limita...

