Javascript must be enabled to continue!
Clinical Feature-Related Single-Base Substitution Sequence Signatures Identified with an Unsupervised Machine Learning Approach
View through CrossRef
Abstract
Background: Mutation processes leave different signatures in genes. For single-base substitutions, previous studies have suggested that mutation signatures are not only reflected in mutation bases but also in neighboring bases. However, because of the lack of a method to identify features of long sequences next to mutation bases, the understanding of how flanking sequences influence mutation signatures is limited.Methods: We constructed a long short-term memory – self organizing map (LSTM-SOM) unsupervised neural network. By extracting mutated sequence features via LSTM and clustering similar features with the SOM, single-base substitutions in The Cancer Genome Atlas database were clustered according to both their mutation site and flanking sequences. The relationship between mutation sequence signatures and clinical features was then analyzed. Finally, we clustered patients into different classes according to the composition of the mutation sequence signatures by the K-means method and then studied the differences in clinical features and survival between classes.Results: Ten classes of mutant sequence signatures (mutation blots, MBs) were obtained from 2,141,527 single-base substitutions via LSTM-SOM machine learning approach. Different features in mutation bases and flanking sequences were revealed among MBs. MBs reflect both the site and pathological features of cancers. MBs were related to clinical features, including age, gender, and cancer stage. The class of an MB in a given gene was associated with survival. Finally, patients were clustered into 7 classes according to the MB composition. Significant differences in survival and clinical features were observed among different patient classes.Conclusions: We provided a method for analyzing the characteristics of mutant sequences. Result of this study showed that flanking sequences, together with mutation bases, shape the signatures of SBSs. MBs were shown related to clinical features and survival of cancer patients. Composition of MBs is a feasible predictive factor of clinical prognosis. Further study of the mechanism of MBs related to cancer characteristics is suggested.
Springer Science and Business Media LLC
Title: Clinical Feature-Related Single-Base Substitution Sequence Signatures Identified with an Unsupervised Machine Learning Approach
Description:
Abstract
Background: Mutation processes leave different signatures in genes.
For single-base substitutions, previous studies have suggested that mutation signatures are not only reflected in mutation bases but also in neighboring bases.
However, because of the lack of a method to identify features of long sequences next to mutation bases, the understanding of how flanking sequences influence mutation signatures is limited.
Methods: We constructed a long short-term memory – self organizing map (LSTM-SOM) unsupervised neural network.
By extracting mutated sequence features via LSTM and clustering similar features with the SOM, single-base substitutions in The Cancer Genome Atlas database were clustered according to both their mutation site and flanking sequences.
The relationship between mutation sequence signatures and clinical features was then analyzed.
Finally, we clustered patients into different classes according to the composition of the mutation sequence signatures by the K-means method and then studied the differences in clinical features and survival between classes.
Results: Ten classes of mutant sequence signatures (mutation blots, MBs) were obtained from 2,141,527 single-base substitutions via LSTM-SOM machine learning approach.
Different features in mutation bases and flanking sequences were revealed among MBs.
MBs reflect both the site and pathological features of cancers.
MBs were related to clinical features, including age, gender, and cancer stage.
The class of an MB in a given gene was associated with survival.
Finally, patients were clustered into 7 classes according to the MB composition.
Significant differences in survival and clinical features were observed among different patient classes.
Conclusions: We provided a method for analyzing the characteristics of mutant sequences.
Result of this study showed that flanking sequences, together with mutation bases, shape the signatures of SBSs.
MBs were shown related to clinical features and survival of cancer patients.
Composition of MBs is a feasible predictive factor of clinical prognosis.
Further study of the mechanism of MBs related to cancer characteristics is suggested.
Related Results
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
Generic substitution for prescribed brand medicines in Ethiopia: knowledge, attitude and practice among pharmacy professionals in community drug retail outlets
Generic substitution for prescribed brand medicines in Ethiopia: knowledge, attitude and practice among pharmacy professionals in community drug retail outlets
Abstract
Background
Generic substitution is a good approach to reduce pharmaceutical expenses without compromising healthcare quality. Yet, the prac...
Correlation of Mutational Signatures in Cancer Genes with General Signatures
Correlation of Mutational Signatures in Cancer Genes with General Signatures
The occurrence of various mutation patterns, such as changes in the DNA sequence and the loss of some sequences, is called a “mutational signature,” and they represent the molecula...
DLUT: Decoupled Learning-Based Unsupervised Tracker
DLUT: Decoupled Learning-Based Unsupervised Tracker
Unsupervised learning has shown immense potential in object tracking, where accurate classification and regression are crucial for unsupervised trackers. However, the classificatio...
An Approach to Machine Learning
An Approach to Machine Learning
The process of automatically recognising significant patterns within large amounts of data is called "machine learning." Throughout the last couple of decades, it has evolved into ...
Unsupervised Deep Learning for Enhanced holoentropy Image Stitching
Unsupervised Deep Learning for Enhanced holoentropy Image Stitching
Traditional feature-based image stitching technologies rely heavily on feature detection quality, often failing to stitch images with few features or low resolution. The learning b...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...

