Javascript must be enabled to continue!
CentIER: accurate centromere identification for plant genomes with sequence specificity information
View through CrossRef
Abstract
Centromere identification is one of the important problems in genomics, providing a foundation for the studies of centromeres in aspects of composition, functionality, evolution, inheritance, and variation. The existing wet-experiment-based method is costly and time-consuming, while the bioinformatic method can only detect tandem repeats losing non-repetitive sequence regions in the centromere. To address these shortcomings, we introduce a new pipeline, CentIER, for the automatic and accurate identification and annotation of centromere regions by taking advantage of the sequence specificity information. CentIER only requires users to input the genomic sequence, and then it can partition the centromeric region from a chromosome, identify tandem repeat monomers, annotate retrotransposons, and ultimately output visualized results. By referencing the experimentally determined centromere regions, it was discovered that the predictive accuracy of centromere recognition by CentIER exceeded 90%. Following the evaluation of CentIER’s accuracy, it was applied to investigate the sequence and distribution characteristics of centromeric retrotransposons and tandem repeat sequences of different species, providing insights into these traits in monocotyledonous and dicotyledonous plants.
Title: CentIER: accurate centromere identification for plant genomes with sequence specificity information
Description:
Abstract
Centromere identification is one of the important problems in genomics, providing a foundation for the studies of centromeres in aspects of composition, functionality, evolution, inheritance, and variation.
The existing wet-experiment-based method is costly and time-consuming, while the bioinformatic method can only detect tandem repeats losing non-repetitive sequence regions in the centromere.
To address these shortcomings, we introduce a new pipeline, CentIER, for the automatic and accurate identification and annotation of centromere regions by taking advantage of the sequence specificity information.
CentIER only requires users to input the genomic sequence, and then it can partition the centromeric region from a chromosome, identify tandem repeat monomers, annotate retrotransposons, and ultimately output visualized results.
By referencing the experimentally determined centromere regions, it was discovered that the predictive accuracy of centromere recognition by CentIER exceeded 90%.
Following the evaluation of CentIER’s accuracy, it was applied to investigate the sequence and distribution characteristics of centromeric retrotransposons and tandem repeat sequences of different species, providing insights into these traits in monocotyledonous and dicotyledonous plants.
Related Results
The nucleosomes that mark centromere location on chromosomes old and new
The nucleosomes that mark centromere location on chromosomes old and new
Abstract
Proper segregation of chromosomes is an essential component of cell division. The centromere is the locus at which the kinetochore—the proteinaceous complex...
Abstract LB034: EWSR1, Ewing sarcoma breakpoint region 1, maintains centromere identity by binding to centromeric R-loops
Abstract LB034: EWSR1, Ewing sarcoma breakpoint region 1, maintains centromere identity by binding to centromeric R-loops
Abstract
The centromere DNA-kinetochore complex is the specialized chromatin structure that mediates chromosome attachment to microtubules and is required for high-f...
RepeatOBserver: tandem repeat visualization and centromere detection
RepeatOBserver: tandem repeat visualization and centromere detection
Abstract
Tandem repeats can play an important role in centromere structure, subtelomeric regions, DNA methylation, recombination, and the regulat...
Provocative Tests in Diagnosis of Thoracic Outlet Syndrome: A Narrative Review
Provocative Tests in Diagnosis of Thoracic Outlet Syndrome: A Narrative Review
Abstract
Thoracic outlet syndrome (TOS) is a group of conditions caused by the compression of the neurovascular bundle within the thoracic outlet. It is classified into three main ...
RNA quality control by CCR4 safeguards chromatin integrity and centromere function inArabidopsis
RNA quality control by CCR4 safeguards chromatin integrity and centromere function inArabidopsis
SummaryThe centromere is a pivotal chromatin domain that ensures accurate chromosome segregation during cell division. However, the epigenome regulation of the centromere and its i...
How chromosomal rearrangements shape genomes : a computational and mathematical study
How chromosomal rearrangements shape genomes : a computational and mathematical study
Comment les réarrangements chromosomiques façonnent les génomes : étude par modélisation et simulations
Les origines de la complexité des génomes, ainsi que les dét...
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
ABSTRACT
Corynebacterium
is a predominant genus in the skin microbiome, yet its genetic diversity on skin is incompletely chara...
Statistique des comparaisons de génomes complets bactériens
Statistique des comparaisons de génomes complets bactériens
La génomique comparative est l'étude des relations structurales et fonctionnelles entre des génomes appartenant à différentes souches ou espèces. Cette discipline offre ainsi la po...

