Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

GenMap: Fast and Exact Computation of Genome Mappability

View through CrossRef
Abstract We present a fast and exact algorithm to compute the ( k, e )-mappability. Its inverse, the ( k, e )-frequency counts the number of occurrences of each k -mer with up to e errors in a sequence. The algorithm we present is a magnitude faster than the algorithm in the widely used GEM suite while not relying on heuristics, and can even compute the mappability for short k -mers on highly repetitive plant genomes. We also show that mappability can be computed on multiple sequences to identify marker genes illustrated by the example of E. coli strains. GenMap allows exporting the mappability information into different formats such as raw output, wig and bed files. The application and its C++ source code is available on https://github.com/cpockrandt/genmap .
Title: GenMap: Fast and Exact Computation of Genome Mappability
Description:
Abstract We present a fast and exact algorithm to compute the ( k, e )-mappability.
Its inverse, the ( k, e )-frequency counts the number of occurrences of each k -mer with up to e errors in a sequence.
The algorithm we present is a magnitude faster than the algorithm in the widely used GEM suite while not relying on heuristics, and can even compute the mappability for short k -mers on highly repetitive plant genomes.
We also show that mappability can be computed on multiple sequences to identify marker genes illustrated by the example of E.
coli strains.
GenMap allows exporting the mappability information into different formats such as raw output, wig and bed files.
The application and its C++ source code is available on https://github.
com/cpockrandt/genmap .

Related Results

Umap and Bismap: quantifying genome and methylome mappability
Umap and Bismap: quantifying genome and methylome mappability
Abstract Motivation Short-read sequencing enables assessment of genetic and biochemical traits of individu...
GenMap: fast and exact computation of genome mappability
GenMap: fast and exact computation of genome mappability
The (k, e)-frequency counts the number of occurrences of each k-mer with up to e errors in a sequence. We present a fast and exact method GenMap to compute the (k, e)-mappability, ...
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
AbstractDNA methylation is an important component in vital biological functions such as embryonic development, carcinogenesis, and heritable regulation. Accurate methods to assess ...
Whole Genome Resequencing and 1000 Genomes Project
Whole Genome Resequencing and 1000 Genomes Project
Abstract The recent advances in sequencing technologies have enabled the whole human genome to be sequenced within weeks. To date, several human...
Characterising genome architectures using Genome Decomposition Analysis
Characterising genome architectures using Genome Decomposition Analysis
Abstract Genome architecture describes how genes and other features are arranged in genomes. These arrangements reflect the evolutionary pressure...
Comparative genomics reveals insights into anuran genome size evolution
Comparative genomics reveals insights into anuran genome size evolution
Abstract Background Amphibians, particularly anurans, display an enormous variation in genome size. Due to the unavailability of whole genome datase...
EVOLVING CONSUMER PREFERENCES ON THE FAST FASHION MARKET
EVOLVING CONSUMER PREFERENCES ON THE FAST FASHION MARKET
Eter Kharaishvili E-mail: eter.kharaishvili@tsu.ge Professor, Ivane Javakhishvili Tbilisi State University Tbilisi, Georgia https://orcid.org/0000-0003-4013-7354   Nino Lobzh...
The Evolutionary Patterns of Genome Size in Ensifera (Insecta: Orthoptera)
The Evolutionary Patterns of Genome Size in Ensifera (Insecta: Orthoptera)
Genomic size variation has long been a focus for biologists. However, due to the lack of genome size data, the mechanisms behind this variation and the biological significance of i...

Back to Top