Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

“polishCLR: a Nextflow workflow for polishing PacBio CLR genome assemblies”

View through CrossRef
Abstract Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs. However, assemblies from some third generation long read technologies, such as Pacific Biosciences (PacBio) Continuous Long Reads (CLR), have a high error rate. Such errors can be corrected with short reads through a process called polishing. Although best practices for polishing non-model de novo genome assemblies were recently described by the Vertebrate Genome Project (VGP) Assembly community, there is a need for a publicly available, reproducible workflow that can be easily implemented and run on a conventional high performance computing environment. Here, we describe polishCLR ( https://github.com/isugifNF/polishCLR ), a reproducible Nextflow workflow that implements best practices for polishing assemblies made from CLR data. PolishCLR can be initiated from several input options that extend best practices to suboptimal cases. It also provides re-entry points throughout several key processes including identifying duplicate haplotypes in purge_dups, allowing a break for scaffolding if data are available, and throughout multiple rounds of polishing and evaluation with Arrow and FreeBayes. PolishCLR is containerized and publicly available for the greater assembly community as a tool to complete assemblies from existing, error-prone long-read data.
Title: “polishCLR: a Nextflow workflow for polishing PacBio CLR genome assemblies”
Description:
Abstract Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs.
However, assemblies from some third generation long read technologies, such as Pacific Biosciences (PacBio) Continuous Long Reads (CLR), have a high error rate.
Such errors can be corrected with short reads through a process called polishing.
Although best practices for polishing non-model de novo genome assemblies were recently described by the Vertebrate Genome Project (VGP) Assembly community, there is a need for a publicly available, reproducible workflow that can be easily implemented and run on a conventional high performance computing environment.
Here, we describe polishCLR ( https://github.
com/isugifNF/polishCLR ), a reproducible Nextflow workflow that implements best practices for polishing assemblies made from CLR data.
PolishCLR can be initiated from several input options that extend best practices to suboptimal cases.
It also provides re-entry points throughout several key processes including identifying duplicate haplotypes in purge_dups, allowing a break for scaffolding if data are available, and throughout multiple rounds of polishing and evaluation with Arrow and FreeBayes.
PolishCLR is containerized and publicly available for the greater assembly community as a tool to complete assemblies from existing, error-prone long-read data.

Related Results

polishCLR: A Nextflow Workflow for Polishing PacBio CLR Genome Assemblies
polishCLR: A Nextflow Workflow for Polishing PacBio CLR Genome Assemblies
Abstract Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs. However, assemblies from some third genera...
Recent Patents for Optical Component Polishing Technology
Recent Patents for Optical Component Polishing Technology
Background: Fluid jet polishing, Airbag polishing, Magnetorheological polishing, and Ion beam polishing are all emerging polishing technologies commonly used for processing optical...
Workflow graphical editor and translator to Nextflow
Workflow graphical editor and translator to Nextflow
Abstract* Background The Workflow Description Language (WDL) is an open standard that is widely used for bioinformatics workflows. Due to its declarative nature WDL workflows can b...
Pacific bioscience sequence technology: Review
Pacific bioscience sequence technology: Review
Pacific Biosciences has developed a platform that may sequence one molecule of DNA in a period via the polymerization of that strand with one enzyme. Single-molecule real-time sequ...
The effect of process parameters on chemical mechanical polishing of quartz glass
The effect of process parameters on chemical mechanical polishing of quartz glass
The effects of polishing pressure, polishing speed and pH value of the polishing slurry on the chemical activity of quartz glass, the material removal rate (MRR) and surface roughn...
An optimization study of polishing efficiency of blisk and its technological parameters
An optimization study of polishing efficiency of blisk and its technological parameters
When applied to blisk blade profile polishing of aero-engines, “five-axis NC + flexible grinding head + elastic grindstone” polishing technological equipment has advantages of high...
Analysis of epidemiology and novel mutation of Helicobacter pylori antibiotic resistance in South Korea
Analysis of epidemiology and novel mutation of Helicobacter pylori antibiotic resistance in South Korea
BackgroundHelicobacter pylori is a Gram‐negative bacterium associated with upper gastrointestinal disease, peptic ulcer, mucosa‐associated lymphoid tissue lymphoma and eradication ...

Back to Top