Javascript must be enabled to continue!
Phased Multi de Bruijn Sequences
View through CrossRef
We introduce phased multi de Bruijn sequences, a generalization of de Bruijn sequences. A phased string is a string whose positions sequentially rotate through several alphabets; e.g., “0Ax1Ay1By0Az1Bx” rotates through alphabets $\Omega_0=\{0,1\}$, $\Omega_1=\{A,B\}$, and $\Omega_2=\{x,y,z\}$. We consider cyclic and linear phased strings in which all possible phased $k$-mers (phased strings of length $k$) occur with particular multiplicities, depending on their “phase” (the alphabet they start in). For example, consider the cycle (s)=(0Ax1Ay1By0Az0By1Bx0Ay0Bx1Bz1Ax0Bz1Az) in these alphabets. All possible phased 2-mers starting in phases 0, 1, and 2 respectively have multiplicities 3, 2, and 2; e.g., “0A” occurs three times, “Ax” occurs twice, and “z0” occurs twice (including the occurrence that wraps around the cycle). We determine parameters ($k$, number of phases, alphabet sizes, and multiplicities) for which this is possible. Then we count the total number of phased multi de Bruijn sequences for these parameters, both for cyclic and linear sequences. This extends classical de Bruijn sequences and multi de Bruijn sequences (our previous generalization of de Bruijn sequences in which all possible $k$-mers over one alphabet occur $m$ times each). Our method of counting the sequences uses a change of basis for the Laplacian matrix; this also gives a new proof for the number of classical de Bruijn sequences, as they are a special case of this framework.
Title: Phased Multi de Bruijn Sequences
Description:
We introduce phased multi de Bruijn sequences, a generalization of de Bruijn sequences.
A phased string is a string whose positions sequentially rotate through several alphabets; e.
g.
, “0Ax1Ay1By0Az1Bx” rotates through alphabets $\Omega_0=\{0,1\}$, $\Omega_1=\{A,B\}$, and $\Omega_2=\{x,y,z\}$.
We consider cyclic and linear phased strings in which all possible phased $k$-mers (phased strings of length $k$) occur with particular multiplicities, depending on their “phase” (the alphabet they start in).
For example, consider the cycle (s)=(0Ax1Ay1By0Az0By1Bx0Ay0Bx1Bz1Ax0Bz1Az) in these alphabets.
All possible phased 2-mers starting in phases 0, 1, and 2 respectively have multiplicities 3, 2, and 2; e.
g.
, “0A” occurs three times, “Ax” occurs twice, and “z0” occurs twice (including the occurrence that wraps around the cycle).
We determine parameters ($k$, number of phases, alphabet sizes, and multiplicities) for which this is possible.
Then we count the total number of phased multi de Bruijn sequences for these parameters, both for cyclic and linear sequences.
This extends classical de Bruijn sequences and multi de Bruijn sequences (our previous generalization of de Bruijn sequences in which all possible $k$-mers over one alphabet occur $m$ times each).
Our method of counting the sequences uses a change of basis for the Laplacian matrix; this also gives a new proof for the number of classical de Bruijn sequences, as they are a special case of this framework.
Related Results
Multi de Bruijn Sequences and the Cross-Join Method
Multi de Bruijn Sequences and the Cross-Join Method
We show a method to construct binary multi de Bruijn sequences using the cross-join method. We extend the proof given by Alhakim for ordinary de Bruijn sequences to the case of mul...
Building Large Updatable Colored de Bruijn Graphs via Merging
Building Large Updatable Colored de Bruijn Graphs via Merging
MOTIVATION: There exists several massive genomic and metagenomic data collection efforts, including GenomeTrakr and MetaSub, which are routinely updated with new data. To analyze s...
Distribution of Variables in Lambda-Terms with Restrictions on De Bruijn Indices and De Bruijn Levels
Distribution of Variables in Lambda-Terms with Restrictions on De Bruijn Indices and De Bruijn Levels
We consider two special subclasses of lambda-terms that are restricted by a bound on the number of abstractions between a variable and its binding lambda, the so-called De-Bruijn i...
MBG: Minimizer-based Sparse de Bruijn Graph Construction
MBG: Minimizer-based Sparse de Bruijn Graph Construction
Motivation
De Bruijn graphs can be constructed from short reads efficiently and have been used for many purposes. Traditionally long read sequencing technologies ...
Buffering Updates Enables Efficient Dynamic de Bruijn Graphs
Buffering Updates Enables Efficient Dynamic de Bruijn Graphs
Abstract
Motivation
The de Bruijn graph has become a ubiquitous graph model for biological data ever since its initial introduc...
Phased Array UT Technology for Nuclear Pipe Inspection
Phased Array UT Technology for Nuclear Pipe Inspection
Phased array UT technologles have been applied to improve pipe inspection speed and reliability. Recent results on similar and dissimilar metal welds show clear, accurate, and fast...
Application of Aperiodic 'Einstein' Monotile in Limited Field of View Phased Arrays
Application of Aperiodic 'Einstein' Monotile in Limited Field of View Phased Arrays
The discovery of the 'Einstein' monotile represents one of the most significant advancements in geometry in 2023. Research based on this monotile has been initiated across various ...
Disentangled Long-Read De Bruijn Graphs via Optical Maps
Disentangled Long-Read De Bruijn Graphs via Optical Maps
Abstract
Pacific Biosciences (PacBio), the main third generation sequencing technology can produce scalable, high-throughput, unprecedented sequencing results throu...

