Phased Multi de Bruijn Sequences
We introduce phased multi de Bruijn sequences, a generalization of de Bruijn sequences. A phased string is a string whose positions sequentially rotate through several alphabets; e.g., “0Ax1Ay1By0Az1Bx” rotates through alphabets $\Omega_0=\{0,1\}$, $\Omega_1=\{A,B\}$, and $\Omega_2=\{x,y,z\}$. We consider cyclic and linear phased strings in which all possible phased $k$-mers (phased strings of length $k$) occur with particular multiplicities, depending on their “phase” (the alphabet they start in). For example, consider the cycle (s)=(0Ax1Ay1By0Az0By1Bx0Ay0Bx1Bz1Ax0Bz1Az) in these alphabets. All possible phased 2-mers starting in phases 0, 1, and 2 respectively have multiplicities 3, 2, and 2; e.g., “0A” occurs three times, “Ax” occurs twice, and “z0” occurs twice (including the occurrence that wraps around the cycle). We determine parameters ($k$, number of phases, alphabet sizes, and multiplicities) for which this is possible. Then we count the total number of phased multi de Bruijn sequences for these parameters, both for cyclic and linear sequences. This extends classical de Bruijn sequences and multi de Bruijn sequences (our previous generalization of de Bruijn sequences in which all possible $k$-mers over one alphabet occur $m$ times each). Our method of counting the sequences uses a change of basis for the Laplacian matrix; this also gives a new proof for the number of classical de Bruijn sequences, as they are a special case of this framework.
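The multiplicities claimed for the example cycle can be checked mechanically. The following is a minimal sketch, not the authors' method: the function names `phased_kmer_counts` and `multiplicities` are illustrative, and it assumes the cycle length is a multiple of the number of phases and that position 0 is in phase 0.

```python
from collections import Counter
from itertools import product

# Alphabets and example cycle taken from the abstract.
ALPHABETS = ["01", "AB", "xyz"]
CYCLE = "0Ax1Ay1By0Az0By1Bx0Ay0Bx1Bz1Ax0Bz1Az"


def phased_kmer_counts(cycle, k, num_phases):
    """Count cyclic k-mers, grouped by the phase of their starting position."""
    n = len(cycle)
    counts = [Counter() for _ in range(num_phases)]
    for i in range(n):
        # Indices wrap modulo n so that k-mers crossing the end are included.
        kmer = "".join(cycle[(i + j) % n] for j in range(k))
        counts[i % num_phases][kmer] += 1
    return counts


def multiplicities(cycle, k, alphabets):
    """Return, per phase, the common multiplicity of all possible phased
    k-mers, or None if the k-mers of that phase do not all occur equally often."""
    p = len(alphabets)
    counts = phased_kmer_counts(cycle, k, p)
    result = []
    for phase in range(p):
        # Every possible phased k-mer starting in this phase.
        all_kmers = [
            "".join(t)
            for t in product(*(alphabets[(phase + j) % p] for j in range(k)))
        ]
        values = {counts[phase][kmer] for kmer in all_kmers}
        result.append(values.pop() if len(values) == 1 else None)
    return result


if __name__ == "__main__":
    print(multiplicities(CYCLE, 2, ALPHABETS))  # [3, 2, 2]
```

As a sanity check on the output, the cycle has length 36, so each phase has 12 starting positions; with 4, 6, and 6 possible phased 2-mers in phases 0, 1, and 2, the multiplicities must satisfy $3\cdot 4 = 2\cdot 6 = 2\cdot 6 = 12$.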
Related Results
Multi de Bruijn Sequences and the Cross-Join Method
We show a method to construct binary multi de Bruijn sequences using the cross-join method. We extend the proof given by Alhakim for ordinary de Bruijn sequences to the case of mul...
Building Large Updatable Colored de Bruijn Graphs via Merging
MOTIVATION: There exist several massive genomic and metagenomic data collection efforts, including GenomeTrakr and MetaSub, which are routinely updated with new data. To analyze s...
Disentangled Long-Read De Bruijn Graphs via Optical Maps
Pacific Biosciences (PacBio), the main third-generation sequencing technology, can produce scalable, high-throughput, unprecedented sequencing results throu...
Theoretical analysis and experimental measurement of digital multi-beam phased antenna array in the C frequency range
The choice of elements for constructing a phased antenna array providing a relative frequency bandwidth up to 9% for the transmission or reception of wireless communication system ...
Inspeksi Upper Wing Top Skin Panel Menggunakan Phased Array Ultrasonic Testing (PAUT) [Inspection of the Upper Wing Top Skin Panel Using Phased Array Ultrasonic Testing (PAUT)]
Non-Destructive Testing (NDT) is the most economical way to carry out inspections and to find defects. One NDT inspection method is Ultrasonic T...
Quantitative Analysis of Shallow Earthquake Sequences and Regional Earthquake Behavior: Implications for Earthquake Forecasting
This study is a quantitative investigation and characterization of earthquake sequences in the Central Volcanic Region (CVR) of New Zealand, and several regions in New Zea...
Computational protein design : un outil pour l'ingénierie des protéines et la biologie synthétique [Computational protein design: a tool for protein engineering and synthetic biology]
Computational protein design (CPD) is the search for amino acid sequences compatible with a targeted protein structure. The objective is to design a function ...

