Constrained Consensus Sequence Algorithm for DNA Archiving
The paper describes an algorithm to compute a consensus sequence from a set of DNA sequences of approximatively identical length generated by 3rd sequencing generation technologies. Its purpose targets DNA storage and is guided by specific features that cannot be exhibited from biological data such as the exact length of the consensus sequences, the precise location of known patterns, the kmer composition, etc.
READ FULL TEXT