EEEB GU4055
1. Review notebook assignments: de Bruijn graphs.
2. Discuss the assigned readings: de Bruijn / Euler.
3. Introduce new topic: de novo genome assembly
kmers are used to decompose reads into smaller units that align to form a path.
Shortest possible superstring that contains all substrings.
denovo genome assembly is computationally demanding. Requires reads that cover the full genome many times (e.g., 50X). The end goal is to assemble scaffolds that match to chromosomes -- the real *bits* of the genome.
Even the shortest piece of the human genome is 48Mbp!
Sometimes intermediate/draft genomes are good enough to answer many questions. For many tasks, though, they are not. e.g., genome annotation. Trade off in costs and time.
Hybrid genome assembly methods.
Hybrid genome assembly methods.
Hybrid genome assembly methods.