Long-read de novo assemblies: Glossary

Key Points

Introduction
  • This part of the lesson will mostly be given in the form of a lecture

Setup for the workshop
  • The VM can be used after the workshop

  • Installing the required packages on your own machine may require root privileges

QC and evaluation of long read data
  • PacBio and Nanopore produce reads that are many kilobases long

  • In practice, the quality of the input DNA, the library preparation and many other factors determine the read length distribution

  • PacBio and Nanopore have different types of errors
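
The read length distribution mentioned above is usually summarized with a few numbers. The following is a minimal sketch, assuming the read lengths have already been extracted into a plain Python list; the function names and the example values are hypothetical and are not taken from the workshop data or from any particular QC tool.

```python
from statistics import mean, median


def read_length_n50(lengths):
    """Return the N50: the length L such that reads of length >= L
    together contain at least half of all sequenced bases."""
    if not lengths:
        raise ValueError("no read lengths given")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest read down until half of the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


def summarize_read_lengths(lengths):
    """Collect basic summary statistics for a list of read lengths."""
    return {
        "reads": len(lengths),
        "bases": sum(lengths),
        "mean": mean(lengths),
        "median": median(lengths),
        "max": max(lengths),
        "n50": read_length_n50(lengths),
    }


# Hypothetical read lengths in bases (illustrative values only).
example_lengths = [2000, 3000, 5000, 8000, 12000]
print(summarize_read_lengths(example_lengths))
```

The N50 is weighted by bases rather than by read count, so a small number of very long reads raises it much more than it raises the median. Reporting both gives a better picture of the distribution.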

Assembly of long reads (and Illumina reads)
  • Each platform and each data set requires hands-on work with the assembler

Comparison and visualization of long read assemblies
  • Contigs that match (near-)exactly show up as a diagonal line in a mummerplot

  • Repeat content shows up as dots / lines across the image
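
The two patterns above can be reproduced with a toy dot plot. This is a sketch only: it uses exact k-mer matches on the forward strand in place of the MUMmer alignments that a mummerplot is built from, and the sequence, the value of `k` and all function names are hypothetical.

```python
def kmer_matches(seq_a, seq_b, k):
    """Return (i, j) pairs where seq_a[i:i+k] equals seq_b[j:j+k]."""
    # Index every k-mer of seq_b by its start positions.
    positions = {}
    for j in range(len(seq_b) - k + 1):
        positions.setdefault(seq_b[j:j + k], []).append(j)
    # Look up every k-mer of seq_a in that index.
    matches = []
    for i in range(len(seq_a) - k + 1):
        for j in positions.get(seq_a[i:i + k], []):
            matches.append((i, j))
    return matches


def split_matches(matches):
    """Separate matches on the main diagonal from off-diagonal matches."""
    diagonal = [(i, j) for i, j in matches if i == j]
    off_diagonal = [(i, j) for i, j in matches if i != j]
    return diagonal, off_diagonal


def render(matches, len_a, len_b, k):
    """Draw the matches as a text grid: columns are seq_a, rows are seq_b."""
    grid = [["." for _ in range(len_a - k + 1)] for _ in range(len_b - k + 1)]
    for i, j in matches:
        grid[j][i] = "*"
    return "\n".join("".join(row) for row in grid)


# Hypothetical toy sequence: the motif GATTACA occurs at both ends.
seq = "GATTACA" + "CCGT" + "GATTACA"
k = 4
matches = kmer_matches(seq, seq, k)
diagonal, off_diagonal = split_matches(matches)
print(len(diagonal), "diagonal matches,", len(off_diagonal), "off-diagonal matches")
print(render(matches, len(seq), len(seq), k))
```

Comparing the sequence with itself gives an unbroken main diagonal, and the repeated motif adds short parallel segments away from it. These are the same signatures to look for when comparing two assemblies.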

Validation of assemblies

Summary of obtained results and closing

Glossary

FIXME