A Guide to Sequencing for Long Repetitive Regions

Research output: Contribution to journalArticlepeer-review


Full-length analysis of genes with highly repetitive sequences is challenging in two respects: assembly algorithm and sequencing accuracy. The de Bruijn graph often used in short-read assembly cannot distinguish adjacent repeat units. On the other hand, the accuracy of long reads is not yet high enough to identify each and every repeat unit. In this chapter, I present an example of a strategy to solve these problems and obtain the full length of long repeats by combining the extraction and assembly of repeat units based on overlap-layout-consensus and scaffolding by long reads.

Original languageEnglish
Pages (from-to)131-146
Number of pages16
JournalMethods in molecular biology (Clifton, N.J.)
Publication statusPublished - 2023


  • De novo sequencing
  • Highly repetitive sequence
  • Non-model organism
  • Overlap-layout-consensus
  • Structural protein

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics


Dive into the research topics of 'A Guide to Sequencing for Long Repetitive Regions'. Together they form a unique fingerprint.

Cite this