TY - JOUR
T1 - Precise characterization of somatic complex structural variations from tumor/control paired long-read sequencing data with nanomonsv
AU - Shiraishi, Yuichi
AU - Koya, Junji
AU - Chiba, Kenichi
AU - Okada, Ai
AU - Arai, Yasuhito
AU - Saito, Yuki
AU - Shibata, Tatsuhiro
AU - Kataoka, Keisuke
N1 - Publisher Copyright:
© 2023 The Author(s).
PY - 2023/8/11
Y1 - 2023/8/11
N2 - We present our novel software, nanomonsv, for detecting somatic structural variations (SVs) using tumor and matched control long-read sequencing data with a single-base resolution. The current version of nanomonsv includes two detection modules, Canonical SV module, and Single breakend SV module. Using tumor/control paired long-read sequencing data from three cancer and their matched lymphoblastoid lines, we demonstrate that Canonical SV module can identify somatic SVs that can be captured by short-read technologies with higher precision and recall than existing methods. In addition, we have developed a workflow to classify mobile element insertions while elucidating their in-depth properties, such as 5′ truncations, internal inversions, as well as source sites for 3′ transductions. Furthermore, Single breakend SV module enables the detection of complex SVs that can only be identified by long-reads, such as SVs involving highly-repetitive centromeric sequences, and LINE1- and virus-mediated rearrangements. In summary, our approaches applied to cancer long-read sequencing data can reveal various features of somatic SVs and will lead to a better understanding of mutational processes and functional consequences of somatic SVs.
AB - We present our novel software, nanomonsv, for detecting somatic structural variations (SVs) using tumor and matched control long-read sequencing data with a single-base resolution. The current version of nanomonsv includes two detection modules, Canonical SV module, and Single breakend SV module. Using tumor/control paired long-read sequencing data from three cancer and their matched lymphoblastoid lines, we demonstrate that Canonical SV module can identify somatic SVs that can be captured by short-read technologies with higher precision and recall than existing methods. In addition, we have developed a workflow to classify mobile element insertions while elucidating their in-depth properties, such as 5′ truncations, internal inversions, as well as source sites for 3′ transductions. Furthermore, Single breakend SV module enables the detection of complex SVs that can only be identified by long-reads, such as SVs involving highly-repetitive centromeric sequences, and LINE1- and virus-mediated rearrangements. In summary, our approaches applied to cancer long-read sequencing data can reveal various features of somatic SVs and will lead to a better understanding of mutational processes and functional consequences of somatic SVs.
UR - https://www.scopus.com/pages/publications/85167844795
UR - https://www.scopus.com/inward/citedby.url?scp=85167844795&partnerID=8YFLogxK
U2 - 10.1093/nar/gkad526
DO - 10.1093/nar/gkad526
M3 - Article
C2 - 37336583
AN - SCOPUS:85167844795
SN - 0305-1048
VL - 51
SP - E74
JO - Nucleic acids research
JF - Nucleic acids research
IS - 14
ER -