This article may be too technical for most readers to understand.(January 2021) |
Duplex sequencing is a library preparation and analysis method for next-generation sequencing (NGS) platforms that employs random tagging of double-stranded DNA to detect mutations with higher accuracy and lower error rates.
This method uses degenerate molecular tags in addition to sequencing adapters to recognize reads originating from each strand of DNA. As the two strands are complementary, true mutations are found at the same position in both strands. In contrast, PCR or sequencing errors result in mutations in only one strand and can thus be discounted as technical error. Duplex sequencing theoretically can detect mutations with frequencies as low as 5 x 10−8 --that is more than 10,000 times higher in accuracy compared to the conventional next-generation sequencing methods.[1][2]
The estimated error rate of standard next-generation sequencing platforms is 10−2 to 10−3 per base call. With this error rate, billions of base calls that are produced by NGS will result in millions of errors. The errors are introduced during sample preparation and sequencing such as polymerase chain reaction, sequencing, and image analysis errors. While the NGS platforms' error rate is acceptable in some applications such as detection of clonal variants, it is a major limitation for applications that require higher accuracy for detection of low-frequency variants such as detection of intra-organismal mosaicism, subclonal variants in genetically heterogeneous cancers, or circulating tumor DNA.[3][4][5]
Several library preparation strategies have been developed that increase accuracy of NGS platforms such as molecular barcoding and circular consensus sequencing method.[6][7][8][9] Like NGS platforms, the data generated by these methods originates from a single strand of DNA, and therefore the errors that are introduced during PCR amplification, tissue processing, DNA extraction, hybridization capture (where used) or DNA sequencing itself can still be distinguished as a true variant. The duplex sequencing method addresses this problem by taking advantage of the complementary nature of two strands of DNA and confirming only variants that are present in both strands of DNA. Because the probability of two complementary errors arising at the same location in both strands is exceedingly low, duplex sequencing increases the accuracy of sequencing significantly.[1][6][8][10]