ITD assembler: an algorithm for internal tandem duplication discovery from short-read sequencing data

dc.citation.articleNumber188en_US
dc.citation.journalTitleBMC Bioinformaticsen_US
dc.citation.volumeNumber17en_US
dc.contributor.authorRustagi, Navinen_US
dc.contributor.authorHampton, Oliver A.en_US
dc.contributor.authorLi, Jieen_US
dc.contributor.authorXi, Liuen_US
dc.contributor.authorGibbs, Richard A.en_US
dc.contributor.authorPlon, Sharon E.en_US
dc.contributor.authorKimmel, Mareken_US
dc.contributor.authorWheeler, David A.en_US
dc.date.accessioned2016-08-11T13:50:05Zen_US
dc.date.available2016-08-11T13:50:05Zen_US
dc.date.issued2016en_US
dc.date.updated2016-08-11T13:50:05Zen_US
dc.description.abstractAbstract Background Detection of tandem duplication within coding exons, referred to as internal tandem duplication (ITD), remains challenging due to inefficiencies in alignment of ITD-containing reads to the reference genome. There is a critical need to develop efficient methods to recover these important mutational events. Results In this paper we introduce ITD Assembler, a novel approach that rapidly evaluates all unmapped and partially mapped reads from whole exome NGS data using a De Bruijn graphs approach to select reads that harbor cycles of appropriate length, followed by assembly using overlap-layout-consensus. We tested ITD Assembler on The Cancer Genome Atlas AML dataset as a truth set. ITD Assembler identified the highest percentage of reported FLT3-ITDs when compared to other ITD detection algorithms, and discovered additional ITDs in FLT3, KIT, CEBPA, WT1 and other genes. Evidence of polymorphic ITDs in 54 genes were also found. Novel ITDs were validated by analyzing the corresponding RNA sequencing data. Conclusions ITD Assembler is a very sensitive tool which can detect partial, large and complex tandem duplications. This study highlights the need to more effectively look for ITD’s in other cancers and Mendelian diseases.en_US
dc.identifier.citationRustagi, Navin, Hampton, Oliver A., Li, Jie, et al.. "ITD assembler: an algorithm for internal tandem duplication discovery from short-read sequencing data." <i>BMC Bioinformatics,</i> 17, (2016) BioMed Central: http://dx.doi.org/10.1186/s12859-016-1031-8.en_US
dc.identifier.doihttp://dx.doi.org/10.1186/s12859-016-1031-8en_US
dc.identifier.urihttps://hdl.handle.net/1911/91216en_US
dc.language.isoengen_US
dc.publisherBioMed Centralen_US
dc.rightsThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.en_US
dc.rights.holderRustagi et al.en_US
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_US
dc.titleITD assembler: an algorithm for internal tandem duplication discovery from short-read sequencing dataen_US
dc.typeJournal articleen_US
dc.type.dcmiTexten_US
dc.type.publicationpublisher versionen_US
local.sword.agentBioMed Centralen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12859_2016_Article_1031.pdf
Size:
852.04 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: