Structural variations (SVs) in a genome are now known as a prominent and important type of genetic variation. Among all types of SVs, the identification of transposon insertion polymorphisms (TIPs) is more challenging due to the highly repetitive nature of transposon sequences. We developed a computational method, TIP-finder, to identify TIPs through analysis of next generation personal genome data and their extremely large copy numbers. We tested the efficiency of TIP-finder with simulated data and are able to detect about 88% of TIPs with precision of ≥91%. Using TIP-finder to analyze the Solexa pair-end sequence data at deep coverage for six genomes representing two trio families, we identified a total of 5569 TIPs, consisting of 4881, 456, 91, and 141 insertions from Alu, L1, SVA and HERV, respectively, representing the most comprehensive analysis of such type of genetic variation.

Additional Metadata
Persistent URL dx.doi.org/10.1063/1.3663485
Conference Advances in Mathematical and Computational Methods: Addressing Modern Challenges of Science, Technology, and Society
Citation
Luo, X. (Xuemei), Dehne, F, & Liang, P. (Ping). (2011). Identification of transposon insertion polymorphisms by computational comparative analysis of next generation personal genome data. Presented at the Advances in Mathematical and Computational Methods: Addressing Modern Challenges of Science, Technology, and Society. doi:10.1063/1.3663485