Abstract

Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father’s age at conception and the number of DNMs in female offspring’s X chromosome, consistent with previous literature reports.

Overview publication

TitleA framework for the detection of de novo mutations in family-based sequencing data.
DateFebruary 1st, 2017
Issue nameEuropean journal of human genetics : EJHG
Issue numberv25.2:227-233
DOI10.1038/ejhg.2016.147
PubMed27876817
AuthorsFrancioli LC, Cretu-Stancu M, Garimella KV, Fromer M, Kloosterman WP, Samocha KE, Neale BM, Daly MJ, Banks E, DePristo MA & de Bakker PI
InfoGenome of the Netherlands consortium, Wijmenga C, Swertz MA, van Duijn CM, Boomsma DI, Slagboom P, van Ommen GB, de Bakker PI, Swertz MA, Francioli LC, van Dijk F, Menelaou A, Neerincx PB, Pulit SL, Deelen P, Elbers CC, Francesco Palamara P, Pe'er I, Abdellaoui A, Kloosterman WP, van Oven M, Vermaat M, Li M, Laros JF, Stoneking M, de Knijff P, Kayser M, Veldink JH, van den Berg LH, Byelas H, den Dunnen JT, Dijkstra M, Amin N, van der Velde KJ, Hottenga JJ, van Setten J, van Leeuwen EM, Kanterakis A, Kattenberg M, Karssen LC, van Schaik BD, Bot J, Nijman IJ, Renkens I, van Enckevort D, Mei H, Koval V, Estrada K, Medina-Gomez C, Ye K, Lameijer EW, Moed MH, Hehir-Kwa JY, Handsaker RE, McCarroll SA, Sunyaev SR, Polak P, Vuzman D, Sohail M, Hormozdiari F, Marschall T, Schönhuth A, Guryev V, de Bakker PI, Slagboom PE, Beekman MB, de Craen AJ, Suchiman HE, Hofman A, van Duijn CM, Oostra B, Isaacs A, Amin N, Rivadeneira F, Uitterlinden AG, Boomsma DI, Willemsen G, Platteel M, Pitts SJ, Potluri S, Sundar P, Cox DR, Li Q, Li Y, Du Y, Chen R, Cao H, Li N, Cao S, Wang J, Bovenberg JA, Brandsma M
Read Read publication