Non-identifiable Pedigrees and a Bayesian Solution

Kirkpatrick, Bonnie

doi:10.1007/978-3-642-30191-9_14

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 7292))

Included in the following conference series:

International Symposium on Bioinformatics Research and Applications

Abstract

Some methods aim to correct or test for relationships, or to reconstruct the pedigree (family tree). We show that these methods cannot resolve ties for correct relationships because of non-identifiability of the pedigree likelihood, which is the probability of inheriting the data under the pedigree model. This means that no likelihood-based method can produce a correct pedigree inference with high probability. This lack of reliability is critical for both health and forensics applications.

Pedigree inference methods use a structured machine learning approach in which the objective is to find the pedigree graph that maximizes the likelihood. Known pedigrees are useful for both association and linkage analysis, which aim to find the regions of the genome that are associated with the presence or absence of a particular disease. Errors in pedigree prediction therefore have dramatic effects on downstream analysis.

In this paper we present the first discussion of multiple typed individuals in non-isomorphic pedigrees, \(\mathcal{P}\) and \(\mathcal{Q}\), where the likelihoods are non-identifiable, that is, \(Pr[G~|~\mathcal{P},\theta] = Pr[G~|~\mathcal{Q},\theta]\) for all input data \(G\) and all recombination rate parameters \(\theta\). While non-identifiable pairs were previously known, we give an example having data for multiple individuals.

Additionally, we provide a deeper understanding of the general discrete structures driving these non-identifiability examples, as well as results to guide algorithms that wish to examine only identifiable pedigrees. This paper introduces a general criterion for establishing whether a pair of pedigrees is non-identifiable and two easy-to-compute criteria guaranteeing identifiability. Finally, we suggest a method for dealing with non-identifiable likelihoods: use Bayes' rule to obtain the posterior from the likelihood and prior. We propose a prior guaranteeing that the posterior distinguishes all pairs of pedigrees.
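
As a brief illustration of why a prior can break such ties, the following is a sketch under stated assumptions and not the paper's own derivation: the prior \(Pr[\mathcal{P}]\) is assumed not to depend on \(\theta\), and \(\mathcal{R}\) is a hypothetical index ranging over whatever candidate set of pedigrees is being considered. Bayes' rule then gives

\[
Pr[\mathcal{P}~|~G,\theta] = \frac{Pr[G~|~\mathcal{P},\theta]\,Pr[\mathcal{P}]}{\sum_{\mathcal{R}} Pr[G~|~\mathcal{R},\theta]\,Pr[\mathcal{R}]}.
\]

For a non-identifiable pair, \(Pr[G~|~\mathcal{P},\theta] = Pr[G~|~\mathcal{Q},\theta]\), so whenever this common likelihood is positive the shared factors cancel in the ratio of posteriors:

\[
\frac{Pr[\mathcal{P}~|~G,\theta]}{Pr[\mathcal{Q}~|~G,\theta]} = \frac{Pr[\mathcal{P}]}{Pr[\mathcal{Q}]}.
\]

Under these assumptions, any prior with \(Pr[\mathcal{P}] \neq Pr[\mathcal{Q}]\) yields different posteriors for the two pedigrees even though the data alone cannot separate them. The specific prior proposed in the paper is not stated in this abstract.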

Author information

Authors and Affiliations

University of British Columbia, Canada
Bonnie Kirkpatrick

Editor information

Editors and Affiliations

Departments of Bioengineering and Electrical Engineering, University of Texas at Dallas, 75080, Richardson, TX, USA
Leonidas Bleris
Department of Computer Science and Engineering, University of Connecticut, 06269, Storrs, CT, USA
Ion Măndoiu
Department of Biological Sciences, Carnegie Mellon University, 15213, Pittsburgh, PA, USA
Russell Schwartz
School of Information Science and Engineering, Central South University, 410083, Changsha, China
Jianxin Wang

About this paper

Cite this paper

Kirkpatrick, B. (2012). Non-identifiable Pedigrees and a Bayesian Solution. In: Bleris, L., Măndoiu, I., Schwartz, R., Wang, J. (eds) Bioinformatics Research and Applications. ISBRA 2012. Lecture Notes in Computer Science(), vol 7292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30191-9_14

DOI: https://doi.org/10.1007/978-3-642-30191-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30190-2
Online ISBN: 978-3-642-30191-9
eBook Packages: Computer Science, Computer Science (R0)
