Log in

Allele frequencies and minor contributor match statistic convergence using simulated population replicates

  • Original Article
  • Published:
International Journal of Legal Medicine Aims and scope Submit manuscript

Abstract

Probabilistic genoty** permits a comparison of forensic evidence given hypotheses regarding the origin of observed short tandem repeat alleles in a mixed DNA profile. Using the publicly available R package forensim, it has been proposed that mixtures with non-contributors from low genetic diversity populations are more likely to be mistakenly identified as contributors to a mixture than non-contributors from high genetic diversity populations. We hypothesized that these observations are attributed to the unique distribution of alleles in the reference population and may not generalize to other samplings of the same population. We used forensim to simulate 200 US populations (50 each of self-reported African-American, Asian-American, European-American, and Hispanic descent). We compared likelihood ratios for 2400 mixtures to those derived from published data and identified stark differences. A minimum of ten population replicates were required to reduce observed differences relative to published data. Deviations from Hardy–Weinberg equilibrium and allele frequency distributions suggest that simulated populations should be sufficiently evaluated for expectations of population genetic parameters prior to use in DNA mixture modeling experiments. Overall, our findings support the utility of forensim and further describe its suitability to model population genetic parameters but suggest that a single population replicate (directly ascertained or simulated) may be insufficient to make conclusions about a given DNA mixture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Haned H (2011) Forensim: an open-source initiative for the evaluation of statistical methods in forensic genetics. Forensic Sci Int Genet 5:265–268

    Article  CAS  Google Scholar 

  2. Haned H, Benschop CCG, Gill PD, Sijen T (2015) Complex DNA mixture analysis in a forensic context: evaluating the probative value using a likelihood ratio model. Forensic Sci Int Genet 16:17–25

    Article  CAS  Google Scholar 

  3. Curran JM (2008) A MCMC method for resolving two person mixtures. Sci Justice 48:168–177

    Article  CAS  Google Scholar 

  4. Haned H, Gill P, Lohmueller K, Inman K, Rudin N (2016) Validation of probabilistic genoty** software for use in forensic DNA casework: definitions and illustrations. Sci Justice 56:104–108

    Article  Google Scholar 

  5. Kelly H, Bright JA, Buckleton JS, Curran JM (2014) A comparison of statistical models for the analysis of complex forensic DNA profiles. Sci Justice 54:66–70

    Article  Google Scholar 

  6. Manabe S, Morimoto C, Hamano Y, Fujimoto S, Tamaki K (2017): Development and validation of open-source software for DNA mixture interpretation based on a quantitative continuous model. PLoS One. 12:e0188183.

  7. Marciano MA, Adelman JD (2017) PACE: probabilistic assessment for contributor estimation- a machine learning-based assessment of the number of contributors in DNA mixtures. Forensic Sci Int Genet 27:82–91

    Article  CAS  Google Scholar 

  8. Caberio N, Flores M, Kalaydjian C, Paunovich M, Lobpez E, Reyes R, et al. (2019): Are individuals from populations with low genetic diversity more likely to be wrongly identified as contributors to DNA mixtures? . 30th International Symposium on Human Identification. Palm Springs, CA, USA: Promega Corporation.

  9. Bright JA, Curran JM, Buckleton JS (2014) The effect of the uncertainty in the number of contributors to mixed DNA profiles on profile interpretation. Forensic Sci Int Genet 12:208–214

    Article  CAS  Google Scholar 

  10. Haned H, Egeland T, Pontier D, Pène L, Gill P (2011) Estimating drop-out probabilities in forensic DNA samples: a simulation approach to evaluate different models. Forensic Sci Int Genet 5:525–531

    Article  CAS  Google Scholar 

  11. Haned H, Slooten K, Gill P (2012) Exploratory data analysis for the interpretation of low template DNA mixtures. Forensic Sci Int Genet 6:762–774

    Article  CAS  Google Scholar 

  12. Prieto L, Haned H, Mosquera A, Crespillo M, Alemañ M, Aler M et al (2014) Euroforgen-NoE collaborative exercise on LRmix to demonstrate standardization of the interpretation of complex DNA profiles. Forensic Sci Int Genet 9:47–54

    Article  CAS  Google Scholar 

  13. Smart U, Cihlar JC, Mandape SN, Muenzler M, King JL, Budowle B, et al. (2021): A continuous statistical phasing framework for the analysis of forensic mitochondrial DNA mixtures. Genes (Basel). 12.

  14. Coble MD, Bright JA, Buckleton JS, Curran JM (2015) Uncertainty in the number of contributors in the proposed new CODIS set. Forensic Sci Int Genet 19:207–211

    Article  Google Scholar 

  15. Novroski NMM, King JL, Churchill JD, Seah LH, Budowle B (2016) Characterization of genetic sequence variation of 58 STR loci in four major population groups. Forensic Sci Int Genet 25:214–226

    Article  CAS  Google Scholar 

  16. Ye Z, Wang Z, Hou Y (2020): Does Bonferroni correction “rescue” the deviation from Hardy-Weinberg equilibrium? Forensic Sci Int Genet. 46:102254.

  17. Chong KWY, Syn CK (2021) Uncertainty in estimating the number of contributors from simulated DNA mixture profiles, with and without allele dropout, from Chinese, Malay, Indian, and Caucasian ethnic populations. Sci Rep 11:5249

    Article  CAS  Google Scholar 

  18. Norsworthy S, Lun DS, Grgicak CM (2018) Determining the number of contributors to DNA mixtures in the low-template regime: exploring the impacts of sampling and detection effects. Leg Med (Tokyo) 32:1–8

    Article  CAS  Google Scholar 

  19. Rosenberg NA, Kang JT (2015) Genetic diversity and societally important disparities. Genetics 201:1–12

    Article  CAS  Google Scholar 

  20. Gettings KB, Borsuk LA, Steffen CR, Kiesler KM, Vallone PM (2018) Sequence-based U.S. population data for 27 autosomal STR loci. Forensic Sci Int Genet 37:106–115

    Article  CAS  Google Scholar 

  21. Young BA, Gettings KB, McCord B, Vallone PM (2019) Estimating number of contributors in massively parallel sequencing data of STR loci. Forensic Sci Int Genet 38:15–22

    Article  CAS  Google Scholar 

  22. Devesse L, Davenport L, Borsuk L, Gettings K, Mason-Buck G, Vallone PM, et al. (2020): Classification of STR allelic variation using massively parallel sequencing and assessment of flanking region power. Forensic Sci Int Genet. 48:102356.

  23. Rohlfs RV, Fullerton SM, Weir BS (2012): Familial identification: population structure and relationship distinguishability. PLoS Genet. 8:e1002469.

Download references

Acknowledgements

Points of view in this document are those of the authors and do not necessarily represent the official position or policies of their organizations. The data generation from Novroski et al., were supported in part by award no. 2015-DN-BX-K067, awarded by the National Institute of Justice, Office of Justice Programs, US Department of Justice.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Nicole M. M. Novroski or Frank R. Wendt.

Ethics declarations

Research involving human participants and/or animals

The data used in this study are freely available datasets, where no Institutional Review Board/Research Ethics Board permissions were required.

Informed consent

Not applicable.

Conflict of interest

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Nicole MM Novroski and Frank R Wendt equally contributed to this paper.

Supplementary Information

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Novroski, N.M.M., Moo-Choy, A. & Wendt, F.R. Allele frequencies and minor contributor match statistic convergence using simulated population replicates. Int J Legal Med 136, 1227–1235 (2022). https://doi.org/10.1007/s00414-022-02822-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00414-022-02822-0

Keywords

Navigation