Abstract
Probabilistic genoty** permits a comparison of forensic evidence given hypotheses regarding the origin of observed short tandem repeat alleles in a mixed DNA profile. Using the publicly available R package forensim, it has been proposed that mixtures with non-contributors from low genetic diversity populations are more likely to be mistakenly identified as contributors to a mixture than non-contributors from high genetic diversity populations. We hypothesized that these observations are attributed to the unique distribution of alleles in the reference population and may not generalize to other samplings of the same population. We used forensim to simulate 200 US populations (50 each of self-reported African-American, Asian-American, European-American, and Hispanic descent). We compared likelihood ratios for 2400 mixtures to those derived from published data and identified stark differences. A minimum of ten population replicates were required to reduce observed differences relative to published data. Deviations from Hardy–Weinberg equilibrium and allele frequency distributions suggest that simulated populations should be sufficiently evaluated for expectations of population genetic parameters prior to use in DNA mixture modeling experiments. Overall, our findings support the utility of forensim and further describe its suitability to model population genetic parameters but suggest that a single population replicate (directly ascertained or simulated) may be insufficient to make conclusions about a given DNA mixture.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00414-022-02822-0/MediaObjects/414_2022_2822_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00414-022-02822-0/MediaObjects/414_2022_2822_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00414-022-02822-0/MediaObjects/414_2022_2822_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00414-022-02822-0/MediaObjects/414_2022_2822_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00414-022-02822-0/MediaObjects/414_2022_2822_Fig5_HTML.png)
Similar content being viewed by others
References
Haned H (2011) Forensim: an open-source initiative for the evaluation of statistical methods in forensic genetics. Forensic Sci Int Genet 5:265–268
Haned H, Benschop CCG, Gill PD, Sijen T (2015) Complex DNA mixture analysis in a forensic context: evaluating the probative value using a likelihood ratio model. Forensic Sci Int Genet 16:17–25
Curran JM (2008) A MCMC method for resolving two person mixtures. Sci Justice 48:168–177
Haned H, Gill P, Lohmueller K, Inman K, Rudin N (2016) Validation of probabilistic genoty** software for use in forensic DNA casework: definitions and illustrations. Sci Justice 56:104–108
Kelly H, Bright JA, Buckleton JS, Curran JM (2014) A comparison of statistical models for the analysis of complex forensic DNA profiles. Sci Justice 54:66–70
Manabe S, Morimoto C, Hamano Y, Fujimoto S, Tamaki K (2017): Development and validation of open-source software for DNA mixture interpretation based on a quantitative continuous model. PLoS One. 12:e0188183.
Marciano MA, Adelman JD (2017) PACE: probabilistic assessment for contributor estimation- a machine learning-based assessment of the number of contributors in DNA mixtures. Forensic Sci Int Genet 27:82–91
Caberio N, Flores M, Kalaydjian C, Paunovich M, Lobpez E, Reyes R, et al. (2019): Are individuals from populations with low genetic diversity more likely to be wrongly identified as contributors to DNA mixtures? . 30th International Symposium on Human Identification. Palm Springs, CA, USA: Promega Corporation.
Bright JA, Curran JM, Buckleton JS (2014) The effect of the uncertainty in the number of contributors to mixed DNA profiles on profile interpretation. Forensic Sci Int Genet 12:208–214
Haned H, Egeland T, Pontier D, Pène L, Gill P (2011) Estimating drop-out probabilities in forensic DNA samples: a simulation approach to evaluate different models. Forensic Sci Int Genet 5:525–531
Haned H, Slooten K, Gill P (2012) Exploratory data analysis for the interpretation of low template DNA mixtures. Forensic Sci Int Genet 6:762–774
Prieto L, Haned H, Mosquera A, Crespillo M, Alemañ M, Aler M et al (2014) Euroforgen-NoE collaborative exercise on LRmix to demonstrate standardization of the interpretation of complex DNA profiles. Forensic Sci Int Genet 9:47–54
Smart U, Cihlar JC, Mandape SN, Muenzler M, King JL, Budowle B, et al. (2021): A continuous statistical phasing framework for the analysis of forensic mitochondrial DNA mixtures. Genes (Basel). 12.
Coble MD, Bright JA, Buckleton JS, Curran JM (2015) Uncertainty in the number of contributors in the proposed new CODIS set. Forensic Sci Int Genet 19:207–211
Novroski NMM, King JL, Churchill JD, Seah LH, Budowle B (2016) Characterization of genetic sequence variation of 58 STR loci in four major population groups. Forensic Sci Int Genet 25:214–226
Ye Z, Wang Z, Hou Y (2020): Does Bonferroni correction “rescue” the deviation from Hardy-Weinberg equilibrium? Forensic Sci Int Genet. 46:102254.
Chong KWY, Syn CK (2021) Uncertainty in estimating the number of contributors from simulated DNA mixture profiles, with and without allele dropout, from Chinese, Malay, Indian, and Caucasian ethnic populations. Sci Rep 11:5249
Norsworthy S, Lun DS, Grgicak CM (2018) Determining the number of contributors to DNA mixtures in the low-template regime: exploring the impacts of sampling and detection effects. Leg Med (Tokyo) 32:1–8
Rosenberg NA, Kang JT (2015) Genetic diversity and societally important disparities. Genetics 201:1–12
Gettings KB, Borsuk LA, Steffen CR, Kiesler KM, Vallone PM (2018) Sequence-based U.S. population data for 27 autosomal STR loci. Forensic Sci Int Genet 37:106–115
Young BA, Gettings KB, McCord B, Vallone PM (2019) Estimating number of contributors in massively parallel sequencing data of STR loci. Forensic Sci Int Genet 38:15–22
Devesse L, Davenport L, Borsuk L, Gettings K, Mason-Buck G, Vallone PM, et al. (2020): Classification of STR allelic variation using massively parallel sequencing and assessment of flanking region power. Forensic Sci Int Genet. 48:102356.
Rohlfs RV, Fullerton SM, Weir BS (2012): Familial identification: population structure and relationship distinguishability. PLoS Genet. 8:e1002469.
Acknowledgements
Points of view in this document are those of the authors and do not necessarily represent the official position or policies of their organizations. The data generation from Novroski et al., were supported in part by award no. 2015-DN-BX-K067, awarded by the National Institute of Justice, Office of Justice Programs, US Department of Justice.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Research involving human participants and/or animals
The data used in this study are freely available datasets, where no Institutional Review Board/Research Ethics Board permissions were required.
Informed consent
Not applicable.
Conflict of interest
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Nicole MM Novroski and Frank R Wendt equally contributed to this paper.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Novroski, N.M.M., Moo-Choy, A. & Wendt, F.R. Allele frequencies and minor contributor match statistic convergence using simulated population replicates. Int J Legal Med 136, 1227–1235 (2022). https://doi.org/10.1007/s00414-022-02822-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00414-022-02822-0