Statistical methods for discrimination of STR genotypes using high resolution melt curve data

Cloudy, Darianne C.; Boone, Edward L.; Kuehnert, Kristi; Smith, Chastyn; Cox, Jordan O.; Seashols-Williams, Sarah J.; Green, Tracey Dawson

doi:10.1007/s00414-024-03289-x

Statistical methods for discrimination of STR genotypes using high resolution melt curve data

Original Article
Open access
Published: 13 July 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Legal Medicine Aims and scope Submit manuscript

Statistical methods for discrimination of STR genotypes using high resolution melt curve data

Download PDF

Abstract

Despite the improvements in forensic DNA quantification methods that allow for the early detection of low template/challenged DNA samples, complicating stochastic effects are not revealed until the final stage of the DNA analysis workflow. An assay that would provide genoty** information at the earlier stage of quantification would allow examiners to make critical adjustments prior to STR amplification allowing for potentially exclusionary information to be immediately reported. Specifically, qPCR instruments often have dissociation curve and/or high-resolution melt curve (HRM) capabilities; this, coupled with statistical prediction analysis, could provide additional information regarding STR genotypes present. Thus, this study aimed to evaluate Qiagen’s principal component analysis (PCA)-based ScreenClust^® HRM^® software and a linear discriminant analysis (LDA)-based technique for their abilities to accurately predict genotypes and similar groups of genotypes from HRM data. Melt curves from single source samples were generated from STR D5S818 and D18S51 amplicons using a Rotor-Gene^® Q qPCR instrument and EvaGreen^® intercalating dye. When used to predict D5S818 genotypes for unknown samples, LDA analysis outperformed the PCA-based method whether predictions were for individual genotypes (58.92% accuracy) or for geno-groups (81.00% accuracy). However, when a locus with increased heterogeneity was tested (D18S51), PCA-based prediction accuracy rates improved to rates similar to those obtained using LDA (45.10% and 63.46%, respectively). This study provides foundational data documenting the performance of prediction modeling for STR genoty** based on qPCR-HRM data. In order to expand the forensic applicability of this HRM assay, the method could be tested with a more commonly utilized qPCR platform.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Over the past two decades, methods for forensic DNA analysis have greatly increased in efficiency, sensitivity, and accuracy. However, samples with a limited number of DNA copies continue to create challenges for forensic laboratories. The stochastic effects associated with low template (Lt) DNA, such as STR allele drop-out, allele drop-in, and inter-locus/intra-locus peak imbalances, can cause uncertainty in the interpretation process [1]. In addition to the difficulty of interpreting Lt DNA profiles, the presence of DNA from multiple contributors can further complicate the data, increasing uncertainty [2]. Adjustments to procedures that could compensate for these issues and potentially yield more discernable profiles are often not possible, as artifacts and other profile characteristics are not revealed until the end of the DNA workflow. In order to correct these issues, the analyst may need to return to the sample preparation, DNA extraction, or STR amplification stage, which is costly, time-consuming, and may not be possible if the sample was consumed in the initial testing. Thus, having more information earlier on in the analytical workflow would make better use of analyst time and maximize the utility of limited evidentiary samples.

Melt curve analysis utilizing qPCR instrumentation has long been explored as a potentially useful tool for a variety of forensic applications including mRNA analysis for body fluid identification, species identification, individualization of twins via methylation pattern analysis, Y STR screening, and human identification via single nucleotide polymorphism (SNP) analysis [3,4,5,6,7,8,9,10]. Recently, Torres et al. (2023) developed a high-resolution melt (HRM) assay that, when combined with the Quantifiler™ Trio kit (Thermo Fisher Scientific, Waltham, MA) components and machine learning algorithms, was capable of accurately predicting if a forensic sample is a single-source or mixed DNA sample for 79% of samples tested [10]. However, the assay was unable to predict the number of contributors (for mixture samples) nor was it able to determine the contributor genotypes [10, 11]. Introducing a detection assay to the quantification step that could identify contributor genotype and determine the number of contributors present would significantly strengthen this application. For example, having access to minimal STR genoty** information at the DNA quantification step would allow the examiner time to adjust the workflow prior to STR multiplex amplification. Further, this would allow them to provide early exclusionary information to investigators, if genotypes can be properly resolved.

Melt curve analysis of STR amplicons has been explored for rapid genoty** of single source DNA samples [12,13,14,15,16,17,18]. For example, one UK study explored amplification of the D18S51, TH01 and D8S1179 loci using HyBecon^® fluorescent probes and non-fluorescent blocker oligonucleotides to enhance melt curve analysis; this study successfully obtained partial STR profiles from buccal swabs [13]. Additionally, Kuehnert et al. developed an optimized procedure for the amplification of D5S818 and D18S51 with subsequent high-resolution melting (HRM) using a Rotor-Gene^® Q (QIAGEN, Hilden, Germany) and an intercalating dye (EvaGreen^®) [12]. While distinguishable peaks for each STR locus were observed from resulting melt curve data, genotypes were not consistently discernible [12]. Further, this assessment only analyzed 16–20 samples, all having one of only three closely related STR genotypes. Similarly, Nguyen et al. [14] developed a melt curve screening tool for forensically relevant samples utilizing mini-STR primers for CSF1PO and TH01 along with either Taqman^® chemistry or intercalating dyes (SYBR^® Green or LCGreen^® Plus). The study reported accurate STR allele determination from degraded and inhibited samples, but noted inconsistent reproducibility of the assay [14]. Unfortunately, none of these studies reported exploration of statistical software-based models, which could help improve accuracy and remove subjectivity from melt curve analysis.

The Qiagen Rotor-Gene^® ScreenClust HRM^® software (Qiagen) incorporates principle component analysis (PCA) as a way to group like-samples using melt curve data [12, 19, 20]. PCA is a correlational technique which transforms data into its main elements; from this transformation and reduction in dimension, a linear combination of variables can create new data. The newly created data can then be assessed for underlying patterns and variation [21]. Alternatively, linear discriminant analysis (LDA) is a classification algorithm that attempts to make a distinction between observations. LDA assesses the data provided and compares it to other previously classified data patterns by determining similarity based on variance between classes and within classes [12, 22, 23]. As of today, there is not a packaged software available for high resolution melt curve analysis which utilizes an LDA-type classification algorithm; however, previous work has generated code in R statistical software (©The R Foundation, Vienna, Austria) to meet this goal [12, 24]. Further exploration of HRM analysis for STR genoty** should include an analysis of a wide range of STR genotypes as well as a quantitative assessment of the melt curve data using statistical prediction modeling in order to determine if HRM could be used to provide reliable STR genoty** information for forensic investigations.

Methods and materials

Sample collection & initial DNA analysis

This study utilized previously collected DNA samples as well as buccal swab samples collected from volunteers in compliance with Virginia Commonwealth University Institutional Research Board protocol number HM20002931 and HM20006066. DNA from newly obtained samples was extracted using a QIAcube liquid handling robot (QIAGEN, Hilden, Germany) and the standard manufacturer’s Buccal Swab Spin QIAcube Protocol using QIAamp^® DNA Blood Mini kit reagents (Qiagen). Extracted samples were quantified using manufacturer’s protocol, but with half-volume reactions using the Investigator^® Quantiplex Kit (Qiagen) on the Rotor-Gene^® Q (Qiagen). Reference STR profiles for each sample were developed by amplifying 1ng of DNA extract with the AmpFLSTR^® Identifiler^® PCR amplification kit (Thermo Fisher Scientific, Waltham, MA) on the GeneAmp 9600 thermal cycler (PerkinElmer, Waltham, MA). The 15 µl reaction consisted of 5.7 µl of PCR Reaction mix, 2 µl of Primer set, 2.1 µl Tris-EDTA (TE), 0.2 µl of AmpliTaq Gold™ Polymerase (5U/µl) (Applied Biosystems, Waltham, MA), and 5 µl of template DNA. Thermal cycling conditions included a pre-denaturing step at 94 °C for 11 min, followed by 28 cycles of: denature 94 °C for 1 min, anneal 59 °C for 1 min, extension 72 °C for 1 min, and final post-extension step of 60 °C for 90 min. Amplified STR products were then separated and detected on the ABI PRISM^® 3130 Genetic Analyzer (Thermo Fisher Scientific) using a 36 cm capillary array with a 10s injection. Each reaction consisted of 0.1 µl of GeneScan™ 500-LIZ™ size standard (Thermo Fisher Scientific) and 12 µl of Hi-Di™ formamide (Thermo Fisher Scientific) diluent. The wells containing an allelic ladder received 1 µl of the ladder; 1.5 µl of amplified DNA was added to all other sample wells. The profiles were analyzed using GeneMapper ID™ software v4.1 (Thermo Fisher Scientific) with an analytical threshold of 75 relative fluorescent units (RFUs). The D5S818 and D18S51 genotypes were documented as the known reference genotypes for comparison in all studies detailed below. Ultimately, 311 samples were obtained and selected for this study. Samples selected were those that were available at the time of testing and had one of seven closely-related D5S818 genotypes [(10,11), (11,11), (11,12), (11,13), (12,12), (12,13), (13,13)] and/or one of six closely-related D18S51 genotypes [(12,14), (12,15), (12,16), (13,14), (13,16), (14,15)].

STR locus amplification & melt curve detection

Samples selected were amplified for each of two STR loci (D5S818 and D18S51) separately on the Rotor-Gene^® Q using the primer and amplification parameters previously established [12]. Each amplification reaction included 1X concentration of AmpliTaq Gold™ Buffer, 3mM MgCl₂, 250µM dNTPs, 1µM each of forward and reverse primer, 2U AmpliTaq Gold DNA polymerase, 1X concentration of EvaGreen^® intercalating dye (Biotium, Fremont, CA), and 250ng/µl of bovine serum albumin (BSA) in water. Two microliters of template DNA were added to each reaction for a total reaction volume of 40µl. Primer sequences for D5S818 were (F) 5’-GGGTGATTTTCCTCTTTGGT-3’ and (R) 5’-AACATTTGTATCTTTATCTGTATCCTTATTTAT-3’; primer sequences for D18S51 were (F) 5’-CAAACCCGACTACCAGCAAC-3’ and (R) 5’-GAGCCATGTTCATGCCACTG-3’. The amplification cycling for both primer sets consisted of an initial 10 min 95 °C denaturation followed by 45 cycles of: 95 °C denaturation for 5s, 56 °C annealing for 20s, and 65 °C elongation for 30s with fluorescence detection at the 65 °C elongation step in the standard green channel. A cycle of 72 °C for 2 min, 95 °C for 20s, 55 °C for 20s and 56 °C for 2 min followed to transition into the melt phase. The amplicons were melted by 0.1 °C incremental increases in temperature from 60 to 95 °C. Each incremental step was held for 2s with fluorescent detection throughout the melt using the high-resolution melt curve detection channel.

Genotype prediction analysis from HRM data

For PCA analysis, melt curve data generated from each sample at both STR loci were separately analyzed using the Rotor-Gene^® ScreenClust HRM^® software. For the D5S818 sample set, 56 samples were assigned as the training samples or “standards” based on their known genotypes; similarly, for the D18S51 sample set, 52 samples were assigned as the training samples or “standards”, with 7–10 samples per genotype for both loci. For each locus analyzed, all experimental samples were included as unknowns submitted for prediction analysis; the software placed each unknown into a genotype category based on highest probabilities given acceptable variability from the group mean. From the predicted clusters, confusion matrices were generated and then used to assess the software’s prediction accuracy (given as an overall percentage). From this, the percentage of misclassification for each genotype was determined and trends were identified. Geno-groups were formed based on these patterns of misclassification at each locus tested as well as the similarity of the genotype (and thus, amplicons produced). To subsequently evaluate the prediction accuracy using the identified geno-groups, the standard (training) samples were re-assigned in the software (as belonging to a geno-group, rather than a specific genotype), unknown samples were reanalyzed, and the newly predicted clusters were assessed for accuracy, as indicated above. Several different geno-grou** options were explored in order to determine the best option for the highest PCA-based prediction accuracy.

For LDA analysis, the melt curve data generated from each sample at both STR loci were separately analyzed using LDA code in R statistical software. The change in fluorescence (dF) with respect to temperature was exported, melt curves generated, and the primary peaks and shoulders were identified (Figs. 1 and 2). The data from each sample were then summarized into its primary peak and shoulder peak(s) temperatures along with their corresponding peak heights. For D5S818 samples, the peak/shoulder temperatures and peak heights for up to three observations were used; if only two peaks/shoulders were observed, the height at 64.95 °C was used as the third data point (required, as samples with disparate numbers of data points cannot be compared). No sample had fewer than two observed peaks. For D18S51 samples, the peak/shoulder temperatures and corresponding peak heights for four observations were used; if only three peaks/shoulders were observed, the peak height at 64.95 °C was used as the fourth data point. No sample had fewer than three observed peaks at this locus. For the D5S818 sample set, the same 56 samples used above were again assigned as the training samples for this analysis based on their known genotypes; similarly, for the D18S51 sample set, the same 52 samples used above were assigned as the training samples for this analysis. Code was generated in R statistical software so that the accuracy of LDA-based predictions could be calculated. Confusion matrices were generated and then used to assess the LDA prediction accuracy (given as an overall percentage). From this, the percentage of misclassification for each genotype was determined, trends were identified, and geno-grou** options were created. Geno-groups were formed, training samples were reassigned, and unknown samples reanalyzed, as described above. Several different geno-grou** options were explored in order to determine the best option for the highest LDA-based prediction accuracy.

Results and discussion

D5S818

When using the Rotor-Gene^® Q ScreenClust HRM^® software to predict D5S818 genotypes from HRM data using a PCA approach, samples were classified correctly only 23.77% of the time (Table 1). Overall, samples with known homozygous genotypes were more likely to classify accurately (39.58%) than samples whose known genotypes were heterozygous (20.18%). Most often, misclassified homozygous samples were predicted as having another homozygous genotype, whereas heterozygous sample misclassifications were more evenly split among homozygous and heterozygous genotypes. Conversely, when using LDA in R statistical software to predict D5S818 genotypes from the HRM data, samples were classified correctly at a rate of 58.92% (Table 2), which is substantially higher than the PCA-based model and the random chance rate of 14.29% (one in seven). As with the PCA method, samples with known homozygous genotypes were more likely to classify accurately (65.08%) than samples whose known genotypes were heterozygous (55.74%).

Table 1 Classification of D5S818 genotypes using HRM data and the PCA-based Rotor-Gene^® Q ScreenClust HRM^® software

Full size table

Table 2 Classification of D5S818 genotypes using HRM data and LDA analysis using R statistical software

Full size table

In an attempt to increase prediction accuracies, 13 different geno-grou**s were created (based on the above trends and misclassification rates) and tested. As expected, geno-grou** improved classification accuracies, regardless of which geno-grou** option was used or algorithm tested (data not shown). The three geno-grou** options that produced the highest prediction accuracies for each prediction model used in this study were assessed using the converse method to allow for direct comparison (Table 3). Geno-group option A provided the highest LDA-based and highest overall prediction accuracy (81.0%). Alternately, the highest prediction accuracy achieved with the PCA-based method was 46.6% (option F). Regardless of geno-grou** option tested, LDA-based prediction modeling provided higher geno-grou** prediction accuracies. This result may be due to the fact that LDA aims to maximize the separation amongst classes, in order to heighten class discrimination.

Table 3 Genotype prediction accuracy rates for top performing D5S818 geno-grou**s obtained using two prediction models (PCA and LDA)

Full size table

D18S51

In order to determine if a more polymorphic STR locus would be better for genotype predictions using the two selected models, the testing was repeated using primers targeting the D18S51 locus. With more common genotypes known and higher levels of heterozygosity reported for this locus [15], one may expect the melt curve resolution, and thus genoty** predictions, to improve. When using the PCA-based Rotor-Gene^® Q ScreenClust HRM^® software to predict heterozygous D18S51 genotypes from HRM data, samples were classified correctly 40.38% of the time (Table 4). Of the 62 samples that misclassified, 40.32% had one allele predicted accurately with the second allele off by only one-repeat unit (one allele value). Additionally, the samples expected to produce heterozygous amplicons with the greatest difference in base pair length (those with 12,16 genotypes) were the most likely to be classified correctly (57.14%). This result is not surprising as the amplicons with the greatest difference in base pair length correspondingly have the greatest difference in melting rates thus producing visually distinct melt curves when compared amongst melt curves with amplicons that are close in base pair length and thus have similar melting rates. When LDA was used to predict D18S51 genotypes from the HRM data, samples were classified at a rate similar to the PCA-based method (45.10%, Table 5). However, unlike the PCA model for D18S51, the data obtained using the LDA approach showed no discernible trends when misclassifications were closely examined.

Table 4 Classification of D18S51 genotypes using HRM data and the PCA-based Rotor-Gene^® Q ScreenClust HRM^® software

Full size table

Table 5 Classification of D18S51 genotypes using HRM data and LDA analysis using R statistical software

Full size table

As with the D5S818 locus, D18S51 geno-grou** options were created based on observed trends and classification rates; eight different geno-grou**s were tested using both prediction models. As described above for D5S818, geno-grou** improved classification accuracies when the PCA algorithm was used; for LDA, however, only half of the geno-grou** options tested resulted in favorable increases in prediction accuracies (data not shown). The three geno-grou** options that produced the highest prediction accuracies for each prediction model used in this study were assessed using the converse method to allow for direct comparison (Table 6). Geno-group option E provided the highest PCA-based and highest overall prediction accuracy (65.4%). The highest prediction accuracy achieved with the LDA-based method was very similar (63.5%, option G). This study suggests that PCA-based methods may work better for predicting genotypes of loci that have increased allele diversity, such as that observed with D18S51.

Table 6 Genotype prediction accuracy rates for top performing D18S51 geno-grou**s obtained using two prediction models (PCA and LDA)

Full size table

Conclusion

This study evaluated the use of PCA- and LDA-based prediction modeling tools for their ability to distinguish between genotypes of two STR loci using HRM data obtained from the Qiagen Rotor-Gene^® Q qPCR platform. When assessing the D5S818 locus, the LDA model substantially outperformed the PCA model for predicting genotypes. This trend held true when like-genotypes were grouped together for prediction analysis into geno-groups with prediction accuracies exceeding 80%. However, when assessing a more polymorphic STR locus (D18S51) with a more heterogeneous sample set, the differences in prediction accuracies between the models tested were far less pronounced suggesting that the LDA-based method may work better for predicting homozygous genotypes. Regardless of method or locus tested, placing samples with closely aligned genotypes into geno-groups for classification results in improved prediction modeling, but fewer classification options would limit the forensic utility of an HRM-based assay, as DNA from different contributors will be less likely to be individualized. Ultimately, the data from this study suggests that the best prediction model for STR genoty** may differ from locus-to-locus, depending on the nature and complexity of the STR locus tested. Further, the inclusion of additional heterozygous genotypes in the training sets used to train the software may improve overall prediction rates, regardless of the testing model employed.

Considering additional factors may be helpful when selecting a prediction model to use for genoty** using HRM data. For example, the PCA-based ScreenClust HRM^® software is commercially available, requires no coding, and is easy to use. However, the software is proprietary and the principal components it utilizes for analysis are unknown. Further, the ScreenClust HRM^® software requires that all known (training) standards be run on the instrument at the same time as tested unknown samples to provide the most accurate clustering. This would be impractical for wholescale use in forensic settings and becomes highly impractical when assessing loci with large repeat ranges and many common genotypes. Alternatively, R statistical software is free and training set data is stored for use of classification of unknown samples subsequently and independently tested. However, it requires some initial programming and forensic implementation would require the development of a more user-friendly interface.

In conclusion, this study provides foundational data documenting the performance of prediction modeling for STR genoty** based on HRM data. In order to expand the forensic applicability of the HRM assay described herein, it may be useful to test it using more commonly utilized qPCR platforms, such as Thermo Fisher’s QuantStudio™, and potentially incorporate it into the previously described mixture detection assay [10]. Further, exploring other prediction models that use similar classification schemes to those used in this study but are designed to classify larger data sets (e.g., comprehensive melt curve data), such as support vector machines (SVM), may prove useful [25,26,27].

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Gill P, Haned H, Bleka O et al (2015) Genoty** and interpretation of STR-DNA: Low-Template, mixtures and Database matches - Twenty years of Research and Development. Forensic Sci Int Genet 18:100–117. https://doi.org/10.1016/j.fsigen.2015.03.014
Article PubMed Google Scholar
Dror IE, Hampikian G (2011) Subjectivity and Bias in forensic DNA mixture interpretation. Sci Justice 51:204–208. https://doi.org/10.1016/j.scijus.2011.08.004
Article PubMed Google Scholar
Alghanim H, Balamurugan K, McCord B (2020) Development of DNA methylation markers for sperm, saliva and blood identification using pyrosequencing and qPCR/HRM. Anal Biochem 611:113933. https://doi.org/10.1016/j.ab.2020.113933
Article PubMed Google Scholar
Osathanunkul M, Sawongta N, Sathirapongsasuti N et al (2022) Distinguishing venomous jellyfish species via high resolution melting analysis. Front Mar Sci 9:1–10. https://doi.org/10.3389/fmars.2022.1019473
Article Google Scholar
Marqueta-Gracia JJ, Álvarez-Álvarez M, Baeta M et al (2018) Differentially methylated CpG regions analyzed by PCR-high resolution melting for monozygotic twin pair discrimination. Forensic Sci Int Genet 37:e1–e5. https://doi.org/10.1016/j.fsigen.2018.08.013
Article PubMed Google Scholar
dos Santos Rocha A, de Amorim ISS, de Simão T A, et al (2018) High-resolution melting (HRM) of Hypervariable mitochondrial DNA regions for forensic science. J Forensic Sci 63:536–540. https://doi.org/10.1111/1556-4029.13552
Article PubMed Google Scholar
Deng JQ, Liu BQ, Wang Y et al (2016) Y-STR genetic screening by high-resolution melting analysis. Genet Mol Res 15. https://doi.org/10.4238/gmr.15017266
Martino A, Mancuso T, Rossi AM (2010) Application of high-resolution melting to Large-Scale, high-throughput SNP genoty**: a comparison with the TaqMan^® Method. J Biomol Screen 15:623–629. https://doi.org/10.1177/1087057110365900
Article PubMed Google Scholar
Mehta B, Daniel R, McNevin D (2013) High resolution melting (HRM) of forensically informative SNPs. Forensic Sci Int Genet Suppl Ser 4:e376–e377. https://doi.org/10.1016/j.fsigss.2013.10.191
Article Google Scholar
Torres D, Smith C, Williams AL et al (2023) A Quantifiler™ Trio-based HRM screening assay for the accurate prediction of single source versus mixed biological samples. Int J Legal Med 137:1639–1651. https://doi.org/10.1007/s00414-023-03070-6
Article PubMed Google Scholar
Torres DA (2022) A Quantifiler™ Trio-based HRM mixture screening assay for the QuantStudio™ 6 flex qPCR platform. Virginia Commonwealth University
Kuehnert K (2015) Development of a STR genoty** screening assay using high resolution melting curve analysis of the STR loci D5S818 and D18S51. Virginia Commonwealth University
French DJ, Howard RL, Gale N et al (2008) Interrogation of short Tandem repeats using fluorescent probes and melting curve analysis: a step towards Rapid DNA Identity Screening. Forensic Sci Int Genet 2:333–339. https://doi.org/10.1016/j.fsigen.2008.04.007
Article PubMed Google Scholar
Nguyen Q, Mckinney J, Johnson DJ et al (2012) STR Melting Curve Analysis as a genetic Screening Tool for Crime Scene Samples*. J Forensic Sci 57:887–899. https://doi.org/10.1111/j.1556-4029.2012.02106.x
Article PubMed Google Scholar
Nicklas JA, Noreault-Conti T, Buel E (2012) Development of a fast, simple profiling method for Sample Screening using high Resolution Melting (HRM) of STRs. J Forensic Sci 57:478–488. https://doi.org/10.1111/j.1556-4029.2011.01981.x
Article PubMed Google Scholar
Jiang E, Yu P, Zhang S et al (2017) Establishment of an alternative efficiently genoty** strategy for human ABO gene. Leg Med 29:72–76. https://doi.org/10.1016/j.legalmed.2017.10.015
Article Google Scholar
Venables SJ, Mehta B, Daniel R et al (2014) Assessment of high resolution melting analysis as a potential SNP genoty** technique in forensic casework. Electrophoresis 35:3036–3043. https://doi.org/10.1002/elps.201400089
Article PubMed Google Scholar
Mehta B, Daniel R, McNevin D (2017) HRM and SNaPshot as alternative forensic SNP genoty** methods. Forensic Sci Med Pathol 13:293–301. https://doi.org/10.1007/s12024-017-9874-5
Article PubMed Google Scholar
Qiagen (2009) Rotor-Gene^® ScreenClust HRM^® Software User Guide
Reja V, Kwok A, Stone G et al (2010) ScreenClust: Advanced statistical software for supervised and unsupervised high resolution melting (HRM) analysis. Methods 50:S10–S14. https://doi.org/10.1016/j.ymeth.2010.02.006
Article PubMed Google Scholar
Abdi H, Williams LJ (2010) Principal component analysis. Wiley Interdiscip Rev Comput Stat 2:433–459. https://doi.org/10.1002/wics.101
Article Google Scholar
Balakrishnama S, Ganapathiraju A (1998) Linear Discriminant Analysis - a brief Tutorial. Institute for Signal and Information Processing
Suchismita Goswami∗, Edward J, Wegman (2016) Comparison of different classification methods on Glass Identification for Forensic Research. J Stat Sci Appl 4:65–84. https://doi.org/10.17265/2328-224x/2015.0304.001
Article Google Scholar
R Foundation for Statistical Computing: R Core Team (2015) R: a language and. Environment for Statistical Computing
Tian Y, Qi Z, Ju X et al (2013) Nonparallel Support Vector machines for Pattern classification. IEEE Trans Cybern 44:1067–1079. https://doi.org/10.1109/tcyb.2013.2279167
Article PubMed Google Scholar
Rebentrost P, Mohseni M, Lloyd S (2014) Quantum support vector machine for big data classification. Phys Rev Lett 113:1–5. https://doi.org/10.1103/PhysRevLett.113.130503
Article Google Scholar
Ozkok FO, Celik M (2022) A hybrid CNN-LSTM model for high resolution melting curve classification. Biomed Signal Process Control 71:103168. https://doi.org/10.1016/j.bspc.2021.103168
Article Google Scholar

Download references

Acknowledgements

National Institute of Justice, Grant Award No: 2015-MU-MU-K026.

Funding

The research leading to these results received funding from the NIJ under Grant Agreement No 2015-MU-MU-K026. This work was supported by NIJ (2015-MU-MU-K026).

Author information

Authors and Affiliations

Department of Forensic Science, Virginia Commonwealth University, 1015 Floyd Avenue, PO Box 843079, Richmond, VA , 23284, USA
Darianne C. Cloudy, Kristi Kuehnert, Chastyn Smith, Jordan O. Cox, Sarah J. Seashols-Williams & Tracey Dawson Green
Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, 1015 Floyd Avenue, PO Box 843079, Richmond, VA, 23284, USA
Edward L. Boone

Authors

Darianne C. Cloudy
View author publications
You can also search for this author in PubMed Google Scholar
Edward L. Boone
View author publications
You can also search for this author in PubMed Google Scholar
Kristi Kuehnert
View author publications
You can also search for this author in PubMed Google Scholar
Chastyn Smith
View author publications
You can also search for this author in PubMed Google Scholar
Jordan O. Cox
View author publications
You can also search for this author in PubMed Google Scholar
Sarah J. Seashols-Williams
View author publications
You can also search for this author in PubMed Google Scholar
Tracey Dawson Green
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Darianne Cloudy and Edward Boone. The first draft of the manuscript was written by Darianne Cloudy and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Chastyn Smith.

Ethics declarations

Ethical approval

This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Institutional Review Board (IRB) of Virginia Commonwealth University (VCU) protocol (HM20002931 and HM20006066) on 06/23/2016.

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Competing interests

• Employment: Not applicable.

• Financial Interests: Not applicable.

• Non-financial interests: Not applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cloudy, D.C., Boone, E.L., Kuehnert, K. et al. Statistical methods for discrimination of STR genotypes using high resolution melt curve data. Int J Legal Med (2024). https://doi.org/10.1007/s00414-024-03289-x

Download citation

Received: 14 February 2024
Accepted: 03 July 2024
Published: 13 July 2024
DOI: https://doi.org/10.1007/s00414-024-03289-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Statistical methods for discrimination of STR genotypes using high resolution melt curve data

Abstract

Introduction

Methods and materials

Sample collection & initial DNA analysis

STR locus amplification & melt curve detection

Genotype prediction analysis from HRM data

Results and discussion

D5S818

D18S51

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval

Consent to participate

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation