TY - JOUR
T1 - A novel mathematical framework for pedigree-based calculation of Y-STR match probabilities
AU - Caliebe, Amke
AU - Zandstra, Dion
AU - Ralf, Arwin
AU - Kayser, Manfred
AU - Krawczak, Michael
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/4/26
Y1 - 2025/4/26
N2 - Y-chromosomal short tandem repeat (Y-STR) markers are routinely used in forensic casework to identify male donors of biological traces left at crime scenes, particularly in sexual assault cases. However, the evidential value of a match between the Y-STR profile of a trace and a potential donor, usually a crime suspect, is difficult to quantify, and the common albeit inappropriate practise to equate Y-STR match probabilities with Y-STR profile frequencies estimated from population databases has been subject to scientific debate for decades. As a solution to this long-standing problem, we suggest an alternative approach to the calculation of Y-STR match probabilities that involves splitting the group of potential donors other than the suspect into two: (i) his close male relatives (termed his ‘pedigree’) and (ii) all other males. While an upper limit to the match probability is easily calculated for the second group, it is computationally challenging to derive for the first. We therefore developed a mathematical framework that uses importance sampling to reconstruct and evaluate the Y-STR profiles of untyped members of the suspect’s pedigree by way of simulation. Extensive testing with elementary pedigrees of different structure and complexity confirmed that both, the framework and its Python-based software implementation yield match probability estimates that approximate well the correct analytical results, depending upon the number of simulations performed. Our methodology thus facilitates a more appropriate and valid solution to the long-standing problem of interpreting Y-STR profile matches in forensic casework.
AB - Y-chromosomal short tandem repeat (Y-STR) markers are routinely used in forensic casework to identify male donors of biological traces left at crime scenes, particularly in sexual assault cases. However, the evidential value of a match between the Y-STR profile of a trace and a potential donor, usually a crime suspect, is difficult to quantify, and the common albeit inappropriate practise to equate Y-STR match probabilities with Y-STR profile frequencies estimated from population databases has been subject to scientific debate for decades. As a solution to this long-standing problem, we suggest an alternative approach to the calculation of Y-STR match probabilities that involves splitting the group of potential donors other than the suspect into two: (i) his close male relatives (termed his ‘pedigree’) and (ii) all other males. While an upper limit to the match probability is easily calculated for the second group, it is computationally challenging to derive for the first. We therefore developed a mathematical framework that uses importance sampling to reconstruct and evaluate the Y-STR profiles of untyped members of the suspect’s pedigree by way of simulation. Extensive testing with elementary pedigrees of different structure and complexity confirmed that both, the framework and its Python-based software implementation yield match probability estimates that approximate well the correct analytical results, depending upon the number of simulations performed. Our methodology thus facilitates a more appropriate and valid solution to the long-standing problem of interpreting Y-STR profile matches in forensic casework.
UR - http://www.scopus.com/inward/record.url?scp=105003696324&partnerID=8YFLogxK
U2 - 10.1038/s41598-025-98644-2
DO - 10.1038/s41598-025-98644-2
M3 - Article
C2 - 40287458
AN - SCOPUS:105003696324
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 14651
ER -