Advertisement
Research Article| Volume 8, P173-175, December 2022

Y-chromosomal kinship estimation for forensic familial searching: YMrCA to the rescue

  • Sofie Claerhout
    Correspondence
    Corresponding author at: KU Leuven, Forensic Biomedical Sciences, Department of Imaging & Pathology, Leuven, Belgium.
    Affiliations
    KU Leuven, Forensic Biomedical Sciences, Department of Imaging & Pathology, Leuven, Belgium

    KU Leuven Kulak, Interdisciplinary Research Facility, Kortrijk, Belgium
    Search for articles by this author
  • Simon Vanpaemel
    Affiliations
    KU Leuven, Department of Mechanical Engineering, Noise and Vibration Engineering, Heverlee, Belgium
    Search for articles by this author
  • Mandev S. Gill
    Affiliations
    KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory of Evolutionary and Computational Virology, Leuven, Belgium
    Search for articles by this author
  • Guy Baele
    Affiliations
    KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory of Evolutionary and Computational Virology, Leuven, Belgium
    Search for articles by this author
  • Ronny Decorte
    Affiliations
    KU Leuven, Forensic Biomedical Sciences, Department of Imaging & Pathology, Leuven, Belgium

    UZ Leuven, Laboratory of Forensic Genetics and Molecular Archaeology, Leuven, Belgium
    Search for articles by this author
Published:October 18, 2022DOI:https://doi.org/10.1016/j.fsigss.2022.10.025

      Abstract

      The Y-chromosome can be used as an identification method to find paternally related males of the perpetrator. When a close Y-haplotype match is identified, the time to their most recent common ancestor (tMRCA) needs to be estimated to reconstruct their genealogy. To date, two mutation models and three online tMRCA calculators exist. But, they do not include individual mutation rates with multi-step changes, while ignoring hidden multiple, back or parallel modifications. To improve tMRCA estimation, we developed a user-friendly calculator, the ‘YMrCA’, including all previously mentioned mutation characteristics. Here, a case using genealogical pairs with confirmed biological kinships visualizes the good estimation performance of the YMrCA compared to the state-of-the-art. Even when genealogical pairs have equal number of mutations, the YMrCA still estimates the correct number of generations due to the inclusion of individual Y-STR mutation rates and the different mutational influencing factors.

      Keywords

      1. Introduction

      Investigative Genetic Genealogy (IGG) is an identification method in forensic case work where instead of finding the donor of a trace, a relative of the donor is used to unravel the identity of the unknown perpetrator. As autosomal DNA kinships fade away over generations due to recombination events, Y-chromosome analysis provides the opportunity to identify distant kinships as 95% of the chromosome lacks recombination. A close or distant biological relative can be verified using (rapidly mutating) short tandem repeats or (RM) Y-STRs, which identifies familial Y-haplotypes [
      • Kayser M.
      Forensic use of Y-chromosome DNA: a general overview.
      ]. Through Y-haplotype comparison, the time to their most recent common ancestor (tMRCA) needs to be calculated. The more accurate the number of generations is estimated, the easier it becomes to reconstruct the genealogy. In a previous study, we investigated the two state-of-the-art mutation models (SMM: stepwise mutation model; IAM: infinite alleles model) and three online tMRCA calculators (McDonald, Walker and McGee) [
      • Claerhout S.
      • Defraye C.
      • Decorte R.
      Validation of Y-ancestor time calculators for forensic familial searching.
      ]. After running our 1120 biologically related genealogical pairs, however, consistent under- and overestimation and broad confidence intervals were observed, leading to dubious tMRCA estimates. This mainly because they do not include individual mutation rates with multi-step changes, while ignoring hidden multiple, back or parallel modifications. To improve tMRCA estimation, we developed a user-friendly calculator, the ‘YMrCA’, including all previously mentioned mutation characteristics [
      • Claerhout S.
      • Vanpaemel S.
      • Gill M.S.
      • et al.
      YMrCA: Improving Y-chromosomal ancestor time estimation for DNA kinship research.
      ]. In this study, a specific report case using genealogical pairs with confirmed biological kinships are presented to visualize the performance of the YMrCA compared to the state-of-the-art.

      2. Materials and methods

      2.1 Database

      Our database includes 2360 males sampled to investigate extrapair paternity rates, haplogroup specific Y-STR mutation rates and parallel Y-STR evolution [
      • Larmuseau M.H.D.
      • Claerhout S.
      • Gruyters L.
      • et al.
      Genetic-genealogy approach reveals low rate of extrapair paternity in historical Dutch populations.
      ,
      • Claerhout S.
      • Vandenbosch M.
      • Nivelle K.
      • et al.
      Determining Y-STR mutation rates in deep-routing genealogies: identification of haplogroup differences.
      ,
      • Claerhout S.
      • Van Der Haegen M.
      • Vangeel L.
      • et al.
      A game of hide and seq: Identification of parallel Y-STR evolution in deep-rooting pedigrees.
      ]. Permission for DNA analysis and publication was granted by informed consents and approved by the Ethical Commission of University Hospital Leuven (S54010, S55864, S59085). DNA samples were collected, extracted and genotyped for 42 Y-STRs as described in Claerhout et al. [
      • Claerhout S.
      • Vandenbosch M.
      • Nivelle K.
      • et al.
      Determining Y-STR mutation rates in deep-routing genealogies: identification of haplogroup differences.
      ]. Data is available through YHRD (https://yhrd.org), under accession numbers YA003651–53, YA003739–42 and YA004300–01.

      2.2 tMRCA estimation

      The R script from Boattini et al. for the stepwise mutation model (SMM) and infinite allele model (IAM) was implemented [
      • Boattini A.
      • Sarno S.
      • Mazzarisi A.M.
      • et al.
      Estimating Y-Str mutation rates and tmrca through deep-rooting italian pedigrees.
      ]. Additionally, the three existing public domain online calculators based on the standard infinite alleles model (IAM) were tested: McDonald [

      J.D. McDonald, online tMRCA Calculator, 2014. 〈http://faculty.scs.illinois.edu/∼mcdonald/tmrca.htm〉.

      ], Walker [

      M. Walker, online tMRCA Calculator, (n.d.). 〈http://www.moseswalker.com/mrca/calculator.asp?q=1〉, (Accessed 25 June 2019).

      ] and McGee [

      D. McGee, Y-Utility: Y-DNA Comparison Utility, FTDNA Mode 111 Marker, 2008. 〈http://www.mymcgee.com/tools/yutility111.html〉.

      ]. McDonald input requires three variables: number of Y-STR differences (non-matching alleles), number of analyzed Y-STRs and the average Y-haplotype mutation rate. The obtained output is a probability distribution and a cumulative probability visualized in a list or graph for the number of generations to the MRCA. Walker is a simplistic version of the McDonald calculator whereby only two variables are needed. The McGee calculator is based on their Y-chromosome database, named Y-Utility [

      D. McGee, Y-Utility: Y-DNA Comparison Utility, FTDNA Mode 111 Marker, 2008. 〈http://www.mymcgee.com/tools/yutility111.html〉.

      ]. Input required are at least two Y-haplotypes with their alleles listed in order of appearance on the platform. The YMrCA is our newly developed, open access calculator in which detailed information is provided in [
      • Claerhout S.
      • Vanpaemel S.
      • Gill M.S.
      • et al.
      YMrCA: Improving Y-chromosomal ancestor time estimation for DNA kinship research.
      ]. tMRCA estimation relies on two input data files, one including the Y-haplotypes for which the degree of relatedness needs to be calculated, and a file comprising all Y-marker characteristics.

      3. Results and discussion

      Two unique genealogical pairs with both four Y-STR changes on 46 Y-STR haplotype, but separated by different number of generations, were used to estimate their kinship. Pair A is separated by 10 generations and has three one-step changes (DYS499, DYS481 and DYS627) and one two-step change (DYS459). Pair B is separated by 20 generations having four 1-step differences (DYS385, DYS456, DYS518 and DYS627).
      Fig. 1 provides an overview of the results below. The SMM estimated for couple A 16 generations and for B 13 generations. This difference is because the model counted five mutation events for couple A due to its two-step mutation event. The IAM estimated for both couples 13 generations as they have equal number of Y-STR differences. This revealed a generation deviation for couple A of 6 generations (SMM) and 3 generations (IAM), while couple B resulted for both models in a generation deviation error of 7 generations. For the online tMRCA calculators, both the McDonald as Walker calculator estimated 16 generations for couple A and B with a 95% confidence interval of 3–40 generations. This resulted in a generation error of 6 generations for couple A and 4 generations for couple B. Without the inclusion of exact Y-STR allele calls or individual mutation rates, these calculators estimate equal probability curves for genealogical pairs encountering equal number of Y-STR differences. The McGee calculator was, despite the more advanced input of allele calls and their use of individual mutation rates, very discordant with reality. McGee estimated for couple A 36 generation (error +26) and for couple B 29 generations (error +9). This calculator did not provide probability curves but stated that the probability is 95% that the tMRCA is no longer than the indicated generation output. So in general, the state-of-the-art calculators estimated broad confidence intervals and high generation deviations, decreasing the tMRCA estimation accuracy which is disadvantageous for pedigree reconstruction and kinship analysis.
      Fig. 1
      Fig. 1Results for the two genealogical pairs with equal number of Y-STR modifications but different generations. (left) Couple A has 10 generations and couple B 20. The relative generation error is provided for the two mutation models, three calculators and YMrCA (right) Probability curves for McDonald versus YMrCA, which follows reality.
      The YMrCA calculator, on the other hand, resulted in exact the same number of generations for the two couples with a significantly smaller 95% confidence interval for couple A (Fig. 1, right). The adaptation of the input Y-STR matrices for the YMrCA calculator with these haplogroup-specific mutation rates increased estimation probabilities for the expected tMRCA and decreased both generation estimation errors towards a more truthful tMRCA prediction. To conclude, the overall high success rate of the YMrCA and its low deviation error constitute promising results compared to both the basic SMM and IAM, as to the already existing online tMRCA calculators. Additionally, the incorporation of individual Y-STR mutation rates and even haplogroup-specific ones into the YMrCA provides improved tMRCA output information for genealogical pairs having equal number of Y-STR differences.

      4. Conclusion

      Here we demonstrate that the mutation models and online calculators are not appropriate for use in forensic familial searching due to their large 95% CI, making predictions too difficult for pedigree reconstructions, and the consistent tMRCA under- and overestimations, leading to implausible and dubious probability scores. Our newly developed YMrCA demonstrated promising performance, with a significantly higher tMRCA estimation success rate following reality due to the inclusion of individual Y-STR mutation rates and characteristics.

      Conflict of interest statement

      None.

      Acknowledgments

      We thank all DNA donors for their participation in our genetic-genealogy project. Funding was provided by FWO (SC: 1265622N).

      References

        • Kayser M.
        Forensic use of Y-chromosome DNA: a general overview.
        Hum. Genet. 2017; 136: 621-635https://doi.org/10.1007/s00439-017-1776-9
        • Claerhout S.
        • Defraye C.
        • Decorte R.
        Validation of Y-ancestor time calculators for forensic familial searching.
        FSI: Genet. Suppl. 2019; 7https://doi.org/10.1016/j.fsigss.2019.10.032
        • Claerhout S.
        • Vanpaemel S.
        • Gill M.S.
        • et al.
        YMrCA: Improving Y-chromosomal ancestor time estimation for DNA kinship research.
        Hum. Mut. 2021; 42: 1307-1320https://doi.org/10.1002/humu.24259
        • Larmuseau M.H.D.
        • Claerhout S.
        • Gruyters L.
        • et al.
        Genetic-genealogy approach reveals low rate of extrapair paternity in historical Dutch populations.
        AJHB. 2017; 29: 1-9https://doi.org/10.1002/ajhb.23046
        • Claerhout S.
        • Vandenbosch M.
        • Nivelle K.
        • et al.
        Determining Y-STR mutation rates in deep-routing genealogies: identification of haplogroup differences.
        FSI Genet. 2018; 34: 1-10https://doi.org/10.1016/j.fsigen.2018.01.005
        • Claerhout S.
        • Van Der Haegen M.
        • Vangeel L.
        • et al.
        A game of hide and seq: Identification of parallel Y-STR evolution in deep-rooting pedigrees.
        EJHG. 2018; 27: 637-646https://doi.org/10.1038/s41431-018-0312-2
        • Boattini A.
        • Sarno S.
        • Mazzarisi A.M.
        • et al.
        Estimating Y-Str mutation rates and tmrca through deep-rooting italian pedigrees.
        Sci. Rep. 2019; 9: 1-12https://doi.org/10.1038/s41598-019-45398-3
      1. J.D. McDonald, online tMRCA Calculator, 2014. 〈http://faculty.scs.illinois.edu/∼mcdonald/tmrca.htm〉.

      2. M. Walker, online tMRCA Calculator, (n.d.). 〈http://www.moseswalker.com/mrca/calculator.asp?q=1〉, (Accessed 25 June 2019).

      3. D. McGee, Y-Utility: Y-DNA Comparison Utility, FTDNA Mode 111 Marker, 2008. 〈http://www.mymcgee.com/tools/yutility111.html〉.