Advertisement
Research Article| Volume 8, P143-144, December 2022

An ambiguous sequence-based allele of SE33

  • Tikumphorn Sathirapatya
    Correspondence
    Correspondence to: 1873 Forensic Genetics Research Unit, Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok 10300, Thailand.
    Affiliations
    Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand

    Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University, Bangkok Thailand
    Search for articles by this author
  • Hasnee Noh
    Affiliations
    Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand

    Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University, Bangkok Thailand
    Search for articles by this author
  • Poonyapat Sukawutthiya
    Affiliations
    Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand

    Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University, Bangkok Thailand
    Search for articles by this author
  • Wikanda Worrapitirungsi
    Affiliations
    Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand

    Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University, Bangkok Thailand
    Search for articles by this author
  • Kornkiat Vongpaisarnsin
    Affiliations
    Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand

    Forensic Serology and DNA, King Chulalongkorn Memorial Hospital and Thai Red Cross Society, Bangkok, Thailand

    Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University, Bangkok Thailand
    Search for articles by this author
Published:October 07, 2022DOI:https://doi.org/10.1016/j.fsigss.2022.10.013

      Abstract

      SE33 was a well-known autosomal short tandem repeat (STR) marker that was high polymorphic and therefore was high discrimination power. The sequence structure of STR markers has been increasingly explored with next-generation sequencing (NGS) technology. The sequencing resulted in the development of a new locus designation and allele nomenclature that was also backward compatible with the conventional capillary electrophoresis. SE33 was one of the STR markers that had been coamplified by Forenseq™ Signature Prep Kit (Verogen) but were not analyzed and illustrated in the Universal Analysis Software (UAS) (Verogen). This study reported an ambiguous sequence-based allele 16.3 of the SE33 locus. This allele was observed while analyzed by STRait Razor 3.0. The configuration file was modified from the previous studies to include 15 bp of 5′ flanking region and 24 bp of 3′ flanking region. The ambiguous allele was called 16.3 (106 bp) with a read count of 2070. However, the sequence of the repeat region cannot be designated as allele 16.3. Several possible scenarios for allele designation were presented and discussed.

      Keywords

      1. Introduction

      The upcoming next-generation sequencing (NGS) technology has revolutionized forensic DNA analysis. However, the good advantages of this technology came with the challenge of interpreting NGS data. As the NGS produced more complex data than the conventional fragment analysis, a new locus designation and allele nomenclature have been established [
      • Parson W.
      • Ballard D.
      • Budowle B.
      • Butler J.M.
      • Gettings K.B.
      • Gill P.
      • et al.
      Massively parallel sequencing of forensic STRs: considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.
      ] to support the interpretation of NGS data for the community and to be backward compatible with the original capillary electrophoresis. These interpretation guidelines and STR sequence structure catalogs worked efficiently with all the STR markers. However, some markers had reported the discrepancy between NGS and capillary electrophoresis (CE) that stemmed from sequence variation [
      • Hölzl-Müller P.
      • Bodner M.
      • Berger B.
      • Parson W.
      Exploring STR sequencing for forensic DNA intelligence databasing using the Austrian National DNA Database as an example.
      ].
      SE33 was the most polymorphic short tandem repeat (STR) marker. For NGS technology, this marker was coamplified with Forenseq™ Signature Prep Kit (Verogen) but was not analyzed by Universal Analysis Software (Verogen) [
      • Novroski N.M.M.
      • King J.L.
      • Churchill J.D.
      • Seah L.H.
      • Budowle B.
      Characterization of genetic sequence variation of 58 STR loci in four major population groups.
      ]. The flanking region sequence was essentially included with reported sequences since there were variations in these regions that potentially led to discordance results with CE or length-based data [
      • Gettings K.B.
      • Aponte R.A.
      • Vallone P.M.
      • Butler J.M.
      STR allele sequence variation: current knowledge and future issues.
      ]. In 2018, Borsuk and colleagues [
      • Borsuk L.A.
      • Gettings K.B.
      • Steffen C.R.
      • Kiesler K.M.
      • Vallone P.M.
      Sequence-based US population data for the SE33 locus.
      ] established a motif ID for the common motif pattern of SE33 in a large group of populations. The common motif pattern included 15 bp of 5′ flanking region, repeat region and 24 bp of 3′ flanking region that covered all observed variants. The larger allele displayed a more complex microvariant allele (x.2) with the interruption of TT or CT in the repeat region.
      In this study, we demonstrated an ambiguous sequence-based allele of SE33. An allele number of 16.3 was reported in one sample by STRait Razor 3.0 [
      • Woerner A.E.
      • King J.L.
      • Budowle B.
      Fast STR allele identification with STRait Razor 3.0.
      ]. However, the repeat region sequence cannot be designated as allele 16.3. Therefore, we investigated further and compared this ambiguous sequence with other previous studies.

      2. Materials and methods

      2.1 Samples and sequencing

      One punch of 1.2 mm of blood FTA card sample was prepared along with other samples using primer mix B (DPMB) of the Forenseq DNA Signature Prep Kit (Verogen) and sequenced on Miseq FGx sequencer (Verogen) according to the manufacturer’s instruction. One sample was considered in this study.

      2.2 Data analysis

      SE33 was examined by STRait Razor 3.0 [
      • Woerner A.E.
      • King J.L.
      • Budowle B.
      Fast STR allele identification with STRait Razor 3.0.
      ] and the configuration file (Supplement table 1) was modified from [
      • Borsuk L.A.
      • Gettings K.B.
      • Steffen C.R.
      • Kiesler K.M.
      • Vallone P.M.
      Sequence-based US population data for the SE33 locus.
      ]. The sequence string of SE33 included 15 bp of 5′ flanking region and 24 bp of 3′ flanking region. The allele number from STRait Razor 3.0 result was compared with the allele number from length-based data for a concordance check.

      3. Results and discussion

      SE33 locus showed an allele number of 16.3 showed the highest read count (2070 reads) and allele 27.2 was the second high read count (1174 reads). This heterozygous genotype (16.3, 27.2) was confirmed by capillary electrophoresis. The sequence allele 16.3 was presented using the STRait Razor 3.0 with 106 bp sequence string of 5′- CT CTTT CTTT CTTT C [CTTT]3 C [CTTT]14 __CTTT CTTT CTTT CT CTTT CTTT - 3′ (Bold letter indicated repeat region and __ was a two bp deletion). According to Fig. 1, the repeat motif of [CTTT]3 and [CTTT]14 was interrupted with C and the missing of CT at the beginning of 3′ flanking region was found. This allele number of 16.3 was demonstrated in the STRseq catalog [
      • Gettings K.B.
      • Borsuk L.A.
      • Ballard D.
      • Bodner M.
      • Budowle B.
      • Devesse L.
      • et al.
      STRSeq: a catalog of sequence diversity at human identification Short Tandem Repeat loci.
      ] (accession no. MH232723.1) with a 106 bp sequence string identical to our finding. According to the supplement materials of [
      • Borsuk L.A.
      • Gettings K.B.
      • Steffen C.R.
      • Kiesler K.M.
      • Vallone P.M.
      Sequence-based US population data for the SE33 locus.
      ], the same allele was identified as B2 and the allele designation for this motif ID was N + 2.3 when N was 14 (Fig. 1). However, when considering only the sequence of repeat region, [CTTT]3 C [CTTT]14 appeared to be 17.1 rather than 16.3 (Fig. 1). The motif ID system from [
      • Borsuk L.A.
      • Gettings K.B.
      • Steffen C.R.
      • Kiesler K.M.
      • Vallone P.M.
      Sequence-based US population data for the SE33 locus.
      ] was versatile for SE33 allele designation, especially from NGS data, and provided backward compatibility to the capillary electrophoresis data. SE33 was marked as German core loci [8] and had been expected to be widely used outside of Europe shortly [
      • Gettings K.B.
      • Aponte R.A.
      • Vallone P.M.
      • Butler J.M.
      STR allele sequence variation: current knowledge and future issues.
      ]. B2 was found at 0.24% in global population [
      • Borsuk L.A.
      • Gettings K.B.
      • Steffen C.R.
      • Kiesler K.M.
      • Vallone P.M.
      Sequence-based US population data for the SE33 locus.
      ]. The finding in this study illustrated an ambiguous sequence-based allele from SE33 that could cause discordance between NGS and capillary electrophoresis. The length-based allele and sequence-based allele were potentially gone in different directions. Furthermore, as the complexity of SE33 allele designation was far more than other markers, the interpretation of NGS data should not rely on repeat region alone, and the motif ID system must be studied prior to allele calling.
      Fig. 1
      Fig. 1Diagram of an ambiguous allele 16.3 of SE33. Star (*) indicates two bp deletion at 3′ flanking region.

      4. Conclusion

      SE33 was a coamplified STR marker found in Forenseq™ Signature Prep Kit (Verogen) and can be analyzed by several available software that made SE33 more easily studied by NGS technology. However, some cautions need to be considered. For example, the interpretation should not rely on repeat region alone, and the motif ID system needs to be considered prior to allele calling.

      Conflict of interest statement

      All authors have no conflict of interest.

      Acknowledgments

      This study was supported by the Forensic Genetics Research Unit, Ratchadapiseksompotch Fund, Department of Forensic Medicine, Faculty of Medicine, Chulalongkorn University and keep, Bangkok, Thailand.

      Appendix A. Supplementary material

      References

        • Parson W.
        • Ballard D.
        • Budowle B.
        • Butler J.M.
        • Gettings K.B.
        • Gill P.
        • et al.
        Massively parallel sequencing of forensic STRs: considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.
        Forensic Sci. Int. Genet. 2016; 22: 54-63
        • Hölzl-Müller P.
        • Bodner M.
        • Berger B.
        • Parson W.
        Exploring STR sequencing for forensic DNA intelligence databasing using the Austrian National DNA Database as an example.
        Int. J. Leg. Med. 2021; 135: 2235-2246
        • Novroski N.M.M.
        • King J.L.
        • Churchill J.D.
        • Seah L.H.
        • Budowle B.
        Characterization of genetic sequence variation of 58 STR loci in four major population groups.
        Forensic Sci. Int. Genet. 2016; 25: 214-226
        • Gettings K.B.
        • Aponte R.A.
        • Vallone P.M.
        • Butler J.M.
        STR allele sequence variation: current knowledge and future issues.
        Forensic Sci. Int. Genet. 2015; 18: 118-130
        • Borsuk L.A.
        • Gettings K.B.
        • Steffen C.R.
        • Kiesler K.M.
        • Vallone P.M.
        Sequence-based US population data for the SE33 locus.
        Electrophoresis. 2018; 39: 2694-2701
        • Woerner A.E.
        • King J.L.
        • Budowle B.
        Fast STR allele identification with STRait Razor 3.0.
        Forensic Sci. Int. Genet. 2017; 30: 18-23
        • Gettings K.B.
        • Borsuk L.A.
        • Ballard D.
        • Bodner M.
        • Budowle B.
        • Devesse L.
        • et al.
        STRSeq: a catalog of sequence diversity at human identification Short Tandem Repeat loci.
        Forensic Sci. Int. Genet. 2017; 31: 111-117