Assessment of Genetic Polymorphisms in the Rewa Population of Central India Using Y-Chromosomal STR Markers
The aim of the study was to analyze unrelated samples from the Rewa male population of Central India by targeting Y-short tandem repeats (STR) and then compare the results to previously published Y-STR haplotype data.
Materials and Methods. A total of 181 unrelated male subjects from the Rewa population were genotyped for seventeen Y-STRs (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385a/b, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, and Y-GATA-H4) by using an AmpFlSTR® Yfiler™ multiplex kit (Thermo Fisher Scientific, USA). The allele frequencies and forensic parameters were evaluated.
Results. A total of 111 distinct Y-STR alleles with corresponding frequencies ranging from 0.006 to 0.829 were identified. The gene diversity values ranged from 0.3092 at DYS437 to 0.8188 at DYS385b. The studied population showed a high level of haplotype diversity (0.9985) and discrimination capacity (0.927). A haplotype analysis was also conducted. Among the 181 unrelated male samples, 165 haplotypes and 153 unique haplotypes were found. Additionally, Rst (genetic distance) values were calculated using the analysis of molecular variance (AMOVA) for the studied population and for other 18 populations described in the literature. The Rst provides a convenient parameter for estimating the level of genetic differentiation from the microsatellite data. Based on these Rst values and using the multidimensional scaling plot, a neighbor-joining tree was constructed.
Conclusion. The high values of haplotype diversity and discrimination capacity indicate a great potential for distinguishing between male individuals in the studied population. The present population data are expected to find their use in forensic caseworks and population genetics.
Introduction
Rewa or Rewah district is located in the Vindhyan Range Plateau, in the North-Eastern part of Central India. According to the 2011 India Census, the total population size of Rewa is about 2.5 million; among them, there are 1.3 million males and 1.2 million females. The population density is 377 people per square kilometre area. Rewa was known as a princely state during the time of British Empire; today, its population is composed of various groups like tribes, casts, and communities living under various environmental conditions like forests, waterfalls, rivers, lush greenery, monuments, and forts (https://rewa.nic.in/).
Forensic laboratories across the globe are conducting Y-short tandem repeat (STR) typing for their routine casework analysis as well as population genetics studies [1, 2]. Y-STRs are short repeats of 2–7 base pairs and are widely spread throughout the Y-chromosome. Due to their high polymorphism, they are used as genetic markers in forensic analyses for anthropological cases, missing individuals, and disaster victim identification where only distant male relatives are available, as well as paternal-kinship analysis, population genetics, genealogical DNA testing, and human migration analysis. The Y-STR markers are highly valuable in cases of male/female cell admixture, namely in sexual assault cases. There, Y-STR has a significant role in differentiating between male and female DNA samples so as to produce interpretable results [3, 4]. Due to a lack of recombination process, Y-chromosome is considered as a male-specific marker and represents the paternal line inheritance with a little mutation or gene conversion [5, 6]. The non-recombination part of Y-chromosome DNA is useful in those cases where autosomal DNA profiling would be non-informative. Additionally, Y-STR haplotyping has been useful for solving motherless paternity cases by determining the patrilineal lineage of male perpetrators. Over time, various Y-STR multiplexes commercial kits with an ultra-high discrimination capacity between genetic variations have been developed. These characteristics of Y-chromosome make Y-STR profiling an important tool in forensic investigations, tracking human evolution, tracing bio-geographic ancestry of individuals, and detecting genetic differences between populations by identifying the informative Y-STR haplotypes [7–10].
So far, only a limited number of genetic studies on Y-STRs among the Central Indian population have been reported. Hence, there is a need to conduct more studies on this population [11–15].
The aim of the study was to analyze unrelated samples from the Rewa male population of Central India by targeting Y-short tandem repeats and then compare the results to previously published Y-STR haplotype data.
Materials and Methods
Sample collection and ethical consideration. Prior to conducting this study, ethical clearance had been obtained from the Ethical Committee of the Raksha Shakti University (Gujarat, India). A total number of 181 unrelated male individuals’ blood samples were collected randomly for the purpose of this study. All the samples belong to the population of Rewa, a state in Central India (Figure 1). Peripheral blood from each individual was collected and stored in EDTA tubes. Individuals from these settlements were approached in person with the help of their relatives and friends. All the participants were briefed about the purpose of the study. An informed written consent was obtained from each participant, in accordance with the Declaration of Helsinki (2013). The study participants ranged from 20–50 years of age.
Figure 1. Map of India (top left corner) with Central India highlighted and the map of Central India (main picture) with the Rewa district highlighted |
DNA extraction and quantification. Phenol–chloroform extraction was performed to extract genomic DNA from whole blood samples (n=181). The extracted DNA samples were quantified using a Quantifiler® Duo DNA Quantification Kit (Thermo Fisher Scientific, USA) in accordance with the manufacturer’s protocol. Quantified DNAs were normalized to the concentration of 1.0 ng/μl and then taken for PCR amplification.
Amplification. DNA samples were subjected to the amplification by using seventeen Y-STR loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385a/b, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, and Y-GATA-H4). PCR conditions were set as per the manufacturer’s instructions. The total volume of the PCR reaction mixture comprised 25 μl of reagents (5.0 μl of primers mix, 10.0 μl of master mix, and 10.0 μl of nuclease-free water). The amplification was run on a GeneAmp PCR System 9700 Thermal Cycler (Applied Biosystems, USA) using an AmpFLSTR Yfiler™ PCR kit (Thermo Fisher Scientific). Positive and negative controls were also run throughout the system.
DNA electrophoresis and analysis. The PCR products were size separated with the help of capillary electrophoresis using an ABI 3100 Genetic Analyzer (Life Technologies Corporation, USA) and size-characterized with the GeneScan 500 LIZ internal lane size standard (Thermo Fisher Scientific) as per the manufacturer’s recommended protocol. The GeneMapper ID-X Software Version 1.4 (Applied Biosystems) was used to determine the amplified fragments’ sizes. All alleles’ designations were based on a comparison with the allelic ladders provided by the AmpFLSTR Yfiler™ system (Thermo Fisher Scientific). The present study was carried out in accordance with the quality assurance standards recommended by the Scientific Working Group on DNA Analysis Methods (SWGDAM) [16].
Statistical analysis. Forensic and population genetic parameters were evaluated for seventeen Y-STRs. The gene diversities (GD) of the analysed 17 Y-STRs were calculated according to Nei and Tajima [17]. The allele and haplotype frequencies were determined using the counting method. Haplotype diversity (HD) was calculated using the formula:
HD=(N/N–1) (1–∑x2),
where x is the (relative) haplotype frequency of each haplotype in the sample and N is the sample size. The polymorphism information content (PIC) was calculated via the marker allelic frequencies using the following equation:
where n is the number of alleles and pi is the allele probability of the ith allele. The power of discrimination (PD) was also calculated as 1–PM, where PM is the probability matching. All the above parameters were calculated using the online software STR Analysis for Forensics (STRAF) [18]. The discrimination capacity (DC) was calculated by using the formula: DC=Nd/Nt, where Nd denotes the number of unique haplotypes and Nt denotes the total number of haplotypes. We used the Haplotype Analysis v1.04 software [19] to characterize the haplotype. Using the analysis of molecular variance (AMOVA) [20], population pairwise genetic distances (Rst) and their corresponding p-values were calculated to reveal the differences between the studied population and other eighteen populations (described in the literature) and then estimate their genetic relationships, using the YHRD website (https://www.yhrd.org). The Rst is equivalent to the fraction of the total variance in allele size (in terms of number of repeat units) between populations and is given by: Rst=(S–Sw)/S, where S is twice the variance in allele size over the populations and Sw is twice the average within-population variance in allele size. Reduced dimensionality spatial representation of the populations based on Rst values was performed using multidimensional scaling (MDS). The MEGA 6.0 software was used to construct the neighbor-joining tree with 1000 bootstrap replicates [21]. Neighbor-joining is another distance-based method for phylogenetic tree construction. It uses the clustering approach to constructing the tree.
Results
Y-chromosome diversity. Distribution of allele frequencies and gene diversity for the 17 Y-STR loci are presented in Supplement 1, https://drive.google.com/file/
Haplotypes analysis. A total number of 165 Y-haplotypes were obtained from the 181 individuals using the 17 Y-STR Yfiler™ kit. Among 165 haplotypes, the number of unique haplotypes was 153 (92.7%). The remaining 12 haplotypes (7.27%) were replicated between the subjects under study; 10 haplotypes were replicated twice; one haplotype was observed three times and another one was observed five times. The discriminating capacity of the 17 Y-STR Yfiler™ was found to be 0.9985. The overall haplotype diversity for the studied male population was found to be 0.9985. Both the discriminating power and haplotype diversity were remarkably high in the studied population (Supplement 2, https://drive.google.com/file/
Gene diversity and other forensic parameters. The values of GD for the seventeen Y-STR loci are shown in Figure 2. According to the estimated GD for each locus, the highest gene diversity was observed at loci DYS385b and DYS635 (GD was 0.8188 and 0.8055, respectively), whereas DYS437 and DYS392 loci demonstrated the lowest gene diversity values (GD was 0.3092 and 0.3664, respectively). Hence, the results showed a greater level of genetic polymorphism in fourteen Y-STR loci with the GD values exceeding 0.5, whereas three locus-possessed GD values were less than 0.5. The same trend was observed for the PIC values; the highest ones were observed at locus DYS385b and locus DYS635 (PIC is 0.7901 and 0.7729, respectively), whereas the DYS437 and DYS392 loci showed the lowest values (PIC is 0.2873 and 0.3395, respectively). In the probability matching test, DYS385b showed a lower value (0.1857), while DYS437 showed the highest one (0.6925) in this set of studied STRs. For the PD, DYS437 showed the lowest value (0.3075) while DYS385b demonstrated the highest one (0.8143). Gene diversity and other parameters which are of forensic interest are summarised in Table 1.
Table 1. Forensics-relevant parameters observed in the population of Rewa, India (n=181) |
Population statistics. Rst is one of the most widely used measures for genetic differentiation; it plays a central role in ecological and evolutionary genetic studies. Using AMOVA, population Rst and their respective p-values (p<0.05) were calculated in order to understand the genetic relationship between the studied population and other eighteen populations characterized in the literature: China [Han]; Uttar Pradesh, India [Afridi Pathan]; Punjab, India [Balmiki]; Gujarat, India [Bhil]; Andhra Pradesh, India [Brahmin]; Karnataka, India [Indian]; Madhya Pradesh, India [Gond]; Jharkhand, India [Indian]; Madhya Pradesh, India [Indian]; Tamil Nadu, India [Iyengar]; Madhya Pradesh, India [Kanyakubja Brahmin]; Maharashtra, India [Konkanastha Brahmin]; Tripura, India [Riang]; Tripura, India [Tripuri]; Malaysia [Indian]; Nepal [Newar]; Balochistan, Pakistan [Baloch]; and Singapore [Indian] [2, 12, 22–28]. For the comparison, the same 17 Y-STR loci were used (Table 2) with the statistical significance determined by a permutation test with 10,000 replicates. The genetic distances (Rst) and their corresponding p-values for the eighteen populations are illustrated in Table 2. According to the Rst values, the Afridi Pathan population (0.3595) from Uttar Pradesh and the Riang tribe (0.2802) from Tripura showed the longest genetic distances, while the Gond population (0.0465) from Madhya Pradesh and the Bhil population (0.0609) from Gujarat demonstrated the shortest genetic distances. The comparisons between the studied population and those previously reported showed significant variations in the Rst value (p<0.05). The clustering pattern of studied population compared with other Indian populations is shown in the MDS plot generated from the population data (Figure 3). This finding is also illustrated by the clustering pattern observed in the neighbor-joining tree (Figure 4).
Table 2. Molecular variance of pairwise distances between the studied population and reference populations from the YHRD (based on Rst values) |
Figure 3. Multidimensional scaling (MDS) plot based on pairwise Rst values between the studied population and reference populations from the YHRD |
Discussion
Due to the absence of recombination process in the Y-STR chromosome markers, they are considered highly effective in studying the genetic diversity of male individuals in a population. In order to understand the origin of modern humans, it is important to conduct genetic studies of various populations. Although such studies have been reported from different regions of Central India, no population studies have been conducted and reported from the Rewa region.
The present study was conducted with 181 unrelated male individuals of the Rewa population whose Y-STR markers were determined, processed, and then compared with published data on other populations for the genetic distance or genetic relationships. According to the results, the DC value (0.9985) was higher and these seventeen Y-STRs had more polymorphism and higher systematic efficiency in the Rewa populations; these parameters can be used as a useful forensic marker set. The GD value exceeded 0.5 in fourteen Y-STR loci, which showed a higher level of polymorphism. Along with that, three loci had GD values of less than 0.5; that indicated a lower level of polymorphism, which is less suitable for forensic purposes in this population.
The pairwise comparison between the studied population and those reported by others demonstrate that the Rewa people have a close affinity to Madhya Pradesh, India [Gond], followed by Gujarat, India [Bhil] Malaysia [Indian], Punjab, India [Balmiki], Singapore [Indian], and Madhya Pradesh, India [Indian]. The studied population showed a greater genetic affinity with the population of Uttar Pradesh, India [Afridi Pathan]. As observed in the neighbor-joining tree, the Madhya Pradesh, India [Gond], Gujarat, India [Bhil] Malaysia [Indian], Punjab, India [Balmiki], and Madhya Pradesh, India [Indian] cluster belong to the same tree branch. The tree also revealed that the same ethnic-origin populations of Central India (Madhya Pradesh, India [Indian] and Madhya Pradesh, India [Kanyakubja Brahmin]), belong to different cluster. The authors encourage other researchers to explore more genetic polymorphism in the Rewa population.
Conclusion
An attempt has been made to better understand the genetic relationship between Rewa (Central India) and other neighbouring populations. To this end, 181 Rewa male individuals were genotyped using an AmpFlSTR® Yfiler™ multiplex kit. This is the first report that provides population-based data on genetic variations and Y-STR polymorphism in the Rewa population using seventeen Y-STR markers. Here, we also report on the 17 Y-STR loci allelic frequencies and haplotype distribution in the Rewa population, which exhibit a significant discrimination capability, which can be applied to forensic casework and population studies.
Authors’ contribution. Aditi Mishra and Sarthak Mishra planned the study. Aditi Mishra and Ulhas Gondhali wrote the manuscript. Sumit Kumar Choudhary reviewed the manuscript. All authors have read and approved the manuscript.
Acknowledgments. The authors want to thank all individuals who participated in this study.
Study funding. The study was not supported by any financial sources.
Conflicts of interest. The authors declare that they have no conflicts of interest.
References
- Corach D., Filgueira Risso L., Marino M., Penacino G., Sala A. Routine Y-STR typing in forensic casework. Forensic Sci Int 2001; 118(2–3): 131–135, https://doi.org/10.1016/s0379-0738(00)00483-7.
- Singh M., Sarkar A., Nandineni M.R. A comprehensive portrait of Y-STR diversity of Indian populations and comparison with 129 worldwide populations. Sci Rep 2018; 8(1): 15421, https://doi.org/10.1038/s41598-018-33714-2.
- Kayser M. Forensic use of Y-chromosome DNA: a general overview. Hum Genet 2017; 136(5): 621–635, https://doi.org/10.1007/s00439-017-1776-9.
- Jobling M.A., Tyler-Smith C. New uses for new haplotypes: the human Y chromosome, disease and selection. Trends Genet 2000; 16(8): 356–362, https://doi.org/10.1016/s0168-9525(00)02057-6.
- Rozen S., Skaletsky H., Marszalek J.D., Minx P.J., Cordum H.S., Waterston R.H., Wilson R.K., Page D.C. Abundant gene conversion between arms of palindromes in human and ape Y chromosomes. Nature 2003; 423(6942): 873–876, https://doi.org/10.1038/nature01723.
- Trombetta B., Cruciani F., Underhill P.A., Sellitto D., Scozzari R. Footprints of X-to-Y gene conversion in recent human evolution. Mol Biol Evol 2010; 27(3): 714–725, https://doi.org/10.1093/molbev/msp231.
- Roewer L., Croucher P.J., Willuweit S., Lu T.T., Kayser M., Lessig R., de Knijff P., Jobling M.A., Tyler-Smith C., Krawczak M. Signature of recent historical events in the European Y-chromosomal STR haplotype distribution. Hum Genet 2005; 116(4): 279–291, https://doi.org/10.1007/s00439-004-1201-z.
- Heraclides A., Bashiardes E., Fernández-Domínguez E., Bertoncini S., Chimonas M., Christofi V., King J., Budowle B., Manoli P., Cariolou M.A. Y-chromosomal analysis of Greek Cypriots reveals a primarily common pre-Ottoman paternal ancestry with Turkish Cypriots. PloS One 2017; 12(6): e0179474, https://doi.org/10.1371/journal.pone.0179474.
- Iacovacci G., D’Atanasio E., Marini O., Coppa A., Sellitto D., Trombetta B., Berti A., Cruciani F. Forensic data and microvariant sequence characterization of 27 Y-STR loci analyzed in four Eastern African countries. Forensic Sci Int Genet 2017; 27: 123–131, https://doi.org/10.1016/j.fsigen.2016.12.015.
- Li L., Yu G., Li S., Jin L., Yan S. Genetic analysis of 17 Y-STR loci from 1019 individuals of six Han populations in East China. Forensic Sci Int Genet 2016; 20: 101–102, https://doi.org/10.1016/j.fsigen.2015.10.007.
- Ekka A., Kujur K., Sirmour R., Guha D., Padhye S.A., Verma A., Dixit S., Kumawat R.K., Chaubey G., Shrivastava P. Genetic polymorphism of 15 autosomal STR loci in population of Chhattisgarh, India. Gene Rep 2020; 21: 100883, https://doi.org/10.1016/j.genrep.2020.100883.
- Shrivastava P., Gupta U., Jain T., Trivedi V.B. Y STR haplotype diversity in central Indian population. Ann Hum Biol 2016; 43(5): 480–487, https://doi.org/10.3109/03014460.2015.1071425.
- Kaitholia K., Dash H.R., Shrivastava P., Kumawat R., Dixit S., Chaubey G. Forensic characterization and genetic evaluation in the Central Indian population using 27 Y-STRs. Int J Legal Med 2021; 135(3): 791–792, https://doi.org/10.1007/s00414-020-02486-8.
- Shrivastava P., Jain T., Trivedi V.B. Haplotype data for 17 Y-STR loci in the population of Madhya Pradesh, India. Forensic Sci Int Genet 2017; 26: e31–e32, https://doi.org/10.1016/j.fsigen.2016.11.002.
- Jain T., Shrivastava P., Trivedi V.B. Genetic portrait of Majhi tribe of Chhattisgarh, India based on 15 autosomal STRs and 23 Y-STRs. Int J Legal Med 2017; 131(1) :113–114, https://doi.org/10.1007/s00414-016-1421-5.
- Scientific Working Group on DNA Analysis Methods (SWGDAM). Revised validation guidelines. Forensic Sci Commun 2004; 6(3).
- Nei M., Tajima F. DNA polymorphism detectable by restriction endonucleases. Genetics 1981; 97(1): 145–163, https://doi.org/10.1093/genetics/97.1.145.
- Gouy A., Zieger M. STRAF — a convenient online tool for STR data evaluation in forensic genetics. Forensic Sci Int Genet 2017; 30: 148–151, https://doi.org/10.1016/j.fsigen.2017.07.007.
- Eliades N.G.H., Eliades D.G. HAPLOTYPE ANALYSIS: software for analysis of haplotype data. PHYSIS — Centre for Research and Conservation of Forest Lands; Cyprus; 2009, https://doi.org/10.13140/rg.2.2.19029.93922.
- Roewer L., Kayser M., Dieltjes P., Nagy M., Bakker E., Krawczak M., de Knijff P. Analysis of molecular variance (AMOVA) of Y-chromosome-specific microsatellites in two closely related human populations. Hum Mol Genet 1996; 5(7): 1029–1033, https://doi.org/10.1093/hmg/5.7.1029.
- Caspermeyer J. MEGA software celebrates silver anniversary. Mol Biol Evol 2018; 35(6): 1558–1560, https://doi.org/10.1093/molbev/msy098.
- Yong R.Y., Lee L.K., Yap E.P. Y-chromosome STR haplotype diversity in three ethnic populations in Singapore. Forensic Sci Int 2006; 159(2–3): 244–257, https://doi.org/10.1016/j.forsciint.2005.05.010.
- Adnan A., Rakha A., Nazir S., Alghafri R., Hassan Q., Wang C.C., Lu J. Forensic features and genetic legacy of the Baloch population of Pakistan and the Hazara population across Durand line revealed by Y-chromosomal STRs. Int J Legal Med 2021; 135(5): 1777–1784, https://doi.org/10.1007/s00414-021-02591-2.
- Gayden T., Mirabal S., Cadenas A.M., Lacau H., Simms T.M., Morlote D., Chennakrishnaiah S., Herrera R.J. Genetic insights into the origins of Tibeto-Burman populations in the Himalayas. J Hum Genet 2009; 54(4): 216–223, https://doi.org/10.1038/jhg.2009.14.
- Chang Y.M., Perumal R., Keat P.Y., Kuehn D.L. Haplotype diversity of 16 Y-chromosomal STRs in three main ethnic populations (Malays, Chinese and Indians) in Malaysia. Forensic Sci Int 2007; 167(1): 70–76, https://doi.org/10.1016/j.forsciint.2006.01.002.
- Imam J., Reyaz R., Singh R.S., Bapuly A.K., Shrivastava P. Genomic portrait of population of Jharkhand, India, drawn with 15 autosomal STRs and 17 Y-STRs. Int J Legal Med 2018; 132(1): 139–140, https://doi.org/10.1007/s00414-017-1610-x.
- Ghosh T., Kalpana D., Mukerjee S., Mukherjee M., Sharma A.K., Nath S., Rathod V.R., Thakar M.K., Jha G.N. Genetic diversity of 17 Y-short tandem repeats in Indian population. Forensic Sci Int Genet 2011; 5(4): 363–367, https://doi.org/10.1016/j.fsigen.2010.12.007.
- Zeng Z., Rowold D.J., Garcia-Bertrand R., Calderon S., Regueiro M., Li L., Zhong M., Herrera R.J. Taiwanese aborigines: genetic heterogeneity and paternal contribution to Oceania. Gene 2014; 542(2): 240–247, https://doi.org/10.1016/j.gene.2014.03.005.