Forensic inference from genetic markers uses highly polymorphic multi-locus genotypes. Measures of informativeness can aid in selecting efficient genetic markers. Existing measures do not account for multiple sources of genetic variation (e.g. mutation, silent alleles) and are not directly applicable to complex identification problems. Using information-theoretic principles within a probabilistic expert system (PES), we define a general measure of informativeness, Iq, of a marker for answering a forensic query. Iq ranks most genetic markers slightly differently from comparable measures. Accounting for sources of variation such as mutation and silent or null alleles reduces Iq and may further affect the ranking. The criterion has a solid theoretical basis and can account for multiple sources of genetic variation and other anomalies. It can be applied directly to a variety of planning issues concerning the type, quantity and specific choice of markers for use in paternity testing and more general forensic problems.
1. Introduction
Forensic inference from genetic markers is needed in a variety of identification problems including paternity testing, natural disasters, criminal investigations, immigration, etc. Polymorphic multi-locus genotypes and population allele frequencies used in the inference process are often complicated by population genetic factors such as mutation, co-ancestry, etc.
Highly informative genetic markers can reduce the amount of genotyping required. Measures of informativeness can aid in selecting efficient genetic markers for forensic inference; hence, it is desirable to measure the extent to which specific markers contribute to the forensic inference of interest. Existing measures, such as heterozygosity (h) [ ], are primarily based on polymorphism. Despite their various features, they do not account for multiple sources of genetic variation (e.g. mutation, silent alleles), they are not designed specifically for measuring information content, and they are not directly applicable to more complex identification problems.
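To make the polymorphism-based baseline concrete, heterozygosity is commonly computed as h = 1 − Σ p_i², the probability that two alleles drawn at random from the population differ. The Python sketch below uses this population form and ignores any sample-size correction; the function name and the allele frequencies are hypothetical and serve only as illustration.

```python
def heterozygosity(allele_freqs):
    """Expected heterozygosity h = 1 - sum(p_i^2) for a single marker."""
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


# Hypothetical allele frequencies, not taken from any real marker.
print(heterozygosity([0.5, 0.5]))                # two equally common alleles
print(heterozygosity([0.25, 0.25, 0.25, 0.25]))  # four equally common alleles
```

Note that h depends only on the allele frequencies, so it cannot reflect the structure of the query or sources of variation such as mutation.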
Using information-theoretic concepts and a decision-theoretic framework within a probabilistic expert system (PES), we define a general measure of informativeness, Iq, of a marker for a forensic query. The measure can be used in a wide variety of forensic problems.
2. Methods
Consider a PES formulation [ ] for the paternity identification problem based on a single marker, with the query Q represented by the node tf = pf?, i.e. whether the true father (tf) is the putative father (pf). Using the PES to calculate the likelihood ratio for each marker and multiplying these to form a joint likelihood ratio resolves the paternity identification problem.
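The multiplication step can be sketched in a few lines of Python. This is an illustration only: it assumes the markers are independent, the per-marker likelihood ratios are hypothetical numbers rather than PES output, and the conversion to a posterior probability with a prior of 0.5 is an added assumption that the text does not specify.

```python
import math


def joint_likelihood_ratio(marker_lrs):
    """Multiply per-marker likelihood ratios (assumes independent markers)."""
    if any(lr < 0 for lr in marker_lrs):
        raise ValueError("likelihood ratios must be non-negative")
    return math.prod(marker_lrs)


def posterior_probability(joint_lr, prior=0.5):
    """Posterior probability of paternity from the joint LR and an assumed prior."""
    posterior_odds = joint_lr * prior / (1.0 - prior)
    return posterior_odds / (1.0 + posterior_odds)


# Hypothetical per-marker likelihood ratios for three markers.
lrs = [2.0, 5.0, 0.5]
joint = joint_likelihood_ratio(lrs)
print(joint, posterior_probability(joint))
```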
Fig. 1. The overall PES representation of a paternity identification problem.
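In determining which genetic markers contribute to the inference of paternity, we define the informativeness Iq for this scenario as

$$I_q = I(Q; PFGT, CGT, MGT) = H(Q) - H(Q \mid PFGT, CGT, MGT),$$

where H(X) denotes the entropy of the distribution of X,

$$H(X) = -\sum_x p(x) \log p(x),$$

a measure of the total uncertainty of the distribution. Here PFGT, CGT and MGT denote the genotypes of the putative father, the child and the mother. The quantity Iq measures the reduction in uncertainty regarding Q due to observation of the genotypes of the associated individuals.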
The quantity I(Q; PFGT, CGT, MGT) is also known as the mutual information between Q and (PFGT, CGT, MGT). For further details, the reader is referred to [ ].
The concept of mutual information is well established and has a solid general foundation. It can be applied to any forensic query Q and any collection of evidence E1,…, Ek, and can therefore be used for planning purposes in a multitude of scenarios. In particular, it remains valid when mutation is incorporated into the PES.
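As an illustration of what the mutual information measures, the Python sketch below evaluates I(Q; PFGT, CGT, MGT) for a single marker by brute-force enumeration rather than by PES propagation. Everything beyond the definition is an assumption made for this sketch and is not taken from the source: Hardy-Weinberg genotype probabilities, a prior of 0.5 on paternity, no mutation or silent alleles, a paternal allele drawn at random from the population when the putative father is not the true father, and hypothetical allele frequencies. All function names are illustrative.

```python
import itertools
import math
from collections import defaultdict


def genotype_probs(freqs):
    """Hardy-Weinberg probabilities of unordered genotypes (a, b), a <= b."""
    probs = {}
    for a in range(len(freqs)):
        for b in range(a, len(freqs)):
            probs[(a, b)] = freqs[a] ** 2 if a == b else 2 * freqs[a] * freqs[b]
    return probs


def transmit(genotype):
    """Distribution of the allele a parent passes on (assumes no mutation)."""
    dist = defaultdict(float)
    for allele in genotype:
        dist[allele] += 0.5
    return dist


def child_dist(maternal, paternal):
    """Child genotype distribution from maternal and paternal allele distributions."""
    dist = defaultdict(float)
    for m, p_m in maternal.items():
        for f, p_f in paternal.items():
            dist[tuple(sorted((m, f)))] += p_m * p_f
    return dist


def informativeness(freqs, prior=0.5):
    """Mutual information, in bits, between Q and (MGT, PFGT, CGT)."""
    geno = genotype_probs(freqs)
    population = dict(enumerate(freqs))  # random-man allele distribution
    joint = defaultdict(float)           # joint[(q, (mgt, pfgt, cgt))]
    for (mgt, p_m), (pfgt, p_pf) in itertools.product(geno.items(), repeat=2):
        for q, p_q in ((True, prior), (False, 1.0 - prior)):
            paternal = transmit(pfgt) if q else population
            for cgt, p_c in child_dist(transmit(mgt), paternal).items():
                joint[(q, (mgt, pfgt, cgt))] += p_q * p_m * p_pf * p_c
    p_query = defaultdict(float)
    p_evidence = defaultdict(float)
    for (q, e), p in joint.items():
        p_query[q] += p
        p_evidence[e] += p
    return sum(
        p * math.log2(p / (p_query[q] * p_evidence[e]))
        for (q, e), p in joint.items()
        if p > 0
    )


# Hypothetical markers with two and four equally common alleles.
print(informativeness([0.5, 0.5]))
print(informativeness([0.25, 0.25, 0.25, 0.25]))
```

With a prior of 0.5 the result lies between 0 and 1 bit, since the mutual information cannot exceed H(Q). A monomorphic marker yields exactly 0 because the genotypes then carry no information about Q.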
Once a PES has been established for the forensic problem in question, the mutual information can be calculated by standard PES methods [ ]. For simplicity, mutation is incorporated as a proportional mutation model.
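The text does not state the formula for the proportional mutation model. One common parameterization, assumed here purely for illustration, lets allele i mutate to a different allele j with probability k·p_j, where p_j is the population frequency of j and k is chosen so that the overall mutation rate equals a given value μ. The Python sketch below builds the resulting transition matrix; the function name, the allele frequencies and the mutation rate are hypothetical.

```python
def proportional_mutation_matrix(freqs, mutation_rate):
    """Transition matrix M[i][j] = P(transmitted allele j | parental allele i).

    Assumed parameterization: M[i][j] = k * p_j for j != i, with k chosen so
    that the population-averaged mutation probability equals mutation_rate.
    """
    homozygosity = sum(p * p for p in freqs)
    if homozygosity >= 1.0:
        raise ValueError("need at least two alleles with non-zero frequency")
    k = mutation_rate / (1.0 - homozygosity)
    n = len(freqs)
    matrix = [
        [1.0 - k * (1.0 - freqs[i]) if i == j else k * freqs[j] for j in range(n)]
        for i in range(n)
    ]
    if any(matrix[i][i] < 0.0 for i in range(n)):
        raise ValueError("mutation rate too large for these allele frequencies")
    return matrix


# Hypothetical allele frequencies and overall mutation rate.
freqs = [0.2, 0.3, 0.5]
for row in proportional_mutation_matrix(freqs, 0.005):
    print(row)
```

Under this parameterization the allele frequencies are preserved from one generation to the next, which is one reason the model is convenient.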
3. Results
For illustration, we consider the prior planning problem for paternity identification, i.e. the scenario where no genetic information is yet available for a triplet consisting of mother, child, and putative father, and the informativeness of candidate markers must be compared. The last column of Table 1 gives the informativeness when mutation is incorporated.
The results in Table 1 are rank ordered according to Iq. Traditional measures of informativeness identify THO1 as the most informative marker and FES as the least informative, whereas Iq ranks D1S80 highest, although the differences are small for all measures. Traditional measures give essentially identical rankings to all markers, with the exception of PE which switches the order of D1S80 and D21S11. Taking mutation into account slightly reduces informativeness.
Table 1. Informativeness of genetic markers for paternity
The suggested measure, Iq, has a solid theoretical basis and gives rankings of forensic genetic markers similar to those of existing measures. It is applicable to any number of alleles and can account for multiple sources of genetic variation and other anomalies. It can be applied directly to a variety of planning issues concerning the type, quantity and choice of markers for use in paternity testing and more general forensic problems.
Conflict of interest
None.
References
Nei M., Roychoudhury A.K. Sampling variances of heterozygosity and genetic distance.