
Deviations from Hardy–Weinberg proportions for multiple alleles under viability selection

Published online by Cambridge University Press:  22 April 2008

GONZALO ALVAREZ*
Affiliation:
Departamento de Genética, Facultad de Biología, Universidad de Santiago de Compostela, 15782 Santiago de Compostela, Spain
*Tel.: +34 981563100, ext 13261. Fax: +34 981596904. e-mail: [email protected]

Summary

Departures of genotype frequencies from Hardy–Weinberg proportions (HWP) for a single autosomal locus due to viability selection in a random mating population have been studied only for the two-allele case. In this article, the analysis of deviations from HWP due to constant viability selection is extended to multiple alleles. The deviations for an autosomal locus with k alleles are measured by means of k f_ii fixation indices for homozygotes and k(k−1)/2 f_ij fixation indices for heterozygotes, and expressions are obtained for these indices (F_IS statistics) under the multiallele viability model. Furthermore, expressions for f_ii and f_ij when the multiallele polymorphism is at stable equilibrium are also derived and it is demonstrated that the pattern of multiallele Hardy–Weinberg deviations at equilibrium is characterized by a global heterozygote excess and a deficiency of each of the homozygotes. This pattern may be useful for detecting whether a given multiallelic polymorphism is at stable equilibrium in the population due to viability selection. An analysis of Hardy–Weinberg deviations from published data for the three-allele polymorphism at the β-globin locus in human populations from West Africa is presented for illustration.

Copyright © Cambridge University Press 2008

1. Introduction

The departures of genotype frequencies from Hardy–Weinberg proportions (HWP) for a given locus provide relevant information for understanding genetic characteristics of populations, such as deviations from random mating, population subdivision, asymmetric allelic contributions of the sexes, or viability selection. Furthermore, the analysis of deviations from HWP is one of the few ways to identify systematic genotyping errors, so that at present it is a fundamental tool for genotyping quality control in large-scale studies of molecular markers (Hare et al., 1996; Gomes et al., 1999; Xu et al., 2002; Hosking et al., 2004; Chen et al., 2005; Zou & Donner, 2006; Teo et al., 2007). In addition, in the context of studies of association between human diseases and molecular markers, the analysis of deviations from HWP is important for distinguishing those deviations in patients and control samples that could be attributed to the underlying genetic disease model at the susceptibility locus from those due to genotyping errors, chance and/or violations of the assumptions of Hardy–Weinberg equilibrium (Wittke-Thompson et al., 2005).

Natural selection operating through differential survival of genotypes is probably one of the most important mechanisms disturbing HWP in random mating populations, particularly when genotypes are recorded at the adult stage of the life cycle. Departures from HWP for a single autosomal locus produced by viability selection have been investigated for the two-allele case, especially as regards statistical tests for detecting natural selection (Lewontin & Cockerham, 1959; Li, 1959; Workman, 1969; Brown, 1970; Hedrick, 2005, pp. 150–152). However, the study of deviations from HWP caused by viability selection acting on multiple alleles has received very little attention. In contrast, analysis of deviations from HWP for multiple alleles has been performed for models of subdivided populations (Nei, 1965; Li, 1969), inbreeding (Li & Horvitz, 1953; Yasuda, 1968; Curie-Cohen, 1982; Robertson & Hill, 1984; Hill et al., 1995; Rousset & Raymond, 1995) and differential selection between the sexes (Purser, 1966; Ziehe & Gregorius, 1981). Consequently, viability selection is the only basic model of deviations from HWP for which the multiple-allele case has not been investigated. This is rather striking given that the classical model of multiallele viability selection has been extensively studied, in particular with respect to conditions for the stability of multiallelic polymorphisms (Mandel, 1959, 1970; Weir, 1970; Lewontin et al., 1978; Karlin, 1981; Karlin & Feldman, 1981).
On theoretical grounds, multiallelic polymorphisms are expected to be easily maintained in natural populations by viability selection since, although the proportion of the viability parameter space permitting stable polymorphisms becomes extremely small as the number of alleles increases (Lewontin et al., 1978; Karlin, 1981; Karlin & Feldman, 1981), models based on Monte Carlo simulations in which a series of new mutations are introduced into the population show that viability selection is capable of maintaining a large number of alleles (up to 38 in some cases) (Spencer & Marks, 1988, 1992; Marks & Spencer, 1991). In the present article, expressions for departures of genotype frequencies from HWP, as measured by means of fixation indices (F_IS statistics), are obtained for an autosomal locus with multiple alleles under a deterministic model of constant viability selection and random mating. Special attention is devoted to characterizing the multiallelic pattern of deviations from HWP exhibited by the population when it attains a stable equilibrium due to viability selection.

2. Hardy–Weinberg deviations under the multiallele viability model

(i) Model and notation

An autosomal locus with k alleles (denoted as A_1, A_2, …, A_k) is considered, where p_i is the frequency of the A_i allele at the zygotic stage. Assuming random mating, the frequency of the A_iA_i homozygote is p_i^2 and the frequency of the A_iA_j heterozygote is 2p_ip_j. Under the standard one-locus multiallele viability selection model, with fitness values w_ii for the A_iA_i homozygote and w_ij for the A_iA_j heterozygote, the adult frequencies for the A_iA_i and A_iA_j genotypes, A_ii and A_ij respectively, are

(1)
A_{ii} = \frac{p_i^{2} w_{ii}}{W}, \qquad A_{ij} = \frac{2 p_i p_j w_{ij}}{W}

and the allele frequencies in adults are

(2)
p_i' = \frac{p_i w_i}{W}, \qquad p_j' = \frac{p_j w_j}{W}

where w_i is the marginal fitness of allele A_i and W is the mean fitness of the population, given by

(3)
w_i = p_i w_{ii} + \sum_{j \ne i} p_j w_{ij} = \sum_{j} p_j w_{ij}
W = \sum_{i} p_i^{2} w_{ii} + \sum_{i < j} 2 p_i p_j w_{ij} = \sum_{i} p_i w_i = \sum_{i} \sum_{j} p_i p_j w_{ij}.
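Expressions (1) to (3) can be sketched numerically as follows. This is a minimal illustration, not code from the article: the function name `viability_selection`, the dictionary layout for genotypes, and the allele frequencies and fitness values are all assumptions chosen for the example.

```python
def viability_selection(p, w):
    """Apply one round of viability selection.

    p: zygotic allele frequencies p_i (illustrative input)
    w: symmetric fitness matrix, w[i][j] = fitness of genotype A_iA_j
    """
    k = len(p)
    # Marginal fitnesses w_i = sum_j p_j w_ij, expression (3)
    marginal = [sum(p[j] * w[i][j] for j in range(k)) for i in range(k)]
    # Mean fitness W = sum_i p_i w_i, expression (3)
    mean_w = sum(p[i] * marginal[i] for i in range(k))
    # Adult genotype frequencies, expression (1), keyed by (i, j) with i <= j
    adults = {}
    for i in range(k):
        adults[(i, i)] = p[i] ** 2 * w[i][i] / mean_w
        for j in range(i + 1, k):
            adults[(i, j)] = 2 * p[i] * p[j] * w[i][j] / mean_w
    # Adult allele frequencies p_i' = p_i w_i / W, expression (2)
    p_adult = [p[i] * marginal[i] / mean_w for i in range(k)]
    return marginal, mean_w, adults, p_adult


# Illustrative values only (not taken from the article)
p = [0.5, 0.3, 0.2]
w = [[0.8, 1.0, 1.0],
     [1.0, 0.6, 1.0],
     [1.0, 1.0, 0.4]]
marginal, mean_w, adults, p_adult = viability_selection(p, w)
print(marginal, mean_w, p_adult)
```

The adult genotype frequencies sum to one by construction, and counting alleles in the adult genotypes recovers the same p_i′ as expression (2).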

The departures of adult genotype frequencies from HWP for multiple alleles can be expressed in terms of either k f_ii fixation indices (F_IS statistics) or, alternatively, k(k−1)/2 f_ij fixation indices, as

(4)
f_{ii} = \frac{A_{ii} - p_i'^{2}}{p_i' (1 - p_i')}
(5)
f_{ij} = 1 - \frac{A_{ij}}{2 p_i' p_j'}

taking into account that f_ii and f_ij are functionally related by

(6)
f_{ii} = \frac{\sum_{j \ne i} p_j' f_{ij}}{1 - p_i'}

(Weir, 1996, p. 94). In this formulation, the f_ii coefficients can be considered as allele-specific F_IS statistics (Chakraborty & Danker-Hopfe, 1991).
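The definitions (4) and (5), and the relation (6) between them, can be checked numerically. The sketch below is an illustration under stated assumptions: the helper name `fixation_indices` and the adult genotype frequencies are hypothetical and are not data from the article.

```python
def fixation_indices(adults, k):
    """Compute f_ii (expression 4) and f_ij (expression 5) from adult
    genotype frequencies keyed by (i, j) with i <= j."""
    # Adult allele frequencies: homozygote plus half of each heterozygote
    p = []
    for i in range(k):
        het = sum(adults[(min(i, j), max(i, j))] for j in range(k) if j != i)
        p.append(adults[(i, i)] + 0.5 * het)
    f_hom = [(adults[(i, i)] - p[i] ** 2) / (p[i] * (1 - p[i]))
             for i in range(k)]
    f_het = {(i, j): 1 - adults[(i, j)] / (2 * p[i] * p[j])
             for i in range(k) for j in range(i + 1, k)}
    return p, f_hom, f_het


# Hypothetical adult genotype frequencies for three alleles
adults = {(0, 0): 0.30, (0, 1): 0.30, (0, 2): 0.20,
          (1, 1): 0.08, (1, 2): 0.10, (2, 2): 0.02}
p, f_hom, f_het = fixation_indices(adults, 3)

# Right-hand side of relation (6) for each allele i
rhs = [sum(p[j] * f_het[(min(i, j), max(i, j))] for j in range(3) if j != i)
       / (1 - p[i]) for i in range(3)]
print(f_hom, rhs)
```

The two printed lists agree up to floating-point error, which is what relation (6) states.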

(ii) Hardy–Weinberg deviations

Expressions for the f_ii and f_ij fixation indices under the model of viability selection are obtained by substituting the allele and genotype frequencies in (4) and (5) by their values given by (1) and (2), and they are

(7)
f_{ii} = \frac{p_i (w_{ii} W - w_i^{2})}{w_i (W - p_i w_i)}
(8)
f_{ij} = 1 - \frac{w_{ij} W}{w_i w_j}.
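The substitution step can be written out explicitly. The following sketch uses only definitions (1), (2), (4) and (5) and introduces no new symbols; the intermediate lines are a reconstruction of the algebra, not text from the article.

```latex
\begin{align*}
f_{ii} &= \frac{A_{ii} - p_i'^{2}}{p_i'(1 - p_i')}
        = \frac{\dfrac{p_i^{2} w_{ii}}{W} - \dfrac{p_i^{2} w_i^{2}}{W^{2}}}
               {\dfrac{p_i w_i}{W}\left(1 - \dfrac{p_i w_i}{W}\right)}
        = \frac{p_i (w_{ii} W - w_i^{2})}{w_i (W - p_i w_i)}, \\
f_{ij} &= 1 - \frac{A_{ij}}{2 p_i' p_j'}
        = 1 - \frac{2 p_i p_j w_{ij} / W}{2 p_i p_j w_i w_j / W^{2}}
        = 1 - \frac{w_{ij} W}{w_i w_j}.
\end{align*}
```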

In these expressions for the deviations from HWP for multiple alleles, the terms (w_ii W − w_i^2) for the homozygote A_iA_i and (w_i w_j − w_ij W) for the heterozygote A_iA_j determine the sign of the deviation. Thus, when the fitness of a particular genotype multiplied by the mean fitness is equal to the product of the marginal fitnesses of the alleles forming that genotype, a deviation from HWP is not expected to occur for that genotype. This is the case for multiplicative or geometric fitnesses where w_ii = a_i^2 and w_ij = a_i a_j since, in this case, marginal and mean fitnesses take the form

w_i = a_i \sum_{j} p_j a_j
w_j = a_j \sum_{i} p_i a_i
W = \sum_{i} \sum_{j} p_i p_j a_i a_j

and substituting these expressions in (7) and (8), we have

f_{ii} = f_{ij} = 0.

Therefore, under multiplicative viability fitnesses, the genotype frequencies at a multiallelic locus, after the operation of selection, are in accordance with Hardy–Weinberg expectations. This result was demonstrated for a two-allele locus by Lewontin & Cockerham (1959) and extended to the three-allele case by Li (1959), and here is generalized for a k-allele system.
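The multiplicative result can be illustrated numerically with expressions (7) and (8). In this sketch the function name `f_from_fitness`, the allele frequencies, the factors a_i and the contrasting non-multiplicative matrix are all assumed values chosen for illustration.

```python
def f_from_fitness(p, w):
    """Fixation indices after selection, from expressions (7) and (8)."""
    k = len(p)
    marginal = [sum(p[j] * w[i][j] for j in range(k)) for i in range(k)]
    mean_w = sum(p[i] * marginal[i] for i in range(k))
    f_hom = [p[i] * (w[i][i] * mean_w - marginal[i] ** 2)
             / (marginal[i] * (mean_w - p[i] * marginal[i]))
             for i in range(k)]
    f_het = {(i, j): 1 - w[i][j] * mean_w / (marginal[i] * marginal[j])
             for i in range(k) for j in range(i + 1, k)}
    return f_hom, f_het


p = [0.5, 0.3, 0.2]           # illustrative zygotic allele frequencies

# Multiplicative fitnesses w_ij = a_i * a_j (hypothetical a_i)
a = [1.0, 0.8, 0.5]
w_mult = [[ai * aj for aj in a] for ai in a]
f_hom_mult, f_het_mult = f_from_fitness(p, w_mult)

# A non-multiplicative matrix for contrast (heterozygote advantage)
w_over = [[0.8, 1.0, 1.0],
          [1.0, 0.6, 1.0],
          [1.0, 1.0, 0.4]]
f_hom_over, f_het_over = f_from_fitness(p, w_over)

print(f_hom_mult, f_het_mult)
print(f_hom_over, f_het_over)
```

With the multiplicative matrix every index is zero up to rounding, whereas the second matrix produces non-zero deviations.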

When the genotype fitnesses do not follow a geometric progression, the pattern of Hardy–Weinberg deviations is difficult to specify since f_ii and f_ij, as expressed by (7) and (8), depend on the marginal and mean fitnesses, which change across generations. However, a particular and relatively simple pattern of Hardy–Weinberg departures is expected to occur for a multiallelic locus when a stable equilibrium is attained in the population by the operation of viability selection. At the equilibrium, f_ii and f_ij, as expressed by (7) and (8), reduce to

(9)
f_{ii}^{*} = \frac{p_i^{*}}{1 - p_i^{*}} \left( \frac{w_{ii}}{W^{*}} - 1 \right)
(10)
f_{ij}^{*} = 1 - \frac{w_{ij}}{W^{*}}

where * denotes equilibrium values, since the condition for equilibrium in the multiallele viability model is simply w_i = w_j = … = W (Lewontin et al., 1978). Consequently, the departure from HWP for a given genotype is basically determined, at the stable equilibrium, by the ratio of the genotype fitness to the mean fitness of the population. Given that the homozygote and heterozygote fitnesses must satisfy two inequalities with respect to the mean fitness of the population as necessary conditions for the existence of a stable multiallele polymorphism, which are W* > w_ii for all i = 1, 2, …, k and W* < \tilde{w}_{ij}^{*}, where \tilde{w}_{ij}^{*} is the weighted mean fitness of heterozygotes at the equilibrium (Mandel, 1959; Ginzburg, 1979), it follows that all homozygotes must present a deficiency with respect to HWP, that is

f_{ii}^{*} < 0 \qquad (\text{all } i = 1, 2, \ldots, k)

and an excess must be present in many but not necessarily all heterozygote classes. In the three-allele case, for example, it has been shown that at most one heterozygous viability may fall below that of at most two homozygotes (Mandel, 1959) and therefore, in this case, the f_ij* corresponding to that particular heterozygote will be positive.

For a two-allele locus, the expression for departures from HWP under viability selection, obtained by Workman (1969) and Brown (1970), is a particular case of expressions (7) and (8). At equilibrium, the Hardy–Weinberg deviation for a diallelic locus as given by Workman (1969) is a particular case of expressions (9) and (10).
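The equilibrium pattern given by (9) and (10) can be illustrated for a hypothetical three-allele fitness matrix. The sketch below locates the equilibrium by simply iterating recursion (2) until the allele frequencies stop changing; this iteration, the function name `equilibrium_deviations`, the tolerance and the fitness values are assumptions made for illustration and are not the article's procedure.

```python
def equilibrium_deviations(w, tol=1e-13, max_iter=100000):
    """Iterate p_i' = p_i w_i / W to a fixed point, then apply (9) and (10)."""
    k = len(w)
    p = [1.0 / k] * k  # start from equal allele frequencies
    for _ in range(max_iter):
        marginal = [sum(p[j] * w[i][j] for j in range(k)) for i in range(k)]
        mean_w = sum(p[i] * marginal[i] for i in range(k))
        new_p = [p[i] * marginal[i] / mean_w for i in range(k)]
        change = max(abs(new_p[i] - p[i]) for i in range(k))
        p = new_p
        if change < tol:
            break
    # Mean fitness at the fixed point, W* = sum_i sum_j p_i p_j w_ij
    mean_w = sum(p[i] * p[j] * w[i][j] for i in range(k) for j in range(k))
    # Expression (9): homozygote indices at equilibrium
    f_hom = [p[i] / (1 - p[i]) * (w[i][i] / mean_w - 1) for i in range(k)]
    # Expression (10): heterozygote indices at equilibrium
    f_het = {(i, j): 1 - w[i][j] / mean_w
             for i in range(k) for j in range(i + 1, k)}
    return p, mean_w, f_hom, f_het


# Hypothetical fitnesses: every heterozygote fitter than every homozygote
w = [[0.8, 1.0, 1.0],
     [1.0, 0.6, 1.0],
     [1.0, 1.0, 0.4]]
p_eq, w_eq, f_hom_eq, f_het_eq = equilibrium_deviations(w)
print(p_eq, w_eq, f_hom_eq, f_het_eq)
```

For this matrix the equilibrium frequencies are 6/11, 3/11 and 2/11, the mean fitness exceeds every homozygote fitness, and every f_ii* is negative, as the text requires. Because all heterozygote fitnesses are equal here, all f_ij* coincide and, by relation (6), equal the f_ii*.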

(iii) Estimation of F_IS statistics

The model for statistical estimation of deviations from HWP for multiple alleles under viability selection is a model where either k f_ii parameters, or alternatively k(k−1)/2 f_ij parameters, must be independently estimated, in addition to the allele frequencies. At first sight, this model is more complicated than the model for the estimation of the inbreeding coefficient under regular inbreeding, in which only one f value needs to be estimated in addition to the allelic frequencies, and which has been extensively studied (Li & Horvitz, 1953; Curie-Cohen, 1982; Robertson & Hill, 1984; Hill et al., 1995; Rousset & Raymond, 1995). However, in the framework of maximum likelihood theory, the estimation of both the set of parameters f and the allele frequencies is straightforward. Consider a random sample of n adults in which the observed numbers of A_iA_i and A_iA_j genotypes are n_ii and n_ij, respectively, and the observed allele frequency of A_i is p_i′. The likelihood of a sample of n individuals composed of n_ii genotypes A_iA_i and n_ij genotypes A_iA_j can be expressed in terms of a set F of k(k−1)/2 f_ij parameters and a set P of k parameters of allele frequencies as

L_{(P,F)} = \frac{n!}{n_{11}! \, n_{12}! \cdots n_{kk}!} \left( p_1'^{2} + p_1' \sum_{j \ne 1} p_j' f_{1j} \right)^{n_{11}} \times \left( 2 p_1' p_2' (1 - f_{12}) \right)^{n_{12}} \cdots \left( p_k'^{2} + p_k' \sum_{j \ne k} p_j' f_{kj} \right)^{n_{kk}}.

Under this formulation the f_ii parameters are not taken into consideration and therefore the number of independent parameters to be estimated equals the number of degrees of freedom in the data, so that Bailey's method (Bailey, 1951; Weir, 1996, pp. 63–66) can be applied. Consequently, the maximum likelihood estimates of the parameters are simply their observed values and, therefore, both the f_ij obtained from (5) and the allele frequencies computed by gene counting are maximum likelihood estimates. With regard to the f_ii fixation indices, their estimates from (4) are also maximum likelihood estimates since each particular f_ii corresponds to the f estimate that results from grouping all the alleles into two categories, i versus non-i, and for a diallelic system both f_11 = f_12 = f_22 = f and the allele frequency are maximum likelihood estimates (Li & Horvitz, 1953; Weir, 1996, pp. 64–65). In this way, a k-allele system can be split into k diallelic systems each leading to maximum likelihood estimates of both f_ii and p_i′. Note that this entire estimation procedure is valid not only for the particular case of deviations from HWP produced by viability selection, but for any case where each specific genotype has a specific departure from HWP, as for example population subdivision or different allelic frequencies between the sexes.
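The estimation procedure described above can be sketched as follows: allele frequencies by gene counting, f_ij from (5), f_ii from (4), and a check that each f_ii equals the single f obtained after collapsing the alleles into i versus non-i. The genotype counts and the function names `estimate_from_counts` and `collapsed_f` are hypothetical and serve only as an illustration.

```python
def estimate_from_counts(counts, k):
    """Estimate p_i', f_ii and f_ij from genotype counts keyed by (i, j), i <= j."""
    n = sum(counts.values())
    p = []
    for i in range(k):
        het = sum(counts[(min(i, j), max(i, j))] for j in range(k) if j != i)
        p.append((2 * counts[(i, i)] + het) / (2 * n))  # gene counting
    f_hom = [(counts[(i, i)] / n - p[i] ** 2) / (p[i] * (1 - p[i]))
             for i in range(k)]
    f_het = {(i, j): 1 - (counts[(i, j)] / n) / (2 * p[i] * p[j])
             for i in range(k) for j in range(i + 1, k)}
    return n, p, f_hom, f_het


def collapsed_f(counts, k, i):
    """f for the two-category system 'allele i' versus 'all other alleles'."""
    n = sum(counts.values())
    het = sum(counts[(min(i, j), max(i, j))] for j in range(k) if j != i)
    p_i = (2 * counts[(i, i)] + het) / (2 * n)
    # Diallelic fixation index: 1 - observed / expected heterozygosity
    return 1 - (het / n) / (2 * p_i * (1 - p_i))


# Hypothetical sample of 250 adults scored at a three-allele locus
counts = {(0, 0): 120, (0, 1): 60, (0, 2): 40,
          (1, 1): 10, (1, 2): 15, (2, 2): 5}
n, p_hat, f_hom_hat, f_het_hat = estimate_from_counts(counts, 3)
f_two_class = [collapsed_f(counts, 3, i) for i in range(3)]
print(n, p_hat, f_hom_hat, f_two_class)
```

The agreement between `f_hom_hat` and `f_two_class` reflects the splitting of a k-allele system into k diallelic systems.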

3. Hardy–Weinberg deviations for the β-globin locus in human populations from West Africa

The β-globin gene is one of the most thoroughly studied polymorphisms in man, since it is an adaptive polymorphism involved in resistance against Plasmodium falciparum malaria (Cavalli-Sforza & Bodmer, 1971; Vogel & Motulsky, 1997). An analysis of multiallelic deviations from HWP for this locus in human populations has been performed in the present study using published data (Allison, 1956; Roberts & Boyo, 1962; Modiano et al., 2001). The populations considered belong to the geographical area of West Africa where this locus presents three alleles with detectable frequencies: the HbA allele that gives rise to the normal haemoglobin, the HbS allele responsible for the sickle haemoglobin, and the HbC allele responsible for haemoglobin C. Samples of adults and infants from the Jola and Fula populations (The Gambia), and of adults and children from the Yoruba population (Nigeria), were analysed. A very large sample (n=3513) from the Mossi population (Burkina Faso) was also included in the analysis: this is a control sample from a large case–control study performed in Burkina Faso to investigate the protective role against severe malaria of genotypes at the β-globin locus, and was composed mainly of healthy subjects more than 6 years old (87% children aged 6–15 years, and 8·4% individuals more than 15 years old), though a small number of children aged 1–5 years (4·6%) was also included (Modiano et al., 2001).

Genotype distribution, allele frequencies and deviations from HWP for each of the samples analysed are given in Table 1. Deviations from HWP were measured by means of the f_ii estimators of Robertson & Hill (1984), giving estimates for the three homozygous genotypes (\hat{f}_{AA} for the homozygote HbAA, \hat{f}_{SS} for HbSS, and \hat{f}_{CC} for HbCC) and a global estimate of deviation from HWP at the locus (\hat{f}_{T}) obtained from the weighted average of the f_ii estimates. The variance of f_ii estimates equals 1/n for f_ii = 0 and the ratio of the squared estimate to its variance will be approximately distributed as a chi-square variable with one degree of freedom, leading to a two-tailed test of the null hypothesis H_0: f_ii = 0 (Elandt-Johnson, 1971, pp. 355–356; Robertson & Hill, 1984). One-tailed tests can be performed from the ratio of the estimate to its standard error, which is approximately distributed as a standard normal variable (Elandt-Johnson, 1971, pp. 355–356). The two-tailed test of f_ii is equivalent to the Hardy–Weinberg test of a single homozygous genotype recently proposed by Chen et al. (2005), since the chi-square statistic given by Chen et al. (2005, p. 1440) is simply n f_ii^2.
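The two tests just described can be sketched with the standard library alone. The function name `fixation_index_tests` and the example values of the estimate and sample size are assumptions for illustration; they are not taken from Table 1.

```python
import math


def fixation_index_tests(f_hat, n):
    """Tests of H_0: f_ii = 0 using Var(f_hat) = 1/n under the null."""
    # Two-tailed test: f_hat^2 / (1/n) = n * f_hat^2 ~ chi-square, 1 d.f.
    chi2 = n * f_hat ** 2
    # Survival function of chi-square with 1 d.f. via the error function
    p_two_tailed = math.erfc(math.sqrt(chi2 / 2))
    # One-tailed test: f_hat / sqrt(1/n) ~ standard normal
    z = f_hat * math.sqrt(n)
    # Lower-tail probability P(Z <= z), relevant for a deficiency (f < 0)
    p_lower_tail = 0.5 * math.erfc(-z / math.sqrt(2))
    return chi2, p_two_tailed, z, p_lower_tail


# Hypothetical estimate and sample size
chi2, p_two, z, p_one = fixation_index_tests(-0.12, 400)
print(chi2, p_two, z, p_one)
```

For a negative estimate the lower-tail probability is exactly half the two-tailed one, which is why a deviation can be significant by a one-sided test but not by the two-sided chi-square test.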
The analysis of deviation from HWP for multiple alleles by means of f_ii fixation indices and/or single genotype tests gives a complete view of the distribution of deviations among particular genotypes at the given locus in contrast to the overall tests such as the chi-square goodness-of-fit test or the exact test (Louis & Dempster, 1987; Guo & Thompson, 1992; Chakraborty & Zhong, 1994; Rousset & Raymond, 1995). A very regular pattern of deviations from HWP for the β-globin locus is observed in the adult samples from West African populations. First, a global heterozygote excess is found in all adult samples: \hat{f}_{T} ranges from −0·103 to −0·052 with an average of −0·070±0·016. This heterozygote excess is statistically significant by one-sided tests in the Yoruba sample. Second, the distribution of Hardy–Weinberg deviations among particular homozygotes is clearly uneven in the adult samples, since homozygotes for the HbA and HbS alleles show a clear deficiency with respect to HWP, whereas the frequency of the homozygote HbCC is very close to Hardy–Weinberg expectations. Specifically, \hat{f}_{AA} ranges from −0·165 to −0·105 with a mean value of −0·128±0·019, these deviations being statistically significant in two of the three adult samples analysed; similarly, \hat{f}_{SS} ranges from −0·144 to −0·091 with a mean value of −0·112±0·016, these deviations being statistically significant in the Yoruba sample. In contrast, \hat{f}_{CC} ranges from −0·053 to −0·008 with a mean value of −0·024±0·015 and these negative estimates are associated with the absence of HbCC homozygotes in the adult samples due to the low frequency of the HbC allele (see expression (4)). In addition, these deviations are not statistically significant in any of the three adult samples studied.
A substantial number of HbCC homozygotes is present in the large sample from the Mossi population, which probably represents a partially selected stage since it is composed of individuals older than 6 years and, in this case, \hat{f}_{CC} takes a positive value (\hat{f}_{CC} = 0·0004). As a whole, these results do not support the idea that this three-allele polymorphism is at stable equilibrium in the West African populations due to viability selection, since stable equilibrium would require all three homozygotes to present a deficiency with respect to Hardy–Weinberg expectations, as already demonstrated. Obviously, a large number of West African populations must be analysed in order to confirm these results but it is interesting to point out that the analysis of multiallelic deviations from HWP presented here is in accordance with recent evidence based on epidemiological and fitness data which suggests that this three-allele system may be a transient polymorphism in West African populations (Modiano et al., 2001; Hedrick, 2004, 2005, pp. 161–163).

Table 1. Genotype distribution, allele frequencies and Hardy–Weinberg deviations at the β-globin locus in samples from West African populations

a 2 months to 1 year; b 6 years to 12 years; c 21 months to 6 years; d 4 months to 28 months.

* P<0·05; ** P<0·01; *** P<0·001.

The analysis of deviations from HWP for infants (2 months to 1 year) and very young children (4–28 months) shows that their genotypic frequencies are very close to Hardy–Weinberg expectation and, thus, the pattern of deviations from HWP observed in these age groups is very different from that found in adult samples (Table 1): mean values for \hat{f}_{T}, \hat{f}_{AA}, \hat{f}_{SS} and \hat{f}_{CC} are 0·018±0·005, 0·027±0·004, 0·043±0·007 and −0·007±0·004, respectively, in the three samples analysed. These results reveal that the heterozygote excess observed in adult samples is not a consequence of asymmetric allelic contributions of the sexes due to differential selection in the two sexes or to chance, since in this case the heterozygote excess would be present at the zygotic stage (see Section 4). On the contrary, our findings indicate that the heterozygote excess observed in adult samples is probably due to the operation of viability selection. In older children (21 months to 6 years, 6–12 years and >6 years), the pattern of deviations from HWP observed is very close to that seen in the adult samples: mean values for \hat{f}_{T}, \hat{f}_{AA}, \hat{f}_{SS} and \hat{f}_{CC} in the three older-children samples are −0·056±0·016, −0·094±0·018, −0·095±0·027 and −0·013±0·008, respectively. This heterozygote excess is statistically significant by one-sided tests for both \hat{f}_{T} and \hat{f}_{SS} in two of the three samples analysed (Yoruba and Mossi) and for \hat{f}_{AA} in the Mossi sample.
This result is consistent with evidence indicating that differential mortality among genotypes at the β-globin locus due to death from either sickle-cell anaemia or malaria occurs mainly in young children (Allison, 1956; Roberts & Boyo, 1960; Cavalli-Sforza & Bodmer, 1971; Greenwood et al., 1987; Vogel & Motulsky, 1997).

4. Discussion

The effect of viability selection on the distribution of genotypes for a multiallelic polymorphism in a random mating population is effectively identified through the departures of genotype frequencies from Hardy–Weinberg proportions (HWP) (expressions (7) and (8)). Furthermore, a genetic polymorphism for multiple alleles maintained by balancing viability selection will show, at equilibrium, both a global heterozygote excess and a deficiency of each of the homozygotes (expressions (9) and (10)). This pattern of Hardy–Weinberg deviations is a consequence of the relationship between the genotype fitnesses and the mean fitness when the population reaches stable equilibrium, since, at this point, the mean fitness of the population must be higher than the fitness of each homozygote and lower than the weighted mean fitness of heterozygotes (Mandel, 1959; Ginzburg, 1979). This ‘footprint’ of selection on the genotypic distribution may be useful for detecting whether a given multiallele polymorphism is at stable equilibrium in the population due to viability selection, since it can be easily distinguished from other potential causes of deviations from HWP. Inbreeding and subdivision or admixture of populations will give rise to heterozygote deficiency although, under population subdivision for multiple alleles, some particular heterozygote might be in excess due to a positive covariation of allelic frequencies (Nei, 1965; Li, 1969). On the other hand, heterozygote excess can also be caused by differences in allelic frequencies between the sexes. These differences might arise either by chance or by differential selection between the sexes. These two different mechanisms are formally analogous in terms of deviations from HWP, since in both cases the deviation is dependent on the difference in allele frequencies in the two sexes, irrespective of the process generating these differences.
Differences in allelic frequencies between sexes may well arise by chance if the number of parents is small, and will cause an excess of heterozygotes in the progeny (Robertson, 1965). Heterozygote excess as a consequence of asymmetric allelic contributions of the sexes due to differential viability or fertility selection in the two sexes has been characterized for the two-allele case (Bundgaard & Christiansen, 1972; Andresen, 1978) and for multiallelic systems (Purser, 1966; Ziehe & Gregorius, 1981). For multiple alleles, differential allelic contributions from each sex lead to a deficiency of each homozygote and an excess of the sum of all heterozygotes. Therefore, differences in allelic frequencies between sexes will give rise to a pattern of Hardy–Weinberg deviations very similar, at first sight, to that produced by balancing viability selection at equilibrium. There is, however, a striking difference between these two patterns of Hardy–Weinberg deviations as regards the specific stage of the life cycle in which they originate. Thus, differences in allelic frequencies between sexes will produce deviations from HWP apparent at the zygotic stage; in contrast, under viability selection genotypic frequencies at the zygotic stage are expected to show HWP as a consequence of random mating, and the deviations generated by the operation of the viability selection will be mainly observed in the adult phase of the life cycle. Moreover, under differential selection between the sexes, the deviations of genotype frequencies from HWP become very small after several generations and a strong affinity of these frequencies for HWP is observed at the equilibrium (Ziehe & Gregorius, 1981).
In contrast, under a viability selection model, large deviations from HWP may be seen at the stable equilibrium, at least for those genotypes showing larger departures from the mean fitness of the population (expressions (9) and (10)).

Heterozygote excess has been detected for some multiallelic polymorphisms such as the inversion polymorphism in Drosophila (Dobzhansky & Levene, 1948; Ruiz et al., 1986) or the polymorphisms of the human β-globin gene (Cavalli-Sforza & Bodmer, 1971, pp. 161–165) and HLA complex (Hedrick, 1990; Markov et al., 1993; Chen et al., 1999). This excess is thought to be the result of the operation of natural selection. However, analysis of multiallelic deviations from HWP through the estimation of f_ii fixation indices has rarely been carried out for such polymorphisms, since until now there was no reference model to interpret the observed patterns of multiallelic deviations generated by selection. Certainly, analysis of the heterozygote excess associated with adaptive polymorphisms in terms of multiallelic deviations may give valuable information on the mechanism of balancing selection responsible for the maintenance of such polymorphisms. As discussed above, the occurrence of a global heterozygote excess associated with a deficiency of each and every one of the homozygotes (f_ii* < 0, for all i = 1, 2, …, k) is strong evidence suggesting that a multiallelic polymorphism is at equilibrium due to viability selection. Otherwise, when such a pattern of multiallelic deviations is not seen, either the population is not at equilibrium, or some mechanism of balancing selection other than viability selection must be responsible for maintaining the observed multiallelic polymorphism.

References

Allison, A. C. (1956). The sickle-cell and haemoglobin C genes in some African populations. Annals of Human Genetics 21, 67–89.
Andresen, E. (1978). A note on deviation from Hardy–Weinberg proportions due to differences in gene frequencies between parental males and females. Animal Blood Groups and Biochemical Genetics 9, 55–58.
Bailey, N. T. J. (1951). Testing the solubility of maximum likelihood equations in the routine application of scoring methods. Biometrics 7, 268–274.
Brown, A. H. D. (1970). The estimation of Wright's fixation index from genotypic frequencies. Genetica 41, 399–406.
Bundgaard, J. & Christiansen, F. B. (1972). Dynamics of polymorphisms. I. Selection components in an experimental population of Drosophila melanogaster. Genetics 71, 439–460.
Cavalli-Sforza, L. L. & Bodmer, W. F. (1971). The Genetics of Human Populations. San Francisco: Freeman.
Chakraborty, R. & Danker-Hopfe, H. (1991). Analysis of population structure: a comparative study of different estimators of Wright's fixation indices. In Handbook of Statistics (ed. Rao, C. R. & Chakraborty, R.), vol. 8, pp. 203–254. Amsterdam: Elsevier.
Chakraborty, R. & Zhong, Y. (1994). Statistical power of an exact test of Hardy–Weinberg proportions of genotypic data at a multiallelic locus. Human Heredity 44, 1–9.
Chen, J. J., Hollenbach, J. A., Trachtenberg, E. A., Just, J. J., Carrington, M., Rønningen, K. S., Begovich, A., King, M. C., McWeeney, S., Mack, S. J., Erlich, H. A. & Thomson, G. (1999). Hardy–Weinberg testing for HLA class II (DRB1, DQA1, DQB1, and DPB1) loci in 26 human ethnic groups. Tissue Antigens 54, 533–542.
Chen, J. J., Duan, T., Single, R., Mather, K. & Thomson, G. (2005). Hardy–Weinberg testing of a single homozygous genotype. Genetics 170, 1439–1442.
Curie-Cohen, M. (1982). Estimates of inbreeding in a natural population: a comparison of sampling properties. Genetics 100, 339–358.
Dobzhansky, T. & Levene, H. (1948). Genetics of natural populations. XVII. Proof of operation of natural selection in wild populations of Drosophila pseudoobscura. Genetics 33, 537–547.
Elandt-Johnson, R. C. (1971). Probability Models and Statistical Methods in Genetics. New York: Wiley.
Ginzburg, L. R. (1979). Why are heterozygotes often superior in fitness? Theoretical Population Biology 15, 264–267.
Gomes, I., Collins, A., Lonjou, C., Thomas, N. S., Wilkinson, J., Watson, M. & Morton, N. (1999). Hardy–Weinberg quality control. Annals of Human Genetics 63, 535–538.
Greenwood, B. M., Bradley, A. K., Greenwood, A. M., Byass, P., Jammeh, K., Marsh, K., Tulloch, S., Oldfield, F. S. J. & Hayes, R. (1987). Mortality and morbidity from malaria among children in a rural area of The Gambia, West Africa. Transactions of the Royal Society of Tropical Medicine and Hygiene 81, 478–486.
Guo, S. W. & Thompson, E. A. (1992). Performing the exact test of Hardy–Weinberg proportions for multiple alleles. Biometrics 48, 361–372.
Hare, M. P., Karl, S. A. & Avise, J. C. (1996). Anonymous nuclear DNA markers in the American oyster and their implications for the heterozygote deficiency phenomenon in marine bivalves. Molecular Biology & Evolution 13, 334–345.
Hedrick, P. (1990). Evolution at HLA: possible explanations for the deficiency of homozygotes in two populations. Human Heredity 40, 213–220.
Hedrick, P. (2004). Estimation of relative fitnesses from relative risk data and the predicted future of haemoglobin alleles S and C. Journal of Evolutionary Biology 17, 221–224.
Hedrick, P. (2005). Genetics of Populations, 3rd edn. Sudbury, MA: Jones and Bartlett.
Hill, W. G., Babiker, H. A., Ranford-Cartwright, L. C. & Walliker, D. (1995). Estimation of inbreeding coefficients from genotypic data on multiple alleles, and application to estimation of clonality in malaria parasites. Genetical Research 65, 53–61.
Hosking, L., Lumsden, S., Lewis, K., Yeo, A., McCarthy, L., Bansal, A., Riley, J., Purvis, I. & Xu, C. F. (2004). Detection of genotyping errors by Hardy–Weinberg equilibrium testing. European Journal of Human Genetics 12, 395–399.
Karlin, S. (1981). Some natural viability systems for a multiallelic locus: a theoretical study. Genetics 97, 457–473.
Karlin, S. & Feldman, M. W. (1981). A theoretical and numerical assessment of genetic variability. Genetics 97, 475–493.
Lewontin, R. C. & Cockerham, C. C. (1959). The goodness-of-fit test for detecting natural selection in random mating populations. Evolution 13, 561–564.
Lewontin, R. C., Ginzburg, L. R. & Tuljapurkar, S. D. (1978). Heterosis as an explanation for large amounts of genic polymorphism. Genetics 88, 149–170.
Li, C. C. (1959). Notes on relative fitness of genotypes that forms a geometric progression. Evolution 13, 564–567.
Li, C. C. (1969). Population subdivision with respect to multiple alleles. Annals of Human Genetics 33, 23–29.
Li, C. C. & Horvitz, D. G. (1953). Some methods of estimating the inbreeding coefficient. American Journal of Human Genetics 5, 107–117.
Louis, E. J. & Dempster, E. R. (1987). An exact test for Hardy–Weinberg and multiple alleles. Biometrics 43, 805–811.
Mandel, S. P. H. (1959). The stability of a multiple allelic system. Heredity 13, 289–302.
Mandel, S. P. H. (1970). The equivalence of different sets of stability conditions for multiple allelic systems. Biometrics 26, 840–845.
Markov, T., Hedrick, P. W., Zuerlein, K., Danilovs, J., Martin, J., Vyvial, T. & Armstrong, C. (1993). HLA polymorphism in the Havasupai: evidence for balancing selection. American Journal of Human Genetics 53, 943–952.
Marks, R. W. & Spencer, H. G. (1991). The maintenance of single-locus polymorphism. II. The evolution of fitnesses and allele frequencies. American Naturalist 138, 1354–1371.
Modiano, D., Luoni, G., Sirima, B. S., Simporé, J., Verra, F., Konaté, A., Rastrelli, E., Olivieri, A., Calissano, C., Paganotti, G. M., D'Urbano, L., Sanou, I., Sawadogo, A., Modiano, G. & Coluzzi, M. (2001). Haemoglobin C protects against clinical Plasmodium falciparum malaria. Nature 414, 305–308.
Nei, M. (1965). Variation and covariation of gene frequencies in subdivided populations. Evolution 19, 256–258.
Purser, A. F. (1966). Increase in heterozygote frequency with differential fertility. Heredity 21, 322–327.
Roberts, D. F. & Boyo, A. E. (1960). On the stability of haemoglobin gene frequencies in West Africa. Annals of Human Genetics 24, 375–387.
Roberts, D. F. & Boyo, A. E. (1962). Abnormal haemoglobins in childhood among the Yoruba. Human Biology 34, 20–37.
Robertson, A. (1965). The interpretation of genotypic ratios in domestic animal populations. Animal Production 7, 319–324.
Robertson, A. & Hill, W. G. (1984). Deviations from Hardy–Weinberg proportions: sampling variances and use in estimation of inbreeding coefficients. Genetics 107, 703–718.
Rousset, F. & Raymond, M. (1995). Testing heterozygote excess and deficiency. Genetics 140, 1413–1419.
Ruiz, A., Fontdevila, A., Santos, M., Seoane, M. & Torroja, E. (1986). The evolutionary history of Drosophila buzzatii. VIII. Evidence for endocyclic selection acting on the inversion polymorphism in a natural population. Evolution 40, 740–755.
Spencer, H. G. & Marks, R. W. (1988). The maintenance of single-locus polymorphism. I. Numerical studies of a viability selection model. Genetics 120, 605–613.
Spencer, H. G. & Marks, R. W. (1992). The maintenance of single-locus polymorphism. IV. Models with mutation from existing alleles. Genetics 130, 211–221.
Teo, Y. Y., Fry, A. E., Clark, T. G., Tai, E. S. & Seielstad, M. (2007). On the usage of HWE for identifying genotyping errors. Annals of Human Genetics 71, 701–703.
Vogel, F. & Motulsky, A. G. (1997). Human Genetics. Berlin: Springer.
Wittke-Thompson, J. K., Pluzhnikov, A. & Cox, N. J. (2005). Rational inferences about departures from Hardy–Weinberg equilibrium. American Journal of Human Genetics 76, 967–986.
Weir, B. S. (1970). Equilibria under inbreeding and selection. Genetics 65, 371–378.
Weir, B. S. (1996). Genetic Data Analysis II. Massachusetts: Sinauer.
Workman, P. L. (1969). The analysis of simple genetic polymorphisms. Human Biology 41, 97–114.
Xu, J., Turner, A., Little, J., Bleecker, E. R. & Meyers, D. (2002). Positive results in association studies are associated with departure from Hardy–Weinberg equilibrium: hint for genotyping error? Human Genetics 111, 573–574.
Yasuda, N. (1968). Estimation of the inbreeding coefficient from phenotype frequencies by a method of maximum likelihood scoring. Biometrics 24, 915–935.
Ziehe, M. & Gregorius, H. (1981). Deviations of genotypic structures from Hardy–Weinberg proportions under random mating and differential selection between the sexes. Genetics 98, 215–230.
Zou, G. Y. & Donner, A. (2006). The merits of testing Hardy–Weinberg equilibrium in the analysis of unmatched case-control data: a cautionary note. Annals of Human Genetics 70, 923–933.

Table 1. Genotype distribution, allele frequencies and Hardy–Weinberg deviations at the β-globin locus in samples from West African populations