
Two SNPs Associated With Spontaneous Dizygotic Twinning: Effect Sizes and How We Communicate Them

Published online by Cambridge University Press:  13 July 2016

Hamdi Mbarek*
Affiliation:
Department of Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands; Avera Institute for Human Genetics, Sioux Falls, South Dakota, USA
Conor V. Dolan
Affiliation:
Department of Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands
Dorret I. Boomsma
Affiliation:
Department of Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands; Avera Institute for Human Genetics, Sioux Falls, South Dakota, USA
*Address for correspondence: Hamdi Mbarek, Vrije Universiteit, Dept Biological Psychology, Netherlands Twin Register, Van der Boechorststraat 1, 1081 BT, Amsterdam, the Netherlands. E-mail: [email protected]

Abstract

In a recent GWAS of spontaneous dizygotic twinning, Mbarek et al. (The American Journal of Human Genetics, 2016, Vol 98, pp. 898–908) identified two SNPs, rs11031006 (near FSHB) and rs17293443 (in SMAD3). In the present note, we address the question of how to present the results in terms of effect sizes in a manner that is comprehensible to a general audience (e.g., mothers of twins, newspaper readers). We propose to avoid standard effect sizes such as odds ratios and relative risks, as these require some knowledge of probability theory. Rather, we convey the results in terms of conditional probabilities, expressed in natural language.

Type
Articles
Copyright
Copyright © The Author(s) 2016 

We have brought together unique collections of mothers of dizygotic (DZ) twins, and conducted the first genome-wide association study (GWAS) for spontaneous DZ twinning (Mbarek et al., 2016). We identified the first robust genetic risk variants for DZ twinning: one near FSHB (rs11031006; chr 11p14.1), and a second within SMAD3 (rs17293443; chr 15q22.23). The two signals, which were replicated in a large Icelandic cohort, turned out to have widespread effects on female fertility. In disseminating the results of a GWAS such as Mbarek et al. (2016), one tends to emphasize the genome-wide significant hits. However, communicating information on the effect sizes associated with these hits to the general public is not simple. Standard effect sizes, such as odds ratios (OR) or relative risks (RR), are largely unsuited as they require some familiarity with probability theory. Nevertheless, to rise above the simple ‘gene for ...’ statement, it is desirable to provide a comprehensible statement on effect. The aims of this note are: (1) to present the results of the Mbarek et al. (2016) study in terms of conditional probabilities, and (2) to consider, in the light of these, various effect sizes based on these results, given the aim of communicating the results to the general population.

Derived Genotype — Twinning Probabilities

To ease presentation, we denote the SNPs FB (rs11031006) and S3 (rs17293443). Based on Mbarek et al. (2016), the risk allele frequencies (RAFs) in Iceland are 0.85 (FB; risk allele G, alternative allele A) and 0.24 (S3; risk allele C, alternative allele T). We use these values as they coincide with the meta-analytic estimates (0.849 and 0.236), and there is very little difference in RAF between the samples in Mbarek et al. (2016) (Netherlands: 0.841 and 0.239; Australia: 0.863 and 0.239; United States, Minnesota: 0.846 and 0.233). The RRs associated with these risk alleles, given the additive genetic association model, are 1.18 (FB) and 1.09 (S3). These results were obtained in the Icelandic sample (i.e., the replication sample in Mbarek et al., 2016), and we assume here that they hold in the Dutch population. The prevalence of spontaneous DZ twinning in the Dutch population displays considerable variation between 1904 and 2011 (Glasner et al., 2013; see also Pison et al., 2015). We used the most recent prevalence estimate reported by Glasner et al. (2013), that is, ~1.07% in 2011. The loci FB and S3 are unlinked (located on chromosomes 11 and 15) and, we assumed, in gametic phase equilibrium (Wray and Visscher, 2007). From this information and the assumptions mentioned, we constructed the genotype by twinning probability tables (Tables 1 and 2; for details, see the Appendix; the R scripts used are available as supplemental material).
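As an illustration of how a single-SNP genotype by twinning table can be derived from these inputs, the following Python sketch computes the genotype frequencies and the conditional twinning probabilities from the RAF, the per-allele RR, and the prevalence. It rests on two assumptions that are ours for the purpose of the sketch: Hardy-Weinberg proportions, and a per-allele RR that multiplies the twinning probability. The baseline probability is solved in closed form rather than by least squares, the function and variable names are illustrative, and the code is not the authors' R script.

```python
def genotype_table(raf, rr, prevalence):
    """Return rows (n_risk_alleles, genotype_freq, P(DZ | genotype))."""
    p, q = raf, 1.0 - raf
    # Assumed Hardy-Weinberg genotype frequencies for 0, 1 and 2 risk alleles.
    freqs = [q * q, 2.0 * p * q, p * p]
    # Assumed model: each risk allele multiplies the twinning probability by rr.
    # The baseline (zero risk alleles) is chosen so that the frequency-weighted
    # mean of the conditional probabilities equals the prevalence.
    baseline = prevalence / sum(f * rr ** k for k, f in enumerate(freqs))
    return [(k, f, baseline * rr ** k) for k, f in enumerate(freqs)]


PREVALENCE = 0.0107  # 2011 Dutch spontaneous DZ twinning probability
fb = genotype_table(raf=0.85, rr=1.18, prevalence=PREVALENCE)
s3 = genotype_table(raf=0.24, rr=1.09, prevalence=PREVALENCE)

for name, table in (("FB", fb), ("S3", s3)):
    for k, freq, p_dz in table:
        joint = freq * p_dz  # P(genotype and DZ), one cell of the table
        print(f"{name} risk alleles={k} freq={freq:.4f} "
              f"P(DZ|genotype)={p_dz:.5f} P(genotype & DZ)={joint:.6f}")
```

Summing the joint probabilities over the three genotypes of either SNP returns the prevalence, which is a convenient check on the construction.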

Table 1. FSHB (FB) and SMAD3 (S3) genotypes by spontaneous twinning probabilities. Row 9 and column 5 contain the marginal probabilities (italics; .0107 is the 2011 Dutch spontaneous DZ twinning probability). As indicated, the genotype frequencies are a function of the risk allele frequency (denoted pFB and pS3; qFB = 1 - pFB and qS3 = 1 - pS3).

Table 2. FSHB (FB) and SMAD3 (S3) haplotypes by spontaneous twinning probabilities. Row 11 and column 5 contain the marginal probabilities (italics; note: .0107 is the 2011 Dutch spontaneous DZ twinning probability). As indicated, the genotype frequencies are a function of the risk allele frequencies (denoted pFB and pS3; qFB = 1 - pFB and qS3 = 1 - pS3).

Effect Sizes: Odds Ratios and Relative Risks

ORs and RRs are standard measures of effect size, given a binary outcome. These measures are shown in Table 3 (separately for FB and S3) and in Table 4 (haplotypes). The values were calculated relative to the zero risk allele genotypes and haplotype. Note that the ORs and RRs are similar because the prevalence of the outcome is low (i.e., 0.0107). We have also included the probabilities of a spontaneous DZ birth given (conditional on) the genotype (Table 3) and haplotype (Table 4), as we propose to focus on these probabilities in communicating the effects.
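The similarity of ORs and RRs at low prevalence can be checked numerically. The Python sketch below computes both measures from a pair of conditional probabilities. The input probabilities are round, illustrative numbers chosen by us (they are not the entries of Table 3), and the function names are hypothetical.

```python
def odds(p):
    """Convert a probability into odds."""
    return p / (1.0 - p)


def relative_risk(p_risk, p_ref):
    """Ratio of two conditional probabilities."""
    return p_risk / p_ref


def odds_ratio(p_risk, p_ref):
    """Ratio of the odds implied by two conditional probabilities."""
    return odds(p_risk) / odds(p_ref)


# Illustrative inputs only: the same relative risk of 1.18 applied to a
# rare outcome (baseline 0.8%) and to a common outcome (baseline 40%).
rare_ref, rare_risk = 0.008, 0.008 * 1.18
common_ref, common_risk = 0.40, 0.40 * 1.18

rr_rare = relative_risk(rare_risk, rare_ref)
or_rare = odds_ratio(rare_risk, rare_ref)
rr_common = relative_risk(common_risk, common_ref)
or_common = odds_ratio(common_risk, common_ref)

print(f"rare outcome:   RR={rr_rare:.3f} OR={or_rare:.3f}")
print(f"common outcome: RR={rr_common:.3f} OR={or_common:.3f}")
```

For the rare outcome the two measures agree to about two decimal places, whereas for the common outcome the OR is clearly larger than the RR.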

TABLE 3 Conditional Probabilities Prob(DZ|genotype) and RRs and ORs Associated With FB (Reference A;A) and S3 Genotypes (Reference T;T)

These results are based on the probabilities given in Table 1.

TABLE 4 Allelic ORs and RRs Associated With the Two SNPs

The ORs and RRs associated with the haplotypes (reference A;A and T;T). These results are based on the probabilities given in Table 2.

How to Convey These Results to a General Audience

Effect sizes expressed in terms of ORs and RRs pose no problem for (genetic) epidemiologists. However, as their interpretation requires knowledge of probability theory, we consider them unsuited for the general readership of newspapers and other media. To convey these effect sizes in simple terms, it is desirable to avoid terms like ‘odds’, ‘odds ratio’, ‘conditional probabilities’, and ‘RR’. We propose to present the combined effects of FB and S3 as follows: ‘In the Dutch population the probability of spontaneous DZ twinning is 10.7 per 1,000 births. If the risk alleles were absent, this would be 7.76 per 1,000 births. If all females carried all four risk alleles, this would be 12.71 per 1,000 births’. This is an attempt to express, in natural language, the effect size in terms of the conditional probabilities associated with the double homozygotes (FB(A;A) & S3(T;T), and FB(G;G) & S3(C;C)). Note that the value 10.7 is based on the 2011 Dutch spontaneous twinning probability (i.e., 0.0107; Glasner et al., 2013), and the values 7.76 and 12.71 are prob(DZ|{FB(A;A) & S3(T;T)}) and prob(DZ|{FB(G;G) & S3(C;C)}), respectively, expressed per 1,000 births (see Table 4, column 5). Similar statements can be formulated for FB and S3 in isolation.
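The translation from conditional probabilities to such a statement is mechanical, as the following Python sketch shows. It takes the three probabilities quoted above as inputs; the function names and the exact sentence template are ours and are meant only as an illustration.

```python
def per_thousand(probability):
    """Express a probability as a count per 1,000, without trailing zeros."""
    return f"{probability * 1000:.2f}".rstrip("0").rstrip(".")


def plain_language_statement(population, p_overall, p_no_risk, p_all_risk):
    """Build a natural-language effect statement from three probabilities."""
    return (
        f"In the {population} population the probability of spontaneous DZ "
        f"twinning is {per_thousand(p_overall)} per 1,000 births. "
        f"If the risk alleles were absent, this would be "
        f"{per_thousand(p_no_risk)} per 1,000 births. "
        f"If all females carried all four risk alleles, this would be "
        f"{per_thousand(p_all_risk)} per 1,000 births."
    )


# Inputs are the values quoted in the text: the 2011 Dutch prevalence and the
# conditional probabilities for the two double homozygotes.
statement = plain_language_statement("Dutch", 0.0107, 0.00776, 0.01271)
print(statement)
```

The same function can be reused for FB or S3 in isolation by passing the corresponding single-SNP conditional probabilities and adjusting the wording of the template.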

We believe that the expression (in natural language) of effect size relating to a continuous phenotype is relatively simple, as one can avoid statistical terms like ‘variance explained’ and ‘correlation’ by relating the allelic effect directly to the scale used to measure the phenotype. However, such statements of effect size remain natural language expressions concerning conditional distributions (rather than conditional probabilities). For instance, Loos and Yeo (2014), in their discussion of the effects of the FTO locus, related the risk allele to a 0.39 kg/m² increase in body mass index and to an increase in weight of 1,130 g for a person of 1.70 m.
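The conversion from body mass index to weight in this example follows directly from the definition of the body mass index (weight in kg divided by the square of height in m). A minimal Python sketch of the arithmetic, with a hypothetical function name, is:

```python
def weight_change_kg(bmi_change, height_m):
    """Weight change implied by a BMI change at a fixed height.

    BMI = weight / height**2, so at constant height the weight change
    equals the BMI change multiplied by height squared.
    """
    return bmi_change * height_m ** 2


delta_kg = weight_change_kg(bmi_change=0.39, height_m=1.70)
print(f"{delta_kg:.3f} kg")
```

The result is about 1.13 kg, in line with the rounded figure of 1,130 g quoted above.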

DZ twinning is obviously a polygenic trait. The polygenic risk score results in Mbarek et al. (2016) reflect the polygenic contribution to the susceptibility to DZ twinning and its association with greater reproductive ability. Revealing more signals associated with this trait will provide a clearer picture of the risk prediction for having DZ twins.

Acknowledgments

Support for the Netherlands Twin Register was obtained from the Netherlands Organization for Scientific Research (NWO) and The Netherlands Organization for Health Research and Development (ZonMW) grants 904-61-193, 480-04-004, 400-05-717, Addiction-31160008, 911-09-032, Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL, 184.021.007); Royal Netherlands Academy of Science Professor Award (PAH/6635) to DIB; European Research Council (ERC-230374 and ERC-284167); Rutgers University Cell and DNA Repository (NIMH U24 MH068457-06); the Avera Institute, Sioux Falls, South Dakota (USA); and the National Institutes of Health (NIH R01 HD042157-01A1). Part of the genotyping was funded by the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health and Grand Opportunity grants 1RC2 MH089951. We acknowledge support from VU Amsterdam and the Institute for Health and Care Research.

Conflict of Interest

None.

Ethical Standards

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.

Supplementary Material

To view supplementary material for this article, please visit http://dx.doi.org/10.1017/thg.2016.53.

Appendix

In the following Table A1 (based on FSHB), the marginal probabilities, that is, the genotype probabilities (a+b, c+d, e+f) and the spontaneous DZ twinning probability (a+c+e), are given in italics. In addition, given the additive model, we have (c/(c+d))/(e/(e+f)) = (a/(a+b))/(c/(c+d)) = RR (relative risk). The ORs (odds ratios), with FB(A;A) as the reference, are (c/d)/(e/f) and (a/b)/(e/f).

TABLE A1 Genotype By Twinning Probability

Based on this information, we calculated the probabilities, denoted a to f in Table A1, by simple least squares in R (the R code is available as supplemental material). The haplotype by twinning probability table can be obtained in the same manner. However, it is easier to first construct Table A1 for each SNP (see Table 1), to calculate prob(FB genotype | DZ outcome) and prob(S3 genotype | DZ outcome), and then to use these to approximate prob({FB genotype and S3 genotype} | DZ outcome) as prob(FB genotype | DZ outcome)*prob(S3 genotype | DZ outcome). This approximation is good, because the effect sizes are relatively small. Given these conditional probabilities and prob(DZ outcome), we applied Bayes’ theorem to obtain prob(DZ outcome | {FB genotype and S3 genotype}). Subsequently, given these and the haplotype probabilities (which depend only on the allele frequencies), we calculated the 18 entries of the haplotype x DZ twinning table (see Table A2). The R code we used to obtain the entries in Table A2 is available as supplemental material.
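The sequence of steps described above (single-SNP tables, product approximation conditional on a DZ outcome, Bayes' theorem) can be sketched in Python as follows. This is an illustration under our own assumptions, not the authors' R code: the single-SNP probabilities are obtained in closed form under Hardy-Weinberg proportions and a multiplicative per-allele RR rather than by least squares, and all names are hypothetical. Under these assumptions the conditional probabilities for the two double homozygotes come out close to, though not identical with, the published 7.76 and 12.71 per 1,000.

```python
from itertools import product

PREVALENCE = 0.0107  # prob(DZ outcome), 2011 Dutch estimate
SNPS = {"FB": (0.85, 1.18), "S3": (0.24, 1.09)}  # (RAF, per-allele RR)


def single_snp(raf, rr, prevalence):
    """Genotype frequencies and P(DZ | genotype) for 0, 1, 2 risk alleles."""
    p, q = raf, 1.0 - raf
    freqs = [q * q, 2.0 * p * q, p * p]  # assumed Hardy-Weinberg proportions
    baseline = prevalence / sum(f * rr ** k for k, f in enumerate(freqs))
    return freqs, [baseline * rr ** k for k in range(3)]


def joint_table(prevalence=PREVALENCE):
    """Rows: (FB alleles, S3 alleles, P(combo & DZ), P(combo & not DZ), P(DZ | combo))."""
    f_fb, c_fb = single_snp(*SNPS["FB"], prevalence)
    f_s3, c_s3 = single_snp(*SNPS["S3"], prevalence)
    rows = []
    for i, j in product(range(3), range(3)):
        # Bayes: prob(genotype | DZ) = prob(DZ | genotype) * prob(genotype) / prob(DZ)
        fb_given_dz = c_fb[i] * f_fb[i] / prevalence
        s3_given_dz = c_s3[j] * f_s3[j] / prevalence
        # Approximation: the two genotypes are treated as independent given DZ.
        both_given_dz = fb_given_dz * s3_given_dz
        # Unlinked loci in gametic phase equilibrium: frequencies multiply.
        freq = f_fb[i] * f_s3[j]
        # Bayes again: prob(DZ | both genotypes)
        p_dz = both_given_dz * prevalence / freq
        rows.append((i, j, freq * p_dz, freq * (1.0 - p_dz), p_dz))
    return rows


table = joint_table()
for i, j, joint_dz, joint_not_dz, p_dz in table:
    print(f"FB={i} S3={j} P(&DZ)={joint_dz:.6f} "
          f"P(&notDZ)={joint_not_dz:.6f} per 1,000={p_dz * 1000:.2f}")
```

The nine rows with two joint probabilities each give the 18 table entries; their DZ column sums to the prevalence and all entries together sum to one.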

TABLE A2 Approximate FSHB (FB) and SMAD3 (S3) Haplotypes By Spontaneous Twinning Probabilities

Footnotes

The last row contains the marginal spontaneous DZ twinning probability (in italics; 0.0107). Column 5 contains the conditional probabilities prob(DZ|haplotype). As expected, these closely resemble those given in Table 4 (column 5).

References

Glasner, T. J., van Beijsterveldt, C. E. M., Willemsen, G., & Boomsma, D. I. (2013). Meerlinggeboorten in Nederland [Multiple births in the Netherlands]. Nederlands Tijdschrift voor Geneeskunde, 57, A5962.
Loos, R. J. F., & Yeo, G. S. H. (2014). The bigger picture of FTO – the first GWAS-identified obesity gene. Nature Reviews Endocrinology, 10, 51–61.
Mbarek, H., Steinberg, S., Nyholt, D. R., Gordon, S. D., Miller, M. B., McRae, A. F., ... Boomsma, D. I. (2016). Identification of common genetic variants influencing spontaneous dizygotic twinning and female fertility. The American Journal of Human Genetics, 98, 898–908.
Pison, G., Monden, C., & Smits, J. (2015). Twinning rates in developed countries: Trends and explanations. Population and Development Review, 41, 629–649.
Wray, N., & Visscher, P. (2007). Population genetics and its relevance to gene mapping. In Neale, B., Ferreira, M., Medland, S., & Posthuma, D. (Eds.), Statistical genetics: Gene mapping through linkage and association (pp. 87–110). London: Taylor and Francis.