Statistical Power to Detect Genetic and Environmental Influences in the Presence of Data Missing at Random

Eske M. Derks; Conor V. Dolan; Dorret I. Boomsma

doi:10.1375/twin.10.1.159

Published online by Cambridge University Press: 21 February 2012

Eske M. Derks*: Department of Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands.
Conor V. Dolan: Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands.
Dorret I. Boomsma: Department of Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands.
* Address for correspondence: Eske M. Derks, Vrije Universiteit, Department of Biological Psychology, Van der Boechorststraat 1, 1081 BT Amsterdam, the Netherlands.

Abstract

We study the situation in which a cheap measure (X) is observed in a large, representative twin sample, and a more expensive measure (Y) is observed in a selected subsample. The aim of this study is to investigate the optimal selection design in terms of the statistical power to detect genetic and environmental influences on the variance of Y and on the covariance of X and Y. Data were simulated for 4000 dizygotic and 2000 monozygotic twins. Missingness (87% vs. 97%) was then introduced in accordance with 7 selection designs: (i) concordant low + individual high design; (ii) extreme concordant design; (iii) extreme concordant and discordant design (EDAC); (iv) extreme discordant design; (v) individual score selection design; (vi) selection of an optimal number of MZ and DZ twins; and (vii) missing completely at random. The statistical power to detect the influence of additive and dominant genetic and shared environmental effects on the variance of Y and on the covariance between X and Y was investigated. The best selection design is the individual score selection design. The power to detect additive genetic effects is high irrespective of the percentage of missingness or selection design. The power to detect shared environmental effects is acceptable when the percentage of missingness is 87%, but is low when the percentage of missingness is 97%, except for the individual score selection design, in which the power remains acceptable. The power to detect dominant genetic effects (D) is low, irrespective of selection design or percentage of missingness. The individual score selection design is therefore the best design for detecting genetic and environmental influences on the variance of Y and on the covariance of X and Y. However, the EDAC design may be preferred when an additional purpose of a study is to detect quantitative trait loci effects.
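To make the distinction between selection on X and missingness completely at random concrete, the following is a minimal sketch only, not the authors' simulation. The abstract does not define the selection rules, so everything here is a hypothetical stand-in: the function names, the parameter values (`r_twin`, `r_xy`), the data-generating model, and the rule "keep Y only when |X| exceeds a threshold". In particular, Y resembles between twins only through X in this toy model, which is a simplification.

```python
import random


def simulate_pairs(n_pairs, r_twin, r_xy, seed=0):
    """Simulate standardized (x, y) scores for both members of each twin pair.

    r_twin: assumed twin correlation on X; r_xy: assumed within-person
    correlation of X and Y. Both values are illustrative assumptions.
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_pairs):
        shared = rng.gauss(0, 1)  # component of X shared by both twins
        twins = []
        for _ in range(2):
            x = (r_twin ** 0.5) * shared + ((1 - r_twin) ** 0.5) * rng.gauss(0, 1)
            y = r_xy * x + ((1 - r_xy ** 2) ** 0.5) * rng.gauss(0, 1)
            twins.append((x, y))
        pairs.append(twins)
    return pairs


def mask_mcar(pairs, keep_fraction, seed=1):
    """Missing completely at random: keep Y for a random fraction of pairs."""
    rng = random.Random(seed)
    masked = []
    for twins in pairs:
        keep = rng.random() < keep_fraction
        masked.append([(x, y if keep else None) for x, y in twins])
    return masked


def mask_by_x(pairs, threshold):
    """Hypothetical selection on X: keep Y only for individuals with |x| > threshold.

    Missingness depends only on the always-observed X, so Y is missing at random.
    """
    return [[(x, y if abs(x) > threshold else None) for x, y in twins]
            for twins in pairs]


def observed_fraction(masked):
    """Fraction of individuals whose Y is observed."""
    scores = [y for twins in masked for _, y in twins]
    return sum(y is not None for y in scores) / len(scores)
```

Under these assumptions, `mask_by_x(pairs, 1.5)` and `mask_mcar(pairs, 0.13)` both leave Y missing for roughly 87% of individuals, but only the first concentrates the observed Y scores among individuals with extreme X.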

Type: Articles
Information: Twin Research and Human Genetics, Volume 10, Issue 1, 01 February 2007, pp. 159-167

