The beneficial effects of physical exercise on health and quality of life are well documented. For example, a higher level of physical exercise is associated with a lower risk of developing obesity (Hills et al., Reference Hills, Andersen and Byrne2011), cardiovascular disease (Wannamethee & Shaper, Reference Wannamethee and Shaper2001), type 2 diabetes (Aune et al., Reference Aune, Norat, Leitzmann, Tonstad and Vatten2015), mental health problems (Grasdalsmoen et al., Reference Grasdalsmoen, Eriksen, Lønning and Sivertsen2020) and various cancers (Brown et al., Reference Brown, Winters-Stone, Lee and Schmitz2012). Additionally, physical exercise has positive effects on psychological wellbeing (Bartels et al., Reference Bartels, de Moor, van der Aa, Boomsma and de Geus2012), academic performance in adolescents and young adults (Aaltonen et al., Reference Aaltonen, Latvala, Jelenkovic, Rose, Kujala, Kaprio and Aaltonen2020) and cognition in the elderly (Iso-Markku et al., Reference Iso-Markku, Waller, Vuoksimaa, Vähä-Ypyä, Lindgren, Heikkilä and Kujala2018). Despite these well-known benefits of physical exercise, during the past decades, the prevalence of insufficient physical activity has increased worldwide (Guthold et al., Reference Guthold, Stevens, Riley and Bull2018). Thus, it is important to investigate determinants of individual differences in exercise behavior in various populations for development of optimal exercise intervention strategies and policy guidelines for global health promotion.
Previous twin studies have shown that genetic factors influence exercise behavior, even though the estimates of genetic influences have varied substantially by measure, age and sex (de Geus et al., Reference de Geus, Bartels, Kaprio, Lightfoot and Thomis2014). In Dutch twin studies of physical exercise (de Moor et al., Reference de Moor, Willemsen, Rebollo-Mesa, Stubbe, de Geus and Boomsma2011; van der Aa et al., Reference van der Aa, de Geus, van Beijsterveldt, Boomsma and Bartels2010; Vink et al., Reference Vink, Boomsma, Medland, de Moor, Stubbe, Cornes and de Geus2011), genetic influences were small in childhood (20%−40%) but increased from childhood to adolescence (38%−85%) and then decreased in adulthood (about 40%). In contrast, common environmental influences were substantial in childhood (50%−75%) but mostly nonsignificant afterward. Individual environmental influences were generally small during childhood and adolescence (<28%) but became increasingly large (about 60%) with age in adulthood. In a Finnish longitudinal twin study of physical exercise, Aaltonen et al. (Reference Aaltonen, Ortega-Alonso, Kujala and Kaprio2013) found that in both sexes the heritability of leisure-time physical activity remained significant (43% to 52%) during adolescence (16−18 years) but reduced to approximately 30% in young adulthood (about 24 years). In contrast, common environmental influences increased from adolescence (18% to 26%) to young adulthood (43% to 49%). Individual environmental influences remained relatively stable across adolescence and young adulthood, ranging from 20% to 30% in both sexes. The GenomeEUtwin project (Stubbe et al., Reference Stubbe, Boomsma, Vink, Cornes, Martin, Skytthe and de Geus2006; Vink et al., Reference Vink, Boomsma, Medland, de Moor, Stubbe, Cornes and de Geus2011), which pooled adult twin data from seven European countries (Australia, Denmark, Finland, the Netherlands, Norwey, Sweden and the UK), revealed that genetic influences on physical activity ranged from 48% to 71% across the countries (Stubbe et al., Reference Stubbe, Boomsma, Vink, Cornes, Martin, Skytthe and de Geus2006) and that heritability reduced from about 70% to 40% with increasing age from 19 to 50 years, whereas individual environmental influences increased from about 30% to 60% in four of the seven countries. With a few exceptions, common environmental influences were absent across the adulthood (Vink et al., Reference Vink, Boomsma, Medland, de Moor, Stubbe, Cornes and de Geus2011).
Sex differences in genetic and environmental influences on physical exercise were significant in some studies (e.g. Carlsson et al., Reference Carlsson, Andersson, Lichtenstein and Michaelsson2006; Maia et al., Reference Maia, Thomis and Beunen2002) but not in others (e.g. van der Aa et al., Reference van der Aa, de Geus, van Beijsterveldt, Boomsma and Bartels2010; Vink et al., Reference Vink, Boomsma, Medland, de Moor, Stubbe, Cornes and de Geus2011). Whenever significant differences were found, males tended to have higher genetic and lower common environmental influences than did females. For example, van der Aa et al. (Reference van der Aa, de Geus, van Beijsterveldt, Boomsma and Bartels2010) found that 85% and 38% of variability in leisure time physical activity at ages 13−14 years were explained by genetic factors for boys and girls, respectively, only in girls were 46% accounted for by common environmental influences. In another example, a Portuguese twin sample aged from 12 to 25 years showed that genetic influences were higher (63% vs. 32%), and common environmental influences were lower (0% vs. 38%) in males than in females, whereas individual environmental influences were roughly similar (37% vs. 30%) in two sexes (Maia et al., Reference Maia, Thomis and Beunen2002).
Genetic and environmental influences on physical activity in Asians remain poorly understood because most twin studies of exercise to date have investigated American and European samples. In a large sample of Chinese adult same-sex twins (age >18 years, N = 19,308), Wang et al. (Reference Wang, Gao, Lv, Yu, Wang, Pang and Li2016) found that individual difference in vigorous physical activity defined as 150 minutes of moderate or high-intensity physical activity per week was almost completely explained by common environmental influences in both sexes (95% in males, 98% in females) and that genetic influences were inconsequential (<3% in both sexes). These results sharply contrasted those found in most Western twin samples. Discrepancies in findings cannot be attributed to measurement differences because, like many Western twin studies, the Wang et al. study employed the questionnaire method. Differences in race/ethnic differences in physical fitness and exercise abilities might have contributed to the discrepancies in results. Thus, it is important to further explore genetic and environmental influences on physical activity in Asian samples which have genetic backgrounds similar to Chinese twins. If Asians are vastly different from Westerners in genetic and environmental etiologies of physical activity, then ethnically sensitive exercise intervention strategies and guidelines might be needed for effective health promotion. Thus, the present study investigated genetic and environmental influences on frequency of vigorous exercise (FVE) in South Korean adolescent and young adult twins, with a special emphasis on sex effects.
Methods
Sample
The total sample included 1757 adolescent and young adult twins (mean age = 19.05 years, SD = 3.01 years) identified from the South Korean Twin Registry (SKTR; Hur et al., Reference Hur, Kang, Jeong, Kang and Kim2019). The SKTR is a nationwide volunteer registry of South Korean preschooler to young adult twins and their families. Adolescent twins in the SKTR were collected from schools throughout South Korea, whereas young adult twins were from Facebook, twin clubs and colleges throughout South Korea. Various psychological assessments have been administered via telephone and mail surveys to twins registered with the SKTR. Zygosity of the twins in the SKTR was assessed through three items asking twins’ physical similarity and frequency of confusion by family members, close friends and strangers (Ooki et al., Reference Ooki, Yamada and Asaka1993).
The total sample consisted of 368 monozygotic male (MZM) twins (155 complete pairs and 58 twin individuals), 173 dizygotic male (DZM) twins (67 complete pairs and 39 twin individuals), 682 monozygotic female (MZF) twins (312 complete pairs and 58 twin individuals), 274 dizygotic female (DZF) twins (117 complete pairs and 40 twin individuals) and 260 opposite-sex dizygotic (OSDZ) twins (119 complete pairs and 22 twin individuals). Larger numbers of MZ than DZ twins likely reflected the twin birth rates in the South Korean population for the birth cohorts in the present study (Hur & Kwon, Reference Hur and Kwon2005; Hur & Song, Reference Hur and Song2009).
Measures
During the SKTR telephone interview, twins were asked how often they engaged in vigorous physical exercise. Interviewers explained ‘vigorous exercise’ as that intense enough to cause rapid breathing and/or sweating. There were four response categories (0 = no exercise, 1 = 1−2 times per week, 2 = 3−4 times per week 3 = 5 times or more per week). Interviewers also asked twins who chose the response category of 1 or higher if they exercised more or less than 30 minutes each time, and 83% reported that they exercised a minimum of 30 minutes or more. The remaining 17% reported that they exercised less than 30 minutes each time.
Statistical Analysis
Data analyses were conducted in the following steps. First, descriptive analyses examined relationships of FVE with sex, age and zygosity. As twins are not an independent sample, phenotypic analyses were performed using one randomly chosen member of each twin pair in descriptive analyses. Second, as a preliminary step examining genetic, common environmental and individual environmental influences on FVE, tetrachoric correlations with 95% confidence intervals (CI) for each type of twins were computed. Finally, a scalar sex-limitation model extended to include age as a moderator was fitted to data to estimate genetic and environmental contributions to individual differences in FVE in males and females and to investigate whether these contributions change with age during adolescence and young adulthood.
Because FVE is an ordinal variable, a liability threshold model was used to analyze twin data. The basic assumption underlying the liability threshold model is that a latent continuous liability underlies the distribution of the ordinal variable. The liability is assumed to have standard normal distribution with a mean of zero and standard deviation of 1.0. To model the four categories of vigorous exercise in the present study, three thresholds were required. The thresholds, expressed in z values, are defined by the prevalence of the four distinct categories of vigorous exercise frequencies. Using the raw data, MZ and DZ twin tetrachoric correlations were computed. As MZ twins share 100% of their segregating genes, but DZ twins, on average, share only 50%, higher MZ than DZ twin correlation indicates the presence of additive genetic effects (i.e. the sum of the average effect of all alleles that influence a trait). On the other hand, DZ correlation higher than half the MZ correlation indicates the presence of common environmental effects (i.e. those environmental factors shared between the two members of a twin pair).
A path diagram of the scalar sex-limitation model incorporating the age moderator is shown in Figure 1. In the model, the total variance of FVE was divided into additive genetic (A), common environmental (C) and individual environmental effects (E; i.e. those environmental factors unique to each member of a twin pair and measurement error). Correlations for additive genetic effects for MZ and DZ twins were fixed at 1.0 and .5, respectively, for each sex, whereas common environmental effects were fixed at 1.0 for both MZ and DZ twins. Individual environmental effects were uncorrelated between the two members of a twin pair, regardless of zygosity. The moderator variable, age (M) was transformed to a z score separately for males and females, and the main effect of age (z score) was incorporated on the thresholds for males and females (µ + βmM, µ + βfM). In addition, the model allowed the standard paths, A, C and E on the FVE to vary as a function of age (z score) and sex. Thus, the phenotypic variance (Vp) for each sex in the full age-moderation model can be expressed as Vp = (a + βaM)2 + (c + βcM)2 + (e + βeM)2. In this equation, βa, βc and βe represent the magnitude of the moderating effects related to A, C and E, respectively. Quantitative sex differences were tested by allowing the magnitudes of A, C and E effects to differ between males and females. In addition, the correlation between common environmental effects in OSDZ twin pairs was allowed to be less than 1.0, enabling estimation of qualitative sex difference in common environmental effects (i.e. partly different common environments operate in males and females). To determine the best-fitting, most parsimonious model, various constraints were imposed on the full model, and the resulting nested models were compared with the full model by the log likelihood ratio test (LRT) and Akaike’s information criteria (AIC). LRT evaluates the difference in minus two times the log likelihood (-2LL) between the full model and its nested model, which is asymptotically distributed as χ2 with degrees of freedom (df) equal to the two model’s difference in the degrees of freedom. Models with fewer parameters are preferred if they do not cause a significant deterioration of fit. Furthermore, the model with lowest AIC was considered the best-fitting because lower AIC value indicates better balance between parsimony and goodness-of-fit (Akaike, Reference Akaike1987). Statistical software package, Mx (Neale et al., Reference Neale, Boker, Xie and Maes2003) was used to conduct twin tetrachoric correlation and model-fitting analysis.
Results
Descriptive Statistics
Sex differences in frequency of vigorous exercise
Figure 2 shows sex differences in FVE in the total sample. Pearson chi-square tests showed significant sex differences in the total sample (χ2 (3) = 119.7, p < .001), indicating that males engage in vigorous exercise more frequently than females. Post hoc tests showed that frequency of ‘no exercise’ was significantly (p < .05) higher in females than in males. In contrast, frequencies of the two highest categories (i.e. 3−4 times per week, 5 times per week or more) were significantly (p < .05) higher in males than in females. Frequencies of the modest exercise category (i.e. 1−2 times per week) did not differ significantly between the two sexes.
The relationship between age and frequency of vigorous exercise
The relationship between age and FVE was modest but significant in both sexes. The correlations were −.12 (p < .01) in males and −.11 (p < .01) in females, suggesting that in both sex groups, twins exercise less with increasing age.
Frequency of vigorous exercise by zygosity
A Pearson chi-square test showed no significant difference between MZ and DZ twins (χ2 (3) = 2.3, p < .51), fulfilling the assumption for twin analysis. Frequencies for MZ and DZ twins were, respectively, 37% and 34% for the first response category (i.e. no exercise), 34% and 31% for the second response category (i.e. 1−2 times per week), 18% and 21% for the third response category (i.e. 3–4 times per week) and 12% and 14% for the final category (i.e. 5 times or more per week).
Twin Correlations
Figure 3 displays tetrachoric twin correlations and their 95% confidence intervals (CI) for the five zygosity groups. MZM twin correlation (r = .40, 95% CI [.22, .55]) was significant and greater than DZM twin correlation (r = .12, 95% CI [−.17, .38]), suggesting the presence of genetic influences on vigorous exercise in males. In contrast, the MZF correlation (r = .44, 95% CI [.32, .54]) was similar to the DZF correlation (r = .45, 95% CI [.27, .60]), suggesting the presence of common environmental influences. The OSDZ twin correlation (r = .12. 95% CI [−.09, .31]) was lower than the DZF correlation but higher than the DZM correlation. These patterns of twin correlations suggested that genetic influences were important for individual differences in vigorous exercise in males, whereas common environmental influences were significant in females. MZM and MZF twin correlations were much lower than unity, indicating that individual environmental influences are high in both males and females.
Model Fitting
Table 1 presents results of model-fitting analysis. When thresholds were equated across the two sexes from the full model, a significant difference in chi-square occurred (χ2 (3) = 149.63, p = .00), confirming sex differences in FVE (model 2). In model 3, qualitative sex difference in common environmental influences was tested. But the resulting chi-square difference was nonsignificant (χ2 (1) = .11, p = .73). In model 4, dropping all age moderators from both sexes did not significantly deteriorate model fit (χ2 (7) = 10.34, p = .17), suggesting that additive genetic, common environmental and individual environmental variances of FVE do not significantly change with age during adolescence and young adulthood. In model 5, additive genetic and common and individual environmental variances were equated across two sexes, which significantly worsened model fit (χ2 (10) = 20.93, p = .02). These results suggested that males and females differed in magnitudes of additive genetic and common and individual environmental variances. In model 6, common environmental variances were removed from both sex groups, whereas in model 7, additive genetic variances were eliminated. In both models, marginally significant chi-square differences occurred (χ2(9) = 16.43, p = .05 for model 6; χ2(9) = 17.46, p = .04 for model 7). However, a clearly nonsignificant chi-square difference (χ2(9) = 11.86, p = .22) was found in model 8, in which common environmental influences were removed from males and additive genetic influences, from females. This suggested preference of model 8 over models 6 or 7. Indeed, among all models tested, model 8 had the lowest AIC value. Thus, model 8 was the best-fitting, most parsimonious according to LRT and AIC. In the best-fitting model (Figure 4), additive genetic and individual environmental influences were, respectively, .35 (95% CI [.26, .39]) and .65 (95% CI [.61, .74]) in males, and common and individual environmental influences were, respectively, .45 (95% CI [.35, .53]) and .55 (95% CI [.47, .65]).
-2LL, -2 log likelihood; A, additive genetic influences; C, common environmental influences; E, individual environmental influences including measurement error. The best-fitting model is indicated in bold type.
Discussion
Consistent with prior Western and South Korean singleton samples (e.g. Cho et al., Reference Cho, Lee and Kim2014; Sallis et al., Reference Sallis, Prochaska and Taylor2000), this study showed that the prevalence of vigorous exercise was higher in males than in females. This study further showed that sex difference was driven largely by differences in the extreme groups (i.e. the no exercise group and the two high exercise groups); the modest exercise group showed no significant sex difference. These findings suggest that intervention should focus on no exercisers and intensive exercisers to reduce the gender gap in physical activities.
The present study showed that FVE declined with age in both sexes, which was also in line with many previous studies (e.g. van der Aa et al., Reference van der Aa, de Geus, van Beijsterveldt, Boomsma and Bartels2010). South Korean adolescents are obliged to participate in 1−4 hours of physical education class at school per week, and the hours of compulsory physical education class are lower in higher grades than in lower grades. Thus, the present findings of age-related decline in FVE likely reflected the decrease of compulsory physical education at school with increasing age. In young adults, the transition from student to work life occurs, which may also increasingly constrain the time to engage in vigorous physical exercise.
The finding that genetic influences were higher and common environmental influences were lower in males than in females agreed with results of several previous twin studies based on Western samples (Carlsson et al., Reference Carlsson, Andersson, Lichtenstein and Michaelsson2006; Maia et al., Reference Maia, Thomis and Beunen2002; van der Aa et al., Reference van der Aa, de Geus, van Beijsterveldt, Boomsma and Bartels2010), but contrasted with results from the Wang et al. (Reference Wang, Gao, Lv, Yu, Wang, Pang and Li2016) study based on Chinese twins. The present study differed from the Wang et al. study in age of sample (12.42−29.42 years vs. 18 years or older) and number of thresholds (1 vs. 3) in the measure of vigorous physical exercise. To make the measure of this study more comparable to that in the Chinese twin study, the data were reanalyzed using a single threshold model (dichotomy) with a cut-point of FVE of five times or more per week, corresponding to about 150 minutes or more activity per week as in the Chinese twin study. The best-fitting model still showed significant genetic influences in males (49%) and significant common environmental influences (50%) in females with the remaining variances attributable to individual environmental influences including measurement error (51% in males, 50% in females) in both sexes. Age differences between the Chinese and the present sample might have contributed to the differing outcomes to some degree. However, further exploration of genetic and environmental etiologies of physical activity in Asians is necessary to draw a firm conclusion.
Previous research has suggested that personality traits (e.g. sensation-seeking) (Sallis et al., Reference Sallis, Prochaska and Taylor2000), self-efficacy (Lee et al., Reference Lee, Park, Lee, Kim and Park2018), affective response to exercise (Schutte et al., Reference Schutte, Nederend, Hudziak, Bartels and de Geus2017), physical fitness traits (e.g. muscle strength, muscle endurance and aerobic power) (Bauman et al., Reference Bauman, Reis, Sallis, Wells, Loos and Martin2012) are key factors in maintaining regular physical exercise. Because all of these traits have been shown to be highly heritable (e.g. Polderman et al., Reference Polderman, Benyamin, de Leeuw, Sullivan, van Bochoven, Visscher and Posthuma2015), genes influencing them might be involved in the genetic variation for physical exercise in males. To date, genome-wide association studies have found a few loci significantly associated with physical exercise, but effect sizes of the genetic variants found were small (Hara et al., Reference Hara, Hachiya, Sutoh, Matsuo, Nishida, Shimanoe and Wakai2018; Klimentidis et al., Reference Klimentidis, Raichlen, Bea, Garcia, Wineinger, Mandarino and Going2018; de Moor et al., Reference de Moor, Liu, Boomsma, Li, Hamilton, Hottenga, Levy, Liu, Pei and Posthuma2009). In the first genomewide association study (GWAS), de Moor et al. (Reference de Moor, Liu, Boomsma, Li, Hamilton, Hottenga, Levy, Liu, Pei and Posthuma2009) found significant associations of physical exercise with the PAPSS2 gene involved in skeletal development and arthrosis and the SGIP1 gene primarily expressed in the hypothalamus of the brain in Dutch and American samples. Subsequently, Klimentidis et al. (Reference Klimentidis, Raichlen, Bea, Garcia, Wineinger, Mandarino and Going2018) showed that habitual physical activity was significantly associated with higher Alzheimer’s risk gene (APOE) and CADM2 gene related to higher body mass index and risk-taking behavior. In a large Japanese sample, Hara et al. (Reference Hara, Hachiya, Sutoh, Matsuo, Nishida, Shimanoe and Wakai2018) found a novel association between an SNP in the NPSR1-DPY19L1 intergenic region on chromosome 7 and leisure-time exercise behavior.
In females, common environmental influences were more important than genetic influences, explaining about 45% of variation of FVE in the present sample. Genetic environmental influences may show up later in life in females. Common environmental influences include parental supports and role models, sibling physical activity, family socioeconomic status and neighborhood recreational environment and exercise facilities (e.g. parks and trails, community health center). A review of 108 studies on factors influencing adolescents’ physical activity levels suggested that parental support and help were the most consistently associated with adolescents’ physical activity levels (Sallis et al., Reference Sallis, Prochaska and Taylor2000). Parental support and help vary across families, which may constitute common environmental variance in FVE. Although boys tend to receive more parental support in exercise activities than girls (Beets et al., Reference Beets, Cardinal and Alderman2010; Gustafson & Rhodes, Reference Gustafson and Rhodes2006), the present findings suggest that parental support and help exert significant influence only on girls. In line with the present study, Sallies et al. (Reference Sallies, Alcaraz, Mckenzie and Hovell1999) have shown that the effect of parents on sports participation is greater for girls than for boys. In a similar vein, Hanson and Chen (Reference Hanson and Chen2007) suggest that males are physically more active while interacting with their peers, whereas females are more active in structured activities that are more likely to involve parental influence. In young adults, although parents’ direct influence might diminish, the level of a co-twin’s physical activity might be important for maintenance of exercise especially for females. Taken together, present findings suggest that family-based interventions might be more useful for increasing exercise in females than in males in adolescents and young adults.
The present study has several limitations that need to be addressed. First, the present study was based on cross-sectional twin data. Decreased physical activity with age might reflect an increase in exercise in younger birth cohorts rather than aging effects. However, a decline in physical activities with age has been confirmed in previous longitudinal studies (Aaltonen et al., Reference Aaltonen, Ortega-Alonso, Kujala and Kaprio2013; van Mechelen et al., Reference van Mechelen, Twisk, Post, Snel and Kemper2000). Second, as in most twin studies, the present study analyzed self-report data. Although regular exercise can be reliably measured by self-report in adolescents and adults (Stubbe et al., Reference Stubbe, Boomsma, Vink, Cornes, Martin, Skytthe and de Geus2006), future twin studies using Asian samples might consider replicating the present findings using objective measures, for instance, movement sensors in smartphones. Finally, the generalizability of the present findings is limited because the sample consisted only of South Korean adolescents and young adults.
To conclude, although genetic and common environmental influences on FVE did not vary with age, they differed across males and females in South Korean twins during adolescence and young adulthood. These results suggest that intervention strategies to promote physical activity should be sex-specific.
Acknowledgments
Special thanks are given to twins who participated in the South Korean Twin Registry.
Financial support
This study was supported by the National Research Foundation of Korea grant (NRF-371-2011-1 B00047) and by the ‘Development of Health Prediction Technology based on Big Data’ (K18092) funded by the Ministry of Science and ICT(MSIT) of Korea given to the Korea Institute of Oriental Medicine (KIOM).