Hostname: page-component-55f67697df-4ks9w Total loading time: 0 Render date: 2025-05-08T18:05:57.541Z Has data issue: false hasContentIssue false

Integrating HiTOP and RDoC frameworks Part I: Genetic architecture of externalizing and internalizing psychopathology

Published online by Cambridge University Press:  08 May 2025

Christal N. Davis*
Affiliation:
Mental Illness Research, Education and Clinical Center, Crescenz VAMC, Philadelphia, PA, USA Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Yousef Khan
Affiliation:
Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Sylvanus Toikumo
Affiliation:
Mental Illness Research, Education and Clinical Center, Crescenz VAMC, Philadelphia, PA, USA Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Zeal Jinwala
Affiliation:
Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Dorret I. Boomsma
Affiliation:
Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, The Netherlands and Amsterdam Reproduction and Development Research Institute, Amsterdam, The Netherlands
Daniel F. Levey
Affiliation:
Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA Psychiatry Division, VA Connecticut Healthcare Center, West Haven, CT, USA
Joel Gelernter
Affiliation:
Psychiatry Division, VA Connecticut Healthcare Center, West Haven, CT, USA Departments of Psychiatry, Genetics, and Neuroscience, Yale University School of Medicine, New Haven, CT, USA
Rachel L. Kember
Affiliation:
Mental Illness Research, Education and Clinical Center, Crescenz VAMC, Philadelphia, PA, USA Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Henry R. Kranzler
Affiliation:
Mental Illness Research, Education and Clinical Center, Crescenz VAMC, Philadelphia, PA, USA Center for Studies of Addiction, Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
*
Corresponding author: Christal N. Davis; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Background

There is considerable comorbidity between externalizing (EXT) and internalizing (INT) psychopathology. Understanding the shared genetic underpinnings of these spectra is crucial for advancing knowledge of their biological bases and informing empirical models like the Research Domain Criteria (RDoC) and Hierarchical Taxonomy of Psychopathology (HiTOP).

Methods

We applied genomic structural equation modeling to summary statistics from 16 EXT and INT traits in individuals genetically similar to European reference panels (EUR-like; n = 16,400 to 1,074,629). Traits included clinical (e.g. major depressive disorder, alcohol use disorder) and subclinical measures (e.g. risk tolerance, irritability). We tested five confirmatory factor models to identify the best fitting and most parsimonious genetic architecture and then conducted multivariate genome-wide association studies (GWAS) of the resulting latent factors.

Results

A two-factor correlated model, representing EXT and INT spectra, provided the best fit to the data. There was a moderate genetic correlation between EXT and INT (r = 0.37, SE = 0.02), with bivariate causal mixture models showing extensive overlap in causal variants across the two spectra (94.64%, SE = 3.27). Multivariate GWAS identified 409 lead genetic variants for EXT, 85 for INT, and 256 for the shared traits.

Conclusions

The shared genetic liabilities for EXT and INT identified here help to characterize the genetic architecture underlying these frequently comorbid forms of psychopathology. The findings provide a framework for future research aimed at understanding the shared and distinct biological mechanisms underlying psychopathology, which will help to refine psychiatric classification systems and potentially inform treatment approaches.

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Introduction

Traditional categorical classifications of psychopathology are limited by a rudimentary understanding of the underlying biology. Psychiatric disorders co-occur more often than expected by chance (Kessler et al., Reference Kessler, McGonagle, Zhao, Nelson, Hughes, Eshleman and Kendler1994; Kessler, Chiu, Demler, & Walters, Reference Kessler, Chiu, Demler and Walters2005), suggesting the presence of shared genetic underpinnings across diagnostic categories (Lee, Feng, & Smoller, Reference Lee, Feng and Smoller2021). Additionally, current diagnostic classifications rely on arbitrary thresholds and a polythetic approach, resulting in thousands of unique symptom combinations for the same diagnosis (Galatzer-Levy & Bryant, Reference Galatzer-Levy and Bryant2013). These limitations complicate clinical care and constrain progress in psychiatric nosology and genetics.

To address these issues, approaches such as the Hierarchical Taxonomy of Psychopathology (HiTOP) and the Research Domain Criteria (RDoC) initiative (Cuthbert, Reference Cuthbert2015; Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017; Reference Kotov, Krueger, Watson, Cicero, Conway, DeYoung and Wright2021; Kozak & Cuthbert, Reference Kozak and Cuthbert2016) have been proposed. HiTOP conceptualizes psychopathology along a continuum from symptoms to broad spectra, culminating in an overarching general psychopathology (p) factor. In contrast, RDoC focuses on transdiagnostic domains of functioning to identify putative causal mechanisms across multiple units of analysis. Despite different approaches, HiTOP and RDoC both aim to move beyond traditional diagnostic categories to capture the complexity of psychopathology (Michelini, Palumbo, DeYoung, Latzman, & Kotov, Reference Michelini, Palumbo, DeYoung, Latzman and Kotov2021). However, the integration of these frameworks, particularly through genetic investigations, remains largely unexplored.

Genetic research has advanced our understanding of psychopathology dimensions, with most of this work focusing on externalizing (EXT) and internalizing (INT) behaviors. Twin and family studies have shown that both externalizing (e.g. antisocial behavior, substance use) and internalizing (e.g. anxiety, depression) behaviors have shared genetic bases (Hewitt, Silberg, Neale, Eaves, & Erickson, Reference Hewitt, Silberg, Neale, Eaves and Erickson1992; Silberg et al., Reference Silberg, Erickson, Meyer, Eaves, Rutter and Hewitt1994). Additionally, twin and family studies (Allegrini et al., Reference Allegrini, Cheesman, Rimfeld, Selzam, Pingault, Eley and Plomin2020; Lahey, Van Hulle, Singh, Waldman, & Rathouz, Reference Lahey, Van Hulle, Singh, Waldman and Rathouz2011; Pettersson, Larsson, & Lichtenstein, Reference Pettersson, Larsson and Lichtenstein2016) and principal component analyses (Allegrini et al., Reference Allegrini, Cheesman, Rimfeld, Selzam, Pingault, Eley and Plomin2020; Selzam, Coleman, Caspi, Moffitt, & Plomin, Reference Selzam, Coleman, Caspi, Moffitt and Plomin2018) have examined genetic factors shared by both EXT and INT psychopathology. Genome-wide association studies (GWAS) of childhood behavior problems, encompassing EXT and INT psychopathology, identified two genome-wide significant loci (Neumann et al., Reference Neumann, Nolte, Pappa, Ahluwalia, Pettersson, Rodriguez and Tiemeier2022; Pappa et al., Reference Pappa, Fedko, Mileva-Seitz, Hottenga, Bakermans-Kranenburg, Bartels and Boomsma2015). These findings align with HiTOP, which posits shared genetic factors within and across psychopathology dimensions, but also highlight the need for larger, more integrative studies to disentangle shared and unique genetic influences across EXT and INT.

Genomic structural equation modeling (gSEM) provides a nuanced understanding of the genetic architecture underlying psychopathology. Studies using gSEM to investigate the factor structure of psychiatric disorders have identified one to four factors that underlie their shared liability (Grotzinger et al., Reference Grotzinger, Rhemtulla, de Vlaming, Ritchie, Mallard, Hill and Tucker-Drob2019; Grotzinger et al., Reference Grotzinger, Mallard, Akingbuwa, Ip, Adams and Lewis2022; Lee et al., Reference Lee, Anttila, Won, Feng, Rosenthal, Zhu and Smoller2019). This work has yielded two genome-wide loci for a higher-order p-factor that encompasses compulsive, psychotic, internalizing, and neurodevelopmental disorders (Grotzinger et al., Reference Grotzinger, Mallard, Akingbuwa, Ip, Adams and Lewis2022), with the researchers noting that the p-factor may have low utility for identifying underlying genetic variation. However, because the study included only psychiatric disorders, it did not capture a spectrum of psychopathology consistent with dimensional models like HiTOP. It also included only two EXT (attention-deficit hyperactivity disorder and problematic alcohol use) and two INT (anxiety and major depressive disorder) conditions.

Despite the valuable insights provided by gSEM studies focused specifically on EXT and INT, gaps remain. First, although subclinical traits have been included in studies of EXT (Karlsson Linnér et al., Reference Karlsson Linnér, Biroli, Kong, Meddens, Wedow, Fontana and Beauchamp2019; Karlsson Linnér et al., Reference Karlsson Linnér, Mallard, Barr, Sanchez-Roige, Madole, Driver and Collaborators2021), their integration in the study of INT has been limited, which risks overlooking genetic contributions across the full range of symptom severity. Second, studies have not typically focused on genetic effects shared across both EXT and INT psychopathology. Thus, our understanding of the common genetic factors that drive comorbidities between these two spectra is limited, despite evidence that individuals who experience comorbid EXT and INT problems are uniquely vulnerable to poor economic and social outcomes (Vergunst et al., Reference Vergunst, Commisso, Geoffroy, Temcheff, Poirier, Park and Orri2023), criminality (Commisso et al., Reference Commisso, Geoffroy, Temcheff, Scardera, Vergunst, Côté and Orri2024), and suicide attempts (Commisso et al., Reference Commisso, Temcheff, Orri, Poirier, Lau, Côté and Geoffroy2023).

The current study addresses gaps in our understanding of the genetic architecture of EXT and INT psychopathology. Using gSEM, we tested competing confirmatory factor models based on existing theories of psychopathology to identify the best-fitting structure that captured shared and specific genetic liabilities across clinical and subclinical EXT and INT traits. After identifying the optimal model, we performed GWAS to identify genetic variants that contribute to EXT, INT, and their shared etiology. To further characterize the shared genetic architecture, we applied MiXeR models to estimate polygenicity (i.e. the number of causal variants) and the degree of causal variant overlap between EXT and INT. We also calculated genetic correlations with a wide range of phenotypes to explore the behavioral manifestations of EXT, INT, and EXT + INT genetic liability. These analyses provide insights into the shared and distinct genetic effects that underlie EXT and INT psychopathology and lay the foundation for Part II, a companion article, in which we explore shared and distinct biological mechanisms, including pathways at the levels of genes, cells, and neural circuits.

Methods

Summary statistics

We selected EXT summary statistics based on published studies of individuals genetically similar to European reference panels (EUR-like) (Karlsson Linnér et al., Reference Karlsson Linnér, Biroli, Kong, Meddens, Wedow, Fontana and Beauchamp2019; Karlsson Linnér et al., Reference Karlsson Linnér, Mallard, Barr, Sanchez-Roige, Madole, Driver and Collaborators2021) and existing theory (Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017) (Supplementary Table 1). We included the largest GWAS of attention deficit hyperactivity disorder (ADHD; n = 225,534; (Demontis et al., Reference Demontis, Walters, Athanasiadis, Walters, Therrien, Nielsen and Børglum2023) and four substance use disorders (SUDs, i.e. alcohol [AUD; n = 753,248; Zhou et al., Reference Zhou, Kember, Deak, Xu, Toikumo and Yuan2023], cannabis [CanUD; n = 886,025; Levey et al., Reference Levey, Galimberti, Deak, Wendt, Bhattacharya and Koller2023], opioid [OUD; n = 425,944; Kember et al., Reference Kember, Vickers-Smith, Xu, Toikumo, Niarchou, Zhou and Million Veteran2022], and tobacco [TUD; n = 495,005; Toikumo et al., Reference Toikumo, Jennings, Pham, Lee, Mallard and Bianchi2024]). These disorders align with HiTOP’s disinhibited externalizing spectrum (Krueger et al., Reference Krueger, Hobbs, Conway, Dick, Dretsch and Eaton2021). We also included subclinical measures reflecting disinhibition and antagonism (age of first sexual intercourse [AgeSex; reverse-coded; n = 317,694], general risk tolerance [Risk; n = 431,126], number of sexual partners [NumSex; n = 370,711; Karlsson Linnér et al., Reference Karlsson Linnér, Biroli, Kong, Meddens, Wedow, Fontana and Beauchamp2019; Karlsson Linnér et al., Reference Karlsson Linnér, Mallard, Barr, Sanchez-Roige, Madole, Driver and Collaborators2021], antisocial behavior [ASB; n = 16,400; Tielbeek et al., Reference Tielbeek, Johansson, Polderman, Rautiainen, Jansen and Taylor2017], and automobile speeding propensity [n = 404,291; Karlsson Linnér et al., Reference Karlsson Linnér, Biroli, Kong, Meddens, Wedow, Fontana and Beauchamp2019]).

We selected INT summary statistics that capture HiTOP’s internalizing symptom components, including eating pathology, fear, and distress (Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017). The studies included the largest available GWAS of INT disorders: (1) anorexia nervosa (AN; n = 72,517; Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman and Gaspar2019), (2) major depressive disorder (MDD; n = 1,074,629; Als et al., Reference Als, Kurki, Grove, Voloudakis, Therrien, Tasanko and Børglum2023), and (3) posttraumatic stress disorder (PTSD; n = 214,408; Stein et al., Reference Stein, Levey, Cheng, Wendt, Harrington, Pathak and Program2021). We also included irritability (http://www.nealelab.is/uk-biobank/; n = 345,231), loneliness (n = 490,689; Abdellaoui et al., Reference Abdellaoui, Sanchez-Roige, Sealock, Treur, Dennis, Fontanillas and Boomsma2019), subjective well-being (reverse-coded; n = 298,420; Okbay et al., Reference Okbay, Baselmans, De Neve, Turley, Nivard, Fontana and LifeLines Cohort2016), miserableness (http://www.nealelab.is/uk-biobank/; n = 355,182), and anxiety (ANX; n = 280,490). To detect variants associated with anxiety disorders (i.e. panic disorder, agoraphobia, generalized anxiety disorder [GAD], specific phobia, and social phobia) and subclinical anxiety (i.e. GAD-2 scores), we combined three GWAS (Levey et al., Reference Levey, Gelernter, Polimanti, Zhou, Cheng, Aslan and Stein2020; Otowa et al., Reference Otowa, Hek, Lee, Byrne, Mirza, Nivard and Hettema2016; Purves et al., Reference Purves, Coleman, Meier, Rayner, Davis, Cheesman and Eley2020) via multi-trait analysis of GWAS (Turley et al., Reference Turley, Walters, Maghzian, Okbay, Lee and Fontana2018).

Genomic structural equation modeling

We used linkage disequilibrium score regression (LDSC) (Bulik-Sullivan et al., Reference Bulik-Sullivan, Loh, Finucane, Ripke, Yang and Patterson2015), which estimates the genetic similarity between traits, to calculate genetic correlations (r g). GenomicSEM is robust to sample overlap, yielding valid results even when overlapping cohorts are included (Grotzinger et al., Reference Grotzinger, Rhemtulla, de Vlaming, Ritchie, Mallard, Hill and Tucker-Drob2019). Genetic variants, or single nucleotide variants (SNVs), were filtered using reference data from EUR-like individuals (HapMap3) (The International HapMap 3 Consortium, 2010). We excluded rare variants (i.e. those present in fewer than 1% of people). LDSC accounts for differences in genetic similarity across and within populations by using linkage disequilibrium (LD) information—patterns of correlation between nearby genetic variants—from the 1000 Genomes Project (The 1000 Genomes Project Consortium, 2015). Traits with an average rg < 0.20, indicating a small effect size, were excluded from subsequent modeling.

To investigate patterns of genetic overlap, we fit five confirmatory factor analyses (CFAs) based on existing theories of psychopathology (Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2013; Krueger & Markon, Reference Krueger and Markon2006; Markon, Reference Markon2019). Whereas our objective was to test theoretically driven models, we did not perform exploratory factor analyses. First, we evaluated a correlated factors model with two factors representing EXT and INT psychopathology. Next, we tested a three-factor model to explore whether SUDs were distinct from other EXT behaviors. The third model was a bifactor model consisting of a general psychopathology factor and uncorrelated EXT and INT factors. Fourth, we evaluated a higher-order model, which is mathematically equivalent to the two-factor model but represents shared genetic effects between EXT and INT as a second-order factor. To ensure that the model was statistically identified, we constrained the loadings onto the second-order factor to the square root of the genetic correlation between EXT and INT (Loehlin, Reference Loehlin1996). Finally, we tested a unidimensional (p-factor) model where all traits were loaded onto one factor. In all CFA models except the three-factor model (where shared variance is explained by the SUD factor), residual variances for SUDs were allowed to correlate, reflecting shared variance not explained by the broader EXT factor. Model fit was evaluated using chi-square, Akaike information criterion (AIC), comparative fit index (CFI; >.90), and standardized root mean squared residual (SRMR; <0.08) statistics. Factor loadings of $ \ge $ 0.35 were required to ensure that each trait meaningfully contributed to the factor.

Genome-wide association studies

For GWAS, we excluded rare genetic variants (i.e. those present in <1% of people) and poorly imputed variants (i.e. imputation scores <0.6). Each genetic variant was regressed onto the latent variable(s) using diagonally weighted least squares estimation. To evaluate heterogeneity, we calculated QSNV values, which indicate whether a variant’s effect is consistent across all traits included on a factor or varies across them. Genetic variants with a Q-statistic p-value <5 × 10−8 were removed to ensure that only variants with common effects across the spectrum were represented. We identified lead genetic variants using LD clumping in PLINK 1.9 (Purcell et al., Reference Purcell, Neale, Todd-Brown, Thomas, Ferreira, Bender and Sham2007), which groups correlated variants based on reference data from the 1000 Genomes Project and identified the most significant variant in the group.

Polygenicity and genetic overlap

We estimated the genetic complexity of traits using MiXeR, which provides a measure of polygenicity (the number of genetic variants that influence a trait) and discoverability (the average strength of these effects) (Frei et al., Reference Frei, Holland, Smeland, Shadrin, Fan, Maeland and Dale2019; Holland et al., Reference Holland, Frei, Desikan, Fan, Shadrin, Smeland and Dale2020). In MiXeR, causal variants are defined as genetic variants with nonzero additive effects. The method assumes that effects follow a mixture distribution, where a proportion of variants have an effect and others have no effect. MiXeR was also used to identify the proportion of unique and shared causal variants for EXT and INT, regardless of whether their effects are in the same or opposite direction for each trait. Simulation studies have demonstrated that MiXeR is robust to sample overlap (Frei et al., Reference Frei, Holland, Smeland, Shadrin, Fan, Maeland and Dale2019).

Genetic correlations

To identify behavioral tendencies that are genetically linked to the spectra, we calculated genetic correlations between EXT, INT, EXT + INT, and 1,437 other traits using LDSC. LDSC accounts for sample overlap to ensure unbiased estimates (Bulik-Sullivan et al., Reference Bulik-Sullivan, Loh, Finucane, Ripke, Yang and Patterson2015). To account for multiple testing and reduce the risk of false positives, we applied a Benjamini–Hochberg false discovery rate (FDR) correction. We also identified genetic correlations that differed in magnitude for EXT and INT using an FDR correction to determine significance and determined the proportion of genetic correlations concordant in direction for EXT and INT.

Results

Confirmatory factor analysis

Of 18 traits examined (Figure 1), two (automobile speeding propensity and anorexia nervosa) were excluded due to weak associations with all others (mean r g < 0.20). Using the remaining 16 traits, we tested several CFA models (Figure 2 and Supplementary Figures 1–3). A p-factor model did not provide adequate fit ( $ \chi $ 2(98) = 8965.28, AIC = 9041.28, CFI = 0.79, and SRMR = 0.15). Although a bifactor model fit well ( $ \chi $ 2(88) = 3472.02, AIC = 3568.02, CFI = 0.92, and SRMR = 0.07), it led to several weak (<0.35) and one negative standardized loading. Because of its limited interpretability and several traits that did not adequately load onto the factors, we did not perform GWAS on the bifactor model.

Figure 1. Genetic correlations between the externalizing and internalizing traits. ADHD, ‘attention deficit hyperactivity disorder’; AgeSex, ‘age at first sexual intercourse (reverse coded)’; ASB, ‘antisocial behavior’; AUD, ‘alcohol use disorder’; CanUD, ‘cannabis use disorder’; MDD, ‘major depressive disorder’; NumSex, ‘number of sexual partners’; OUD, ‘opioid use disorder’; PTSD, ‘posttraumatic stress disorder’; Risk, ‘general risk tolerance’; Speeding, ‘automobile speeding propensity’; TUD, ‘tobacco use disorder’. Traits are ordered alphabetically. AgeSex and Wellbeing are reverse-coded.

Figure 2. Confirmatory factor analyses of externalizing and internalizing psychopathology. (a) correlated two-factor model, (b) higher-order factor model. Fit of both: $ {\chi}^2 $ (97), 3877.82; AIC, 3955.82; CFI, 0.91; SRMR, 0.09. EXT, ‘externalizing’; INT, ‘internalizing’; ADHD, ‘attention deficit hyperactivity disorder’; AgeSex, ‘age at first sex (reverse coded)’; NumSex, ‘number of sexual partners’; ASB, ‘antisocial behavior’; AUD, ‘alcohol use disorder’; CanUD, ‘cannabis use disorder’; OUD, ‘opioid use disorder’; TUD, ‘tobacco use disorder’; SWB, ‘subjective wellbeing (reverse coded) ’; PTSD, ‘posttraumatic stress disorder’; MDD, ‘major depressive disorder’; ANX, ‘anxiety’.

Two mathematically equivalent models provided adequate fit and factor loadings: (1) a two-factor correlated model and (2) a higher-order factor model. Fit statistics were $ \chi $ 2(97) = 3877.82, AIC = 3955.82, CFI = 0.91, and SRMR = 0.09. To ensure identification in the higher-order model, the loadings onto the second-order factor were constrained equal to the square root of the genetic correlation between EXT and INT (Loehlin, Reference Loehlin1996), a commonly accepted approach for testing a factor that explains shared variance between two correlated dimensions. Although a three-factor model separating SUDs from other externalizing traits provided reasonable fit ( $ \chi $ 2(101) = 4401.18, AIC = 4471.18, CFI = 0.90, and SRMR = 0.09), it did not improve the fit of the simpler two-factor model and thus did not justify the added complexity.

Genome-wide association studies

Eighty-three significant SNVs exhibited heterogeneous effects across the EXT spectrum. A plurality was most strongly associated with AgeSex (47.62%), followed by TUD (16.67%; Supplementary Table 2). After removing heterogenous SNVs, the EXT GWAS identified 409 lead SNVs (Supplementary Table 3). Of these, 92 (22.49%) were not identified or located nearby (within $ \pm $ 1000 kb) SNVs identified by the input GWAS, and four were not previously associated with any EXT trait. Three of the novel SNVs were on chromosome 4 (rs1961547, rs9316, and rs7682762), with the fourth on chromosome 22 (rs1473811). These SNVs were previously associated with chronotype, schizophrenia, and social support, among other traits (Supplementary Table 4).

For INT, 41 significant SNVs exhibited heterogeneity, with most (78.05%) showing the strongest associations with MDD (Supplementary Table 5). Removing heterogeneous SNVs left 85 lead SNVs (Supplementary Table 6). Of these, 23 (27.06%) were in loci not significant in the input GWAS, and two were not previously associated with INT (Supplementary Table 7). The novel SNVs were on chromosomes 3 and 4 (rs1381763 and rs4698408).

For EXT + INT, 39 significant SNVs exhibited heterogeneous effects (Supplementary Table 8). Of these, a plurality (41.03%) was most strongly associated with AgeSex, followed by AUD (17.95%). Removing heterogeneous SNVs left 256 lead SNVs (Figure 3 and Supplementary Table 9), 38 of which (14.84%) were not in loci identified by any of the input GWAS, though all were previously associated with an EXT or INT phenotype.

Figure 3. Manhattan plot of the GWAS results for externalizing and internalizing (EXT + INT) liability.

Note: Significant single nucleotide variants (SNVs) are highlighted in blue. Green diamonds and annotations denote the lead SNVs in loci not identified in the input GWAS for either of the two spectra.

Polygenicity and genetic overlap

The EXT and INT spectra displayed similar levels of genetic complexity. However, INT had lower discoverability ( $ {\hat{\sigma}}_{\beta } $ 2 = 1.40 × 10−5) than EXT ( $ {\hat{\sigma}}_{\beta } $ 2 = 1.44 × 10−4), suggesting that INT may require larger samples to detect significant SNVs. Despite an estimated rg of 0.37, almost all causal variants (96.83% of EXT and 92.42% of INT; Figure 4) overlapped. Of the shared causal variants, 62.92% were estimated to have concordant effect directions.

Figure 4. Results of bivariate causal mixture models for externalizing (EXT) and internalizing (INT).

Note: The Venn diagram on the left shows the estimated overlap in causal variants for externalizing and internalizing. The next two panels show conditional Q-Q plots of the observed versus expected -log10 p-values in trait 1 as a function of the significance of the association with trait 2 (and vice versa) at the level of p $ \le $ 0.1, p $ \le $ 0.01, and p $ \le $ 0.001. The final panel shows the negative log-likelihood as a function of polygenic overlap. The minimum model is represented by the point furthest to the left where the genetic overlap is estimated to be the minimum required to explain the genetic correlation between the two traits. The maximum model is represented as the point furthest on the right represents complete overlap where the least polygenic trait’s causal variants are a subset of the more polygenic trait. The best model (i.e. the one selected by the MiXeR analysis) is the lowest point on the chart.

Genetic correlations

There were 413 significant rg with EXT (Supplementary Table 10 and Supplementary Figure 4), among which tobacco phenotypes were among the strongest (current smoking: r g = 0.79, SE = 0.02; ever smoked: r g = 0.62, SE = 0.02), as were those with lower socioeconomic status (Townsend deprivation index: r g = 0.68, SE = 0.03; financial difficulties: r g = 0.58, SE = 0.03) and educational attainment (r g = −0.44, SE = 0.02). There were 311 phenotypes genetically correlated with INT (Supplementary Table 11 and Supplementary Figure 5), with the strongest including mood swings (r g = 0.90, SE = 0.01) and neuroticism (r g = 0.89, SE = 0.01). INT was also genetically correlated with several types of pain (abdominal: r g = 0.60, SE = 0.04; facial: r g = 0.51, SE = 0.08; chest: r g = 0.49, SE = 0.03; and multisite chronic pain: r g = 0.49, SE = 0.03). There were 474 significant rg with EXT + INT, with most being like those of the first-order factors (Supplementary Table 12 and Figure 5). Of the rg, 17.61% were discordant in direction for EXT and INT, and 168 were of a significantly different magnitude for the two spectra. The rg with the greatest difference in magnitude were neuroticism score, worrier/anxious feelings, mood swings, fed-up feelings, and worry too long after embarrassment, all of which had stronger positive associations with INT than EXT (Supplementary Figure 6).

Figure 5. Genetic correlations between the externalizing and internalizing (EXT + INT) factor and publicly available traits.

Note: The top 25 associations are annotated.

Discussion

Using genomic structural equation modeling, we examined the genetic architecture of EXT and INT spectra. Of the factor structures tested, a two-factor correlated model emerged as the best-fitting, with EXT and INT exhibiting a moderate genetic correlation (r g = 0.37) that could be modeled as a higher-order EXT + INT factor. GWAS of the latent factors identified 409 independent genomic loci associated with EXT, 85 with INT, and 256 with the higher-order EXT + INT factor. Evaluations of the heterogeneity and polygenicity of each spectrum yielded clear evidence of genetic complexity. Below, we discuss the findings in relation to dimensional models of psychopathology and their implications for biological mechanisms and future research.

Genetic insights and biological mechanisms

Our findings demonstrate the utility of multivariate approaches for boosting power to detect loci that might otherwise remain undetected when traits are examined in isolation (Grotzinger et al., Reference Grotzinger, Rhemtulla, de Vlaming, Ritchie, Mallard, Hill and Tucker-Drob2019). By leveraging the shared etiology of commonly co-occurring traits, we identified several novel loci. For both EXT and INT, approximately one-quarter of the genetic variants identified were not significant in the GWAS of the individual traits that comprise the spectrum. These results also emphasize the need to study shared liability alongside spectrum-specific effects. For example, 14.84% of loci identified for EXT + INT were captured by neither the EXT or INT factors individually nor any of the individual trait GWAS. Thus, there is added value in modeling higher-order constructs to capture biological mechanisms that transcend individual traits and spectra. This higher-order factor may represent ‘genetic convergence’, whereby overlapping genetic influences coalesce to drive the co-occurrence of psychopathology across spectra (Waszczuk et al., Reference Waszczuk, Eaton, Krueger, Shackman, Waldman, Zald and Kotov2020).

Spectrum-level heterogeneity and genetic complexity

We identified differences in the degree of genetic variability for EXT, INT, and their shared liability (EXT + INT). EXT exhibited the most heterogeneity, with 83 genetic variants showing varying effects across the EXT spectrum. INT, in contrast, showed less heterogeneity, with just 41 genetic variants that had differing effects across INT traits. The higher-order EXT + INT factor showed the least heterogeneity, with 39 genetic variants having differing effects across the spectra. These findings offer insights into two possible frameworks for understanding heterogeneity in hierarchical models of psychopathology (Markon, Reference Markon2019; Waszczuk et al., Reference Waszczuk, Eaton, Krueger, Shackman, Waldman, Zald and Kotov2020).

The first framework is a ‘top-up’ approach, in which the higher-order EXT + INT factor captures and accumulates the heterogeneity present within the two lower-order spectra (Markon, Reference Markon2019). Given this, we would expect a greater degree of heterogeneity for EXT + INT than the first-order spectra. In contrast, the second framework posits a ‘top-down’ model, whereby the EXT + INT factor reflects a more stable and cohesive genetic architecture than either first-order factor (Markon, Reference Markon2019). Heterogeneity, in this case, would be more pronounced at lower levels of the hierarchy. Our findings align more closely with this second perspective. Less heterogeneity for EXT + INT than EXT and INT individually suggests that the higher-order spectrum consolidates overlapping genetic influences into a more cohesive signal, challenging the assumption that broader spectra necessarily amplify complexity.

Insights from MiXeR reinforce the interpretation that the EXT + INT factor reflects a cohesive and stable genetic architecture. Despite a modest genetic correlation of 0.37 between EXT and INT, MiXeR estimated that nearly all causal genetic variants that influence EXT (96.83%) and INT (92.42%) were shared across the two spectra. Although many of these variants were estimated to have opposite directions of effect (37.08%), these findings underscore the potential value of higher-order constructs for reducing trait-specific noise, offering a clearer lens through which to understand the genetic etiology of psychopathology.

Genetic correlations and phenotypic patterns

Genetic correlations of the two spectra revealed distinct but overlapping phenotypes, with discordant directions for 17.61% of r g and significant differences in magnitude across 168 traits. EXT was predominantly genetically correlated with substance use and socioeconomic status, while most genetic correlations with INT were with anxiety- and mood-related traits. Despite this divergence, the top r g for the two spectra overlapped somewhat, particularly for pain-related phenotypes. EXT + INT was also genetically correlated with pain-related phenotypes, substance use, and mood. Although these patterns support differentiating between the two spectra, they also highlight areas of overlap where common genetic factors exhibit pleiotropy with other behavioral manifestations.

Implications for dimensional models of psychopathology

Our findings offer important insights for dimensional models of psychopathology like HiTOP and RDoC by supporting and refining their conceptual frameworks. HiTOP emphasizes a hierarchical organization of symptoms and spectra, proposing that broader dimensions like EXT + INT capture shared liability across lower-order spectra (Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017). Although this aligns with our observation that the EXT + INT factor accounts for shared liability between the EXT and INT spectra, its lower heterogeneity challenges the expectation that heterogeneity accumulates in broader psychopathology spectra. Instead, higher-order psychopathology spectra may reflect more cohesive constructs than previously assumed, necessitating a reevaluation of how hierarchical models conceptualize heterogeneity across levels. Alternatively, the reduced heterogeneity may stem from the modeling approach. Future research should explore whether this pattern is specific to EXT + INT or generalizes to other higher-order psychopathology spectra.

A key feature of HiTOP is its emphasis on psychopathology as a continuum, with both clinical and subclinical traits contributing meaningfully to the model (Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017; Kotov et al., Reference Kotov, Krueger, Watson, Cicero, Conway, DeYoung and Wright2021). Our findings support this dimensional conceptualization, as clinical and subclinical traits had similar loadings on the EXT and INT factors. Specifically, the average factor loading for EXT disorders was 0.69 and for subclinical EXT traits, it was 0.68. INT disorders loaded somewhat more strongly (0.84) than subclinical INT traits (0.74). These results suggest that, rather than reflecting distinct constructs, clinical and subclinical manifestations of psychopathology exist along a shared continuum of liability. This is consistent with HiTOP’s view that psychopathology is best understood dimensionally, rather than through the discrete diagnostic categories that currently prevail.

The mechanistic focus of RDoC provides a complementary lens for interpreting our findings. By examining genetic correlations and associated genetic variants for the spectra, we explore how HiTOP’s structural dimensions relate to RDoC’s units of analysis, including genetic and behavioral processes, which are also areas of active investigation within the HiTOP Consortium. The EXT and INT spectra align with distinct RDoC domains. EXT primarily maps onto the positive valence and cognitive systems, as evidenced by genetic correlations with substance use and risky decision-making, which reflect processes related to reward sensitivity and impulsivity. In contrast, INT aligns strongly with the negative valence systems and arousal/regulatory systems, as evidenced by genetic correlations with mood swings, neuroticism, and pain, which are linked to threat sensitivity and stress reactivity. Framing higher-order dimensions like EXT + INT as bridges between these distinct RDoC domains begins to unify structural and mechanistic approaches to psychopathology, a key focus of Part II of this pair of companion articles.

Limitations

A key limitation of this study is the inclusion of only EUR-like individuals. Although GWAS of EXT- and INT-related disorders are available for some non-EUR-like individuals, those of more precise, non-disorder psychiatric phenotypes are limited. Nonetheless, some research suggests a similar factor structure exists in individuals genetically similar to African reference panels (AFR-like). At the disorder level, a gSEM of AFR-like individuals identified substance use and psychiatric disorder factors that roughly aligned with EXT and INT (Khan et al., Reference Khan, Davis, Jinwala, Feuer, Toikumo, Hartwell and Kember2024). Additionally, the analyses showed that a higher-order factor accounted for genetic variance shared by substance use and psychiatric disorders, which to a degree corresponds to our EXT + INT factor. With the growth of diverse biobanks (Bianchi et al., Reference Bianchi, Brennan, Chiang, Criswell, D’Souza, Gibbons and Denny2024) and deep phenotyping (Sanchez-Roige & Palmer, Reference Sanchez-Roige and Palmer2020), direct replication in other ancestries should soon be possible and given high priority.

Another limitation is the study’s focus on EXT and INT, which does not fully capture psychopathology. Future research could incorporate a wider range of traits, including those related to thought or personality disorders, to provide a more comprehensive understanding of the genetic architecture underlying psychopathology. Finally, most existing GWAS are cross-sectional, and detailed demographic information is not always reported due to the meta-analytic nature of these studies. For developmental disorders such as ADHD and conduct disorder, cross-sectional approaches in which the ages of individuals represented vary may obscure genetic insights.

Conclusions

Our findings highlight the value of using genetic data to refine and evaluate dimensional models of psychopathology. By modeling EXT, INT, and their shared liability (EXT + INT), we identified a more cohesive genetic etiology operating at the higher-order level, challenging the assumption that broader spectra necessarily increase heterogeneity. Instead, the EXT + INT factor consolidated overlapping genetic influences, offering a clearer framework for understanding comorbidity. These findings can inform models like HiTOP and RDoC, emphasizing the need to balance shared liability with spectrum-specific effects. Future research, including the companion Part II article, will be crucial for identifying the specific and shared biological pathways that underlie these spectra. The effort to link dimensional models to biological mechanisms will serve to bridge structural (i.e. HiTOP) and mechanistic (i.e. RDoC) approaches to psychopathology, ultimately enhancing their clinical and theoretical utility.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/S0033291725000856.

Financial support

This work was supported by the Veterans Integrated Service Network 4 Mental Illness Research, Education and Clinical Center and by Department of Veterans Affairs grants I01 BX004820 to H.R.K., National Institute on Alcohol Abuse and Alcoholism grant AA028292 to R.L.K, and KNAW (Royal Netherlands Academy of Arts and Sciences) Academy Professor Award (PAH/6635) to D.I.B. The funders had no role in study design, data collection or analysis, decision to publish, or preparation of the manuscript.

Competing interests

Dr. Kranzler is a member of advisory boards for Altimmune, Clearmind Medicine, Dicerna Pharmaceuticals, Enthion Pharmaceuticals, Lilly Pharmaceuticals, and Sophrosyne Pharmaceuticals; a consultant to Altimmune and Sobrera Pharmaceuticals; the recipient of research funding and medication supplies for an investigator-initiated study from Alkermes; and a member of the American Society of Clinical Psychopharmacology’s Alcohol Clinical Trials Initiative, which was supported in the last 3 years by Alkermes, Dicerna, Ethypharm, Lundbeck, Mitsubishi, Otsuka, and Pear Therapeutics. Drs. Kranzler and Gelernter hold U.S. patent 10,900,082 titled: ‘Genotype-guided dosing of opioid agonists’, issued 26 January 2021. The other authors have no disclosures to make.

References

Abdellaoui, A., Sanchez-Roige, S., Sealock, J., Treur, J. L., Dennis, J., Fontanillas, P., … Boomsma, D. I. (2019). Phenome-wide investigation of health outcomes associated with genetic predisposition to loneliness. Human Molecular Genetics, 28(22), 38533865. https://doi.org/10.1093/hmg/ddz219CrossRefGoogle ScholarPubMed
Allegrini, A. G., Cheesman, R., Rimfeld, K., Selzam, S., Pingault, J.-B., Eley, T. C., & Plomin, R. (2020). The p factor: Genetic analyses support a general dimension of psychopathology in childhood and adolescence. Journal of Child Psychology and Psychiatry, 61(1), 3039. https://doi.org/10.1111/jcpp.13113CrossRefGoogle Scholar
Als, T. D., Kurki, M. I., Grove, J., Voloudakis, G., Therrien, K., Tasanko, E., … Børglum, A. D. (2023). Depression pathophysiology, risk prediction of recurrence and comorbid psychiatric disorders using genome-wide analyses. Nature Medicine, 29(7), 18321844. https://doi.org/10.1038/s41591-023-02352-1CrossRefGoogle ScholarPubMed
Bianchi, D. W., Brennan, P. F., Chiang, M. F., Criswell, L. A., D’Souza, R. N., Gibbons, G. H., … Denny, J. C. (2024). The All of Us Research Program is an opportunity to enhance the diversity of US biomedical research. Nature Medicine, 30(2), 330333. https://doi.org/10.1038/s41591-023-02744-3CrossRefGoogle Scholar
Bulik-Sullivan, B. K., Loh, P.-R., Finucane, H. K., Ripke, S., Yang, J., Patterson, N., … Schizophrenia Working Group of the Psychiatric Genomics, Consortium. (2015). LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics, 47(3), 291295. https://doi.org/10.1038/ng.3211CrossRefGoogle ScholarPubMed
Caspi, A., Houts, R. M., Belsky, D. W., Goldman-Mellor, S. J., Harrington, H., Israel, S., … Moffitt, T. E. (2013). The p factor: One general psychopathology factor in the structure of psychiatric disorders? Clinical Psychological Science, 2(2), 119137. https://doi.org/10.1177/2167702613497473CrossRefGoogle Scholar
Commisso, M., Geoffroy, M.-C., Temcheff, C., Scardera, S., Vergunst, F., Côté, S. M., … Orri, M. (2024). Association of childhood externalizing, internalizing, comorbid problems with criminal convictions by early adulthood. Journal of Psychiatric Research, 172, 915. https://doi.org/10.1016/j.jpsychires.2024.01.039CrossRefGoogle ScholarPubMed
Commisso, M., Temcheff, C., Orri, M., Poirier, M., Lau, M., Côté, S., … Geoffroy, M.-C. (2023). Childhood externalizing, internalizing and comorbid problems: Distinguishing young adults who think about suicide from those who attempt suicide. Psychological Medicine, 53(3), 10301037. https://doi.org/10.1017/S0033291721002464CrossRefGoogle ScholarPubMed
Cuthbert, B. N. (2015). Research domain criteria: Toward future psychiatric nosologies. Dialogues in Clinical Neuroscience, 17(1), 8997. https://doi.org/10.31887/DCNS.2015.17.1/bcuthbertCrossRefGoogle ScholarPubMed
Demontis, D., Walters, G. B., Athanasiadis, G., Walters, R., Therrien, K., Nielsen, T. T., … Børglum, A. D. (2023). Genome-wide analyses of ADHD identify 27 risk loci, refine the genetic architecture and implicate several cognitive domains. Nature Genetics, 55(2), 198208. https://doi.org/10.1038/s41588-022-01285-8CrossRefGoogle ScholarPubMed
Frei, O., Holland, D., Smeland, O. B., Shadrin, A. A., Fan, C. C., Maeland, S., … Dale, A. M. (2019). Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nature Communications, 10(1), 2417. https://doi.org/10.1038/s41467-019-10310-0CrossRefGoogle ScholarPubMed
Galatzer-Levy, I. R., & Bryant, R. A. (2013). 636,120 ways to have posttraumatic stress disorder. Perspectives on Psychological Science, 8(6), 651662. https://doi.org/10.1177/1745691613504115CrossRefGoogle ScholarPubMed
Grotzinger, A. D., Mallard, T. T., Akingbuwa, W. A., Ip, H. F., Adams, M. J., Lewis, C. M., … Schizophrenia Working Group of the Psychiatric Genetics, Consortium. (2022). Genetic architecture of 11 major psychiatric disorders at biobehavioral, functional genomic and molecular genetic levels of analysis. Nature Genetics, 54(5), 548559. https://doi.org/10.1038/s41588-022-01057-4CrossRefGoogle ScholarPubMed
Grotzinger, A. D., Rhemtulla, M., de Vlaming, R., Ritchie, S. J., Mallard, T. T., Hill, W. D., … Tucker-Drob, E. M. (2019). Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nature Human Behaviour, 3(5), 513525. https://doi.org/10.1038/s41562-019-0566-xCrossRefGoogle ScholarPubMed
Hewitt, J. K., Silberg, J. L., Neale, M. C., Eaves, L. J., & Erickson, M. (1992). The analysis of parental ratings of children’s behavior using LISREL. Behavior Genetics, 22(3), 293317. https://doi.org/10.1007/BF01066663CrossRefGoogle ScholarPubMed
Holland, D., Frei, O., Desikan, R., Fan, C.-C., Shadrin, A. A., Smeland, O. B., … Dale, A. M. (2020). Beyond SNP heritability: Polygenicity and discoverability of phenotypes estimated with a univariate Gaussian mixture model. PLOS Genetics, 16(5), e1008612. https://doi.org/10.1371/journal.pgen.1008612CrossRefGoogle ScholarPubMed
Karlsson Linnér, R., Biroli, P., Kong, E., Meddens, S. F. W., Wedow, R., Fontana, M. A., … Beauchamp, J. P. (2019). Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nature Genetics, 51(2), 245257. https://doi.org/10.1038/s41588-018-0309-3CrossRefGoogle ScholarPubMed
Karlsson Linnér, R., Mallard, T. T., Barr, P. B., Sanchez-Roige, S., Madole, J. W., Driver, M. N., … Collaborators, C. (2021). Multivariate analysis of 1.5 million people identifies genetic associations with traits related to self-regulation and addiction. Nature Neuroscience, 24(10), 13671376. https://doi.org/10.1038/s41593-021-00908-3CrossRefGoogle ScholarPubMed
Kember, R. L., Vickers-Smith, R., Xu, H., Toikumo, S., Niarchou, M., Zhou, H., … Million Veteran, P. (2022). Cross-ancestry meta-analysis of opioid use disorder uncovers novel loci with predominant effects in brain regions associated with addiction. Nature Neuroscience, 25(10), 12791287. https://doi.org/10.1038/s41593-022-01160-zCrossRefGoogle ScholarPubMed
Kessler, R. C., Chiu, W. T., Demler, O., & Walters, E. E. (2005). Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey replication. Archives of General Psychiatry, 62(6), 617627. https://doi.org/10.1001/archpsyc.62.6.617CrossRefGoogle ScholarPubMed
Kessler, R. C., McGonagle, K. A., Zhao, S., Nelson, C. B., Hughes, M., Eshleman, S., … Kendler, K. S. (1994). Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States: Results from the National Comorbidity Survey. Archives of General Psychiatry, 51(1), 819. https://doi.org/10.1001/archpsyc.1994.03950010008002CrossRefGoogle ScholarPubMed
Khan, Y., Davis, C. N., Jinwala, Z., Feuer, K. L., Toikumo, S., Hartwell, E. E., … Kember, R. L. (2024). Combining transdiagnostic and disorder-level GWAS enhances precision of psychiatric genetic risk profiles in a multi-ancestry sample. medRxiv. https://doi.org/10.1101/2024.05.09.24307111CrossRefGoogle Scholar
Kotov, R., Krueger, R. F., Watson, D., Achenbach, T. M., Althoff, R. R., Bagby, R. M., … Zimmerman, M. (2017). The hierarchical taxonomy of psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology, 126(4), 454477. https://doi.org/10.1037/abn0000258CrossRefGoogle ScholarPubMed
Kotov, R., Krueger, R. F., Watson, D., Cicero, D. C., Conway, C. C., DeYoung, C. G., … Wright, A. G. C. (2021). The hierarchical taxonomy of psychopathology (HiTOP): A quantitative nosology based on consensus of evidence. Annual Review of Clinical Psychology, 17(1), 83108. https://doi.org/10.1146/annurev-clinpsy-081219-093304CrossRefGoogle ScholarPubMed
Kozak, M. J., & Cuthbert, B. N. (2016). The NIMH research domain criteria initiative: Background, issues, and pragmatics. Psychophysiology, 53(3), 286297. https://doi.org/10.1111/psyp.12518CrossRefGoogle ScholarPubMed
Krueger, R. F., Hobbs, K. A., Conway, C. C., Dick, D. M., Dretsch, M. N., Eaton, N. R., … HiTOP Utility Workgroup. (2021). Validity and utility of hierarchical taxonomy of psychopathology (HiTOP): II externalizing superspectrum. World Psychiatry, 20(2), 171193. https://doi.org/10.1002/wps.20844CrossRefGoogle ScholarPubMed
Krueger, R. F., & Markon, K. E. (2006). Reinterpreting comorbidity: A model-based approach to understanding and classifying psychopathology. Annual Review of Clinical Psychology, 2, 111133. https://doi.org/10.1146/annurev.clinpsy.2.022305.095213CrossRefGoogle ScholarPubMed
Lahey, B. B., Van Hulle, C. A., Singh, A. L., Waldman, I. D., & Rathouz, P. J. (2011). Higher-order genetic and environmental structure of prevalent forms of child and adolescent psychopathology. Archives of General Psychiatry, 68(2), 181189. https://doi.org/10.1001/archgenpsychiatry.2010.192CrossRefGoogle ScholarPubMed
Lee, P. H., Anttila, V., Won, H., Feng, Y.-C. A., Rosenthal, J., Zhu, Z., … Smoller, J. W. (2019). Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell, 179(7), 14691482.e11. https://doi.org/10.1016/j.cell.2019.11.020CrossRefGoogle Scholar
Lee, P. H., Feng, Y.-C. A., & Smoller, J. W. (2021). Pleiotropy and cross-disorder genetics among psychiatric disorders. Biological Psychiatry, 89(1), 2031. https://doi.org/10.1016/j.biopsych.2020.09.026CrossRefGoogle ScholarPubMed
Levey, D. F., Galimberti, M., Deak, J. D., Wendt, F. R., Bhattacharya, A., Koller, D., … Veterans Affairs Million Veteran, Program. (2023). Multi-ancestry genome-wide association study of cannabis use disorder yields insight into disease biology and public health implications. Nature Genetics, 55(12), 20942103. https://doi.org/10.1038/s41588-023-01563-zCrossRefGoogle ScholarPubMed
Levey, D. F., Gelernter, J., Polimanti, R., Zhou, H., Cheng, Z., Aslan, M., … Stein, M. B. (2020). Reproducible genetic risk loci for anxiety: Results from ∼200,000 participants in the million veteran program. American Journal of Psychiatry, 177(3), 223232. https://doi.org/10.1176/appi.ajp.2019.19030256CrossRefGoogle ScholarPubMed
Loehlin, J. C. (1996). The Cholesky approach: A cautionary note. Behavior Genetics, 26(1), 6569. https://doi.org/10.1007/BF02361160CrossRefGoogle Scholar
Markon, K. E. (2019). Bifactor and hierarchical models: Specification, inference, and interpretation. Annual Review of Clinical Psychology, 15, 5169. https://doi.org/10.1146/annurev-clinpsy-050718-095522CrossRefGoogle ScholarPubMed
Michelini, G., Palumbo, I. M., DeYoung, C. G., Latzman, R. D., & Kotov, R. (2021). Linking RDoC and HiTOP: A new interface for advancing psychiatric nosology and neuroscience. Clinical Psychology Review, 86, 102025. https://doi.org/10.1016/j.cpr.2021.102025CrossRefGoogle ScholarPubMed
Neumann, A., Nolte, I. M., Pappa, I., Ahluwalia, T. S., Pettersson, E., Rodriguez, A., … Tiemeier, H. (2022). A genome-wide association study of total child psychiatric problems scores. PLoS One, 17(8), e0273116. https://doi.org/10.1371/journal.pone.0273116CrossRefGoogle ScholarPubMed
Okbay, A., Baselmans, B. M. L., De Neve, J.-E., Turley, P., Nivard, M. G., Fontana, M. A., … LifeLines Cohort, S. (2016). Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nature Genetics, 48(6), 624633. https://doi.org/10.1038/ng.3552CrossRefGoogle ScholarPubMed
Otowa, T., Hek, K., Lee, M., Byrne, E. M., Mirza, S. S., Nivard, M. G., … Hettema, J. M. (2016). Meta-analysis of genome-wide association studies of anxiety disorders. Molecular Psychiatry, 21(10), 13911399. https://doi.org/10.1038/mp.2015.197CrossRefGoogle ScholarPubMed
Pappa, I., Fedko, I. O., Mileva-Seitz, V. R., Hottenga, J. J., Bakermans-Kranenburg, M. J., Bartels, M., … Boomsma, D. I. (2015). Single nucleotide polymorphism heritability of behavior problems in childhood: Genome-wide complex trait analysis. Journal of the American Academy of Child and Adolescent Psychiatry, 54(9), 737744. https://doi.org/10.1016/j.jaac.2015.06.004CrossRefGoogle ScholarPubMed
Pettersson, E., Larsson, H., & Lichtenstein, P. (2016). Common psychiatric disorders share the same genetic origin: A multivariate sibling study of the Swedish population. Molecular Psychiatry, 21(5), 717721. https://doi.org/10.1038/mp.2015.116CrossRefGoogle ScholarPubMed
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., … Sham, P. C. (2007). PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics, 81(3), 559575. https://doi.org/10.1086/519795CrossRefGoogle ScholarPubMed
Purves, K. L., Coleman, J. R. I., Meier, S. M., Rayner, C., Davis, K. A. S., Cheesman, R., … Eley, T. C. (2020). A major role for common genetic variation in anxiety disorders. Molecular Psychiatry, 25(12), 32923303. https://doi.org/10.1038/s41380-019-0559-1CrossRefGoogle Scholar
Sanchez-Roige, S., & Palmer, A. A. (2020). Emerging phenotyping strategies will advance our understanding of psychiatric genetics. Nature Neuroscience, 23(4), 475480. https://doi.org/10.1038/s41593-020-0609-7CrossRefGoogle ScholarPubMed
Selzam, S., Coleman, J. R. I., Caspi, A., Moffitt, T. E., & Plomin, R. (2018). A polygenic p factor for major psychiatric disorders. Translational Psychiatry, 8(1), 205. https://doi.org/10.1038/s41398-018-0217-4CrossRefGoogle ScholarPubMed
Silberg, J. L., Erickson, M. T., Meyer, J. M., Eaves, L. J., Rutter, M. L., & Hewitt, J. K. (1994). The application of structural equation modeling to maternal ratings of twins’ behavioral and emotional problems. Journal of Consulting and Clinical Psychology, 62(3), 510521.Google ScholarPubMed
Stein, M. B., Levey, D. F., Cheng, Z., Wendt, F. R., Harrington, K., Pathak, G. A., … Program, V. A. M. V. (2021). Genome-wide association analyses of post-traumatic stress disorder and its symptom subdomains in the Million Veteran Program. Nature Genetics, 53(2), 174184. https://doi.org/10.1038/s41588-020-00767-xCrossRefGoogle ScholarPubMed
The 1000 Genomes Project Consortium. (2015). A global reference for human genetic variation. Nature, 526(7571), 6874. https://doi.org/10.1038/nature15393CrossRefGoogle Scholar
The International HapMap 3 Consortium. (2010). Integrating common and rare genetic variation in diverse human populations. Nature, 467(7311), 5258. https://doi.org/10.1038/nature09298CrossRefGoogle Scholar
Tielbeek, J. J., Johansson, A., Polderman, T. J. C., Rautiainen, M.-R., Jansen, P., Taylor, M., … Broad Antisocial Behavior Consortium collaborators. (2017). Genome-wide association studies of a broad spectrum of antisocial behavior. JAMA Psychiatry, 74(12), 12421250. https://doi.org/10.1001/jamapsychiatry.2017.3069CrossRefGoogle ScholarPubMed
Toikumo, S., Jennings, M. V., Pham, B. K., Lee, H., Mallard, T. T., Bianchi, S. B., … PsycheMERGE Substance Use Disorder Workgroup. (2024). Multi-ancestry meta-analysis of tobacco use disorder identifies 461 potential risk genes and reveals associations with multiple health outcomes. Nature Human Behaviour, 8(6), 11771193. https://doi.org/10.1038/s41562-024-01851-6CrossRefGoogle ScholarPubMed
Turley, P., Walters, R. K., Maghzian, O., Okbay, A., Lee, J. J., Fontana, M. A., … Social Science Genetic Association Consortium. (2018). Multi-trait analysis of genome-wide association summary statistics using MTAG. Nature Genetics, 50(2), 229237. https://doi.org/10.1038/s41588-017-0009-4CrossRefGoogle ScholarPubMed
Vergunst, F., Commisso, M., Geoffroy, M.-C., Temcheff, C., Poirier, M., Park, J., … Orri, M. (2023). Association of childhood externalizing, internalizing, and comorbid symptoms with long-term economic and social outcomes. JAMA Network Open, 6(1), e2249568e2249568. https://doi.org/10.1001/jamanetworkopen.2022.49568CrossRefGoogle ScholarPubMed
Waszczuk, M. A., Eaton, N. R., Krueger, R. F., Shackman, A. J., Waldman, I. D., Zald, D. H., … Kotov, R. (2020). Redefining phenotypes to advance psychiatric genetics: Implications from hierarchical taxonomy of psychopathology. Journal of Abnormal Psychology, 129(2), 143161. https://doi.org/10.1037/abn0000486CrossRefGoogle ScholarPubMed
Watson, H. J., Yilmaz, Z., Thornton, L. M., Hübel, C., Coleman, J. R. I., Gaspar, H. A., … Eating Disorders Working Group of the Psychiatric Genomics Consortium. (2019). Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nature Genetics, 51(8), 12071214. https://doi.org/10.1038/s41588-019-0439-2CrossRefGoogle ScholarPubMed
Zhou, H., Kember, R. L., Deak, J. D., Xu, H., Toikumo, S., Yuan, K., … Million Veteran Program. (2023). Multi-ancestry study of the genetics of problematic alcohol use in over 1 million individuals. Nature Medicine 29, 31843192. https://doi.org/10.1038/s41591-023-02653-5CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Genetic correlations between the externalizing and internalizing traits. ADHD, ‘attention deficit hyperactivity disorder’; AgeSex, ‘age at first sexual intercourse (reverse coded)’; ASB, ‘antisocial behavior’; AUD, ‘alcohol use disorder’; CanUD, ‘cannabis use disorder’; MDD, ‘major depressive disorder’; NumSex, ‘number of sexual partners’; OUD, ‘opioid use disorder’; PTSD, ‘posttraumatic stress disorder’; Risk, ‘general risk tolerance’; Speeding, ‘automobile speeding propensity’; TUD, ‘tobacco use disorder’. Traits are ordered alphabetically. AgeSex and Wellbeing are reverse-coded.

Figure 1

Figure 2. Confirmatory factor analyses of externalizing and internalizing psychopathology. (a) correlated two-factor model, (b) higher-order factor model. Fit of both: $ {\chi}^2 $(97), 3877.82; AIC, 3955.82; CFI, 0.91; SRMR, 0.09. EXT, ‘externalizing’; INT, ‘internalizing’; ADHD, ‘attention deficit hyperactivity disorder’; AgeSex, ‘age at first sex (reverse coded)’; NumSex, ‘number of sexual partners’; ASB, ‘antisocial behavior’; AUD, ‘alcohol use disorder’; CanUD, ‘cannabis use disorder’; OUD, ‘opioid use disorder’; TUD, ‘tobacco use disorder’; SWB, ‘subjective wellbeing (reverse coded) ’; PTSD, ‘posttraumatic stress disorder’; MDD, ‘major depressive disorder’; ANX, ‘anxiety’.

Figure 2

Figure 3. Manhattan plot of the GWAS results for externalizing and internalizing (EXT + INT) liability.Note: Significant single nucleotide variants (SNVs) are highlighted in blue. Green diamonds and annotations denote the lead SNVs in loci not identified in the input GWAS for either of the two spectra.

Figure 3

Figure 4. Results of bivariate causal mixture models for externalizing (EXT) and internalizing (INT).Note: The Venn diagram on the left shows the estimated overlap in causal variants for externalizing and internalizing. The next two panels show conditional Q-Q plots of the observed versus expected -log10 p-values in trait 1 as a function of the significance of the association with trait 2 (and vice versa) at the level of p$ \le $ 0.1, p$ \le $ 0.01, and p$ \le $ 0.001. The final panel shows the negative log-likelihood as a function of polygenic overlap. The minimum model is represented by the point furthest to the left where the genetic overlap is estimated to be the minimum required to explain the genetic correlation between the two traits. The maximum model is represented as the point furthest on the right represents complete overlap where the least polygenic trait’s causal variants are a subset of the more polygenic trait. The best model (i.e. the one selected by the MiXeR analysis) is the lowest point on the chart.

Figure 4

Figure 5. Genetic correlations between the externalizing and internalizing (EXT + INT) factor and publicly available traits.Note: The top 25 associations are annotated.

Supplementary material: File

Davis et al. supplementary material

Davis et al. supplementary material
Download Davis et al. supplementary material(File)
File 5.9 MB