Introduction
Developmental psychopathology is a maturing discipline, and this is exemplified by advances in how vulnerability factors are studied (Cicchetti & Toth, Reference Cicchetti and Toth2009). Rather than simply demonstrating that a given vulnerability factor predicts an outcome, a primary aim of modern vulnerability factors research is to establish which variables or conditions influence the relationship between a vulnerability factor and an outcome as well as whether there are intermediate processes between the vulnerability factor and the disorder. These topics are respectively explored using moderation and mediation analyses (Baron & Kenny, Reference Baron and Kenny1986); these two models highlight the need to consider additional variables in order to understand fully the relationship between a vulnerability factor and outcome. The questions addressed by moderation and mediation appear distinct (Baron & Kenny, Reference Baron and Kenny1986; Kraemer, Kiernan, Essex, & Kupfer, Reference Kraemer, Kiernan, Essex and Kupfer2008; Kraemer, Wilson, Fairburn, & Agras, Reference Kraemer, Wilson, Fairburn and Agras2002), but the information gained from these models can be complementary and provide a richer theoretical picture. This has led to the growth of models that integrate moderation and mediation within the same analysis (Fairchild & MacKinnon, Reference Fairchild and MacKinnon2009; Hayes, Reference Hayes2015; Muller, Descartes, Judd, & Yzerbyt, Reference Muller, Descartes, Judd and Yzerbyt2005; Preacher, Rucker, & Hayes, Reference Preacher, Rucker and Hayes2007).
Curiously, there is one particular scenario that is consistently overlooked in developmental psychopathology – namely, when moderation and mediation occur with just three variables. In this model, a single variable assumes the role of both moderator and mediator at the same time. At first glance, researchers may assume that a variable cannot simultaneously be both a mediator and moderator, perhaps on theoretical grounds, and that if this scenario occurs, it must be exceptionally rare. However, we argue that such cases are not only plausible in developmental psychopathology research, but might be much more common than is typically assumed – nearly 15 years ago, Grant et al. (Reference Grant, Compas, Thurm, Mcmahon, Gipson, Campbell and Westerholm2006) produced a comprehensive review of the role of stress in child and adolescent psychopathology in which they documented numerous examples of a single variable acting as a moderator in one set of studies and then also as a mediator in another set of studies. However, although it is quite common for the same variable to be conceptualized as a moderator and a mediator in separate studies or by separate groups of researchers, whether a variable functions as both a moderator and a mediator simultaneously is virtually never examined in a single analytic model. Thus, the overarching goal of this paper is to assist researchers in advancing the study of vulnerability factors by highlighting scenarios in which a mediator is also a moderator.
We first begin by defining the three most common theories of vulnerability factor–psychopathology relationships and how they are analyzed. We then discuss how each of these three vulnerability factor types may be complementary rather than mutually exclusive, and describe an integrative analytic framework discussed by statisticians (Hayes, Reference Hayes2015; Preacher et al., Reference Preacher, Rucker and Hayes2007), epidemiologists (Valeri & VanderWeele, Reference Valeri and VanderWeele2013), and in some applied contexts (Kraemer et al., Reference Kraemer, Wilson, Fairburn and Agras2002) but rarely implemented in developmental psychopathology. We then review findings from several specific areas of research to illustrate when this integrative approach may apply. We also discuss possible reasons why this type of moderated mediation model has not gained more widespread use and address criticisms of the model. Next, we provide recommendations about when to use this model, including the timing for when to measure variables and how to select the correct model, and briefly discuss sample size. We also point to existing statistical tools for testing moderation and mediation in a three-variable model. Finally, we discuss important caveats, limitations, advanced analytic approaches, and considerations for developmentally informed designs.
Basic Conceptual Perspectives and Analytic Approaches for Vulnerability Factors
“Risk factor” is a generic term referring to a variable that is believed to increase the probability of experiencing a negative outcome; for extensive discussions of risk factors see the works of Kraemer et al. (Reference Kraemer, Kazdin, Offord, Kessler, Jensen and Kupfer1997), Lenzenweger (Reference Lenzenweger2013) and Glahn et al. (Reference Glahn, Knowles, McKay, Sprooten, Raventós, Blangero and Almasy2014). Conceptually, vulnerability factors are thought to lead to symptoms or disorders primarily via one of three ways: (a) as a direct predictor, (b) in conjunction with a moderator, or (c) through a mediator. The conceptual and statistical models corresponding to these three types of vulnerability factors are shown in Figure 1.
The direct predictor is the simplest model – a risk factor predicts a subsequent outcome without requiring other variables (Figure 1a). Studies of direct effect predictors such as these minimally require a longitudinal design with two assessments, where the risk factor is measured at the first time point and the outcome is measured at the second time point. Often in these designs, researchers also include the outcome variable at the first time point as a covariate to show that the risk factor predicts the outcome at the second time point over and above the way the outcome is predicted by its prior level.
An alternative model posits that a vulnerability factor is moderated by another variable to increase symptoms (Figure 1b). That is, two variables statistically interact to predict later psychopathology. The “third variable” in this model is often termed a moderator, but can also be thought of as a vulnerability factor in its own right.Footnote 1 Moderation can operate in several different ways. In some cases, the moderator variable is described as a catalyst that interacts with an underlying vulnerability leading to psychopathology. In this way, symptoms only increase when the vulnerability and moderator are both present. This is the pattern described in the classic diathesis stress model (Monroe & Simons, Reference Monroe and Simons1991). Another possibility is known as the differential susceptibility model (Belsky & Pluess, Reference Belsky and Pluess2009); this occurs when high levels of the moderator increase risk for psychopathology (e.g., the association between the vulnerability factor and psychopathology is positive), but low levels of the moderator actually decrease risk for psychopathology (the direction of the association between the vulnerability factor and psychopathology is negative). Another version, known as the dual vulnerability model (e.g., Morris, Ciesla, & Garber, Reference Morris, Ciesla and Garber2008), describes a situation in which high levels of either the vulnerability factor or the moderator can lead to psychopathology, but psychopathology will not occur when both the vulnerability factor and moderator are low. With regard to timing, the moderator and vulnerability factor are often measured contemporaneously and psychopathology is measured in a follow-up assessment. Issues concerning timing of variables are discussed in a later section.
Lastly, in a mediation framework, a vulnerability factor may be associated with an increase in symptoms via its effect on the third variable (Figure 1c). In these models, it is common that a vulnerability factor predicts the mediator (path a in Figure 1c), which in turn predicts psychopathology (path b in Figure 1c).Footnote 2 There are several directions in which the mediation may manifest. It is possible that the original vulnerability factor may increase or decrease the mediator, and the mediator may also increase or decrease psychopathology (when the effect of the mediator is to decrease psychopathology, it is considered a protective mediator).
A Third Variable that is Both a Moderator and a Mediator
While there has been considerable attention paid to distinguishing moderators from mediators (Kraemer et al., Reference Kraemer, Kiernan, Essex and Kupfer2008) – and there is good reason to clearly describe these terms (Baron & Kenny, Reference Baron and Kenny1986) – there may also be instances in which a variable functions simultaneously as both a moderator and a mediator (Figure 2). From an empirical perspective, there are many instances when this might occur. For example, Grant et al. (Reference Grant, Compas, Thurm, Mcmahon, Gipson, Campbell and Westerholm2006) reviewed hundreds of moderation and mediation studies examining the links between stressors and psychopathology in children and adolescents, and found numerous variables (e.g., coping skills, social support, and familial functioning) and various cognitive processes (e.g., attributional style and maladaptive beliefs) that were shown to be moderators in one study but mediators in another. However, moderators and mediators are rarely tested within the same analytic model. Instead of viewing moderation and mediation as distinct, we can instead consider them to reflect different components of the associations within a “three-variable system” that can be examined not only within a single study, but within the same analytic model.
When moderation and mediation occur in a three-variable system, the implication is that the indirect effect of M linking X to Y is not uniform across the full range of values of X and M. In other words, the indirect effect varies for the sample depending on the levels of X and M. When mediation is examined without an interaction between X and M, we are only able to capture the indirect effect for the sample on average, but it is plausible that the indirect effect may not be uniform across the entire sample. One example of this is when high and low values of X and M differentially influence the magnitude of the indirect effect, such that the effect is higher when X and M are high relative to when X and M are low. Conversely, the mediation effect is lower when the vulnerability factors X and M are low. This will be clearer when considering the examples in subsequent sections. This model is particularly relevant to developmental psychopathology when two vulnerability factors not only relate to an outcome, but also relate to each other via a developmental process in which one vulnerability factor (X) influences the development of the second vulnerability factor (M). From a more mechanistic and process-oriented approach, it is important to acknowledge that the degree of risk imposed by a vulnerability factor on an adverse outcome can be influenced by associations with one or more other vulnerability factors.
To illustrate moderation and mediation in a three-variable system, we consider the widely discussed phenomena of Gene × Environment interaction (G × E) and Gene by Environment correlation (rGE) as they relate to psychopathology (Rutter, Moffitt, & Caspi, Reference Rutter, Moffitt and Caspi2006). Suppose a gene interacts with an environmental stressor to increase risk for psychopathology (G × E); however, the gene also predicts the environmental stressor with which it interacts (rGE). While candidate genes studies are being increasingly replaced with polygenic risk score approaches, we can consider a candidate gene as an example for illustrative purposes, focusing on the dopamine D2 receptor gene (DRD2), parenting, and anxious–depression symptoms in childhood (Hayden et al., Reference Hayden, Klein, Dougherty, Olino, Laptook, Dyson and Singh2010). Hayden et al. (Reference Hayden, Klein, Dougherty, Olino, Laptook, Dyson and Singh2010) found evidence that the a1 allele of DRD2 exhibited a significant association with anxious–depression symptoms (path x or path c in Figure 1). Furthermore, a significant interaction emerged between DRD2 and intrusive parenting to predict anxiety–depression (path xz in Figure 1b). At the same time, DRD2 also predicted intrusive parenting (path a in Figure 1c). In this example, the environmental variable – intrusive parenting – is clearly acting simultaneously as moderator and mediator of the gene–psychopathology relationship. This is because parenting interacts with the candidate gene while, at the same time, parenting is being predicted by the very same gene. More forcefully stated, it is not conceptually reasonable to separate the moderating and mediating effect of the environment on a gene–disorder relationship when both contribute to risk (Lau & Eley, Reference Lau and Eley2008; Price & Jaffee, Reference Price and Jaffee2008). However, cases in which moderation and mediation co-occur are not restricted to G × E interplay, but can generalize to a wide range of scenarios with vulnerability factors at different levels of analysis. A similar example can be found at the level of a gene, physiology, and psychopathology. Goodyer, Bacon, Ban, Croudace, and Herbert (Reference Goodyer, Bacon, Ban, Croudace and Herbert2009) found that the short allele of the serotonin transporter gene was associated with higher morning cortisol (an a path) and, at the same time, the short allele and morning cortisol interacted to predict depression onset (an xz path). Unfortunately, such examples are rarely reported, although exceptions can be found (Cole, Zapp, Fettig, & Pérez-Edgar, Reference Cole, Zapp, Fettig and Pérez-Edgar2016; Gerhart, Baker, Hoerger, & Ronan, Reference Gerhart, Baker, Hoerger and Ronan2014; Maniates, Stoop, Miller, Halberstadt, & Wolf, Reference Maniates, Stoop, Miller, Halberstadt and Wolf2018). However, by merging Figure 1b and Figure 1c together, we can see that simultaneously testing the three primary types of risk factor–psychopathology relationships can be readily achieved within a single model.
We believe that examining moderation and mediation in the same three-variable system is potentially beneficial for advancing the study of vulnerability factors. We depict this model in Figure 2. This approach allows for simultaneous evaluation of each of the three common vulnerability factor–disorder relationships within a single statistical model. This approach has advantages over a piecemeal approach in which each variant is tested separately. We believe that failing to test for both moderation and mediation limits theory development and slows research progress. Let us imagine a scenario in which the same vulnerability factor, disorder, and a third variable are examined in two separate studies. The first research group tests only moderation and finds a significant effect. The second research group tests only mediation and also finds a significant effect. While it is possible that both results are correct and complement one another, we cannot be sure. It could be that both research groups tested moderation and mediation, but one group elected to report only moderation and the other group elected to report only mediation (each group possibly reporting only the model they found to be statistically significant). In other words, each report might have concealed a failed replication of the other research group's finding. This is not to say that simultaneous moderation and mediation should always be tested in a three-variable system – specific suggestions for when this model is most sensible are given later in this article – but there may be many scenarios in which we fail to conduct or report critical tests and this, in turn, may have deleterious repercussions for theory and reproducibility. We are not advocating that researchers always “test everything,” but that when theory dictates this approach, researchers using the three-variable system are inherently encouraged to report everything, regardless of what ends up being significant. In the following sections, we review two vulnerability factor literatures in which moderated mediation is plausible, but – to the best of our knowledge – has yet to been tested within a single analytic model.
Parental externalizing psychopathology, harsh parenting, and offspring externalizing
Moderation and mediation in a three-variable system may be relevant to the intergenerational transmission of psychopathology. One example is the relationships among parents with externalizing histories, harsh or abusive parenting, and externalizing behavior of their offspring. Moderation and mediation in a three-variable system hypothesis for this scenario might be that the indirect effect of harsh parenting on the relationship between parental externalizing and child externalizing is larger for those with higher levels of harsh parenting, but that the indirect effect is small or nonsignificant when harsh parenting is not present. Parental history of externalizing behavior (Chassin, Rogosch, & Barrera, Reference Chassin, Rogosch and Barrera1991; Chronis et al., Reference Chronis, Lahey, Pelham, Kipp, Baumann and Lee2003; Hicks, Foster, Iacono, & McGue, Reference Hicks, Foster, Iacono and McGue2013; Hicks, Krueger, Iacono, McGue, & Patrick, Reference Hicks, Krueger, Iacono, McGue and Patrick2004) and harsh parenting (Deater-Deckard & Dodge, Reference Deater-Deckard and Dodge1997; Pinquart, Reference Pinquart2017) are both a robust predictor of externalizing problems in offspring.
Several studies have found evidence for moderation, reporting that harsher (or less warm) parenting moderates the effect of parent externalizing on offspring symptoms (Conners-Burrow et al., Reference Conners-Burrow, McKelvey, Pemberton, Lagory, Mesman and Whiteside-Mansell2013, Reference Conners-Burrow, McKelvey, Pemberton, Mesman, Holmes and Bradley2015; DeGarmo, Reference DeGarmo2010; Keller, Cummings, Davies, & Mitchell, Reference Keller, Cummings, Davies and Mitchell2008; although see Frick et al., Reference Frick, Lahey, Loeber, Stouthamer-Loeber, Christ and Hanson1992 for negative findings). At the same time, some others find evidence of mediation. For instance, parent–child conflict and harsh parenting mediates the association between parental externalizing/alcoholism and child externalizing behaviors (Bailey, Hill, Oesterle, & Hawkins, Reference Bailey, Hill, Oesterle and Hawkins2009; Loukas, Fitzgerald, Zucker, & Von Eye, Reference Loukas, Fitzgerald, Zucker and Von Eye2001). Thus, there is some indication that both moderation and mediation may be present in this three-variable system. In addition, while we focused specifically on parental and offspring externalizing, it is worth nothing that parenting practices may exhibit similar moderating and mediating functions in explaining the link between other types of parental psychopathology and child problems (Callender, Olson, Choe, & Sameroff, Reference Callender, Olson, Choe and Sameroff2012; Kane & Garber, Reference Kane and Garber2009). Studies in this area also suggest important developmental considerations with regard to harsh parenting, including the recency of exposure, the accumulation of exposure, and when exposures occur during the course of development (Dunn et al., Reference Dunn, Soare, Raffeld, Busso, Crawford, Davis and Susser2018). Whether future studies find support of moderation and mediation may differ depending on these issues.
Neuroticism, stressful life events, and depression
Neuroticism, stressful life events, and their relationship to depression is another example of moderation and mediation in a three-variable system. Neuroticism and stressful life events prospectively predict depression (Dougherty et al., Reference Dougherty, Bufferd, Carlson, Dyson, Olino, Durbin and Klein2011; Goldstein, Kotov, Perlman, Watson, & Klein, Reference Goldstein, Kotov, Perlman, Watson and Klein2018; Hakulinen et al., Reference Hakulinen, Elovainio, Pulkki-Råback, Virtanen, Kivimaki, Jokela and Jokela2015; Jeronimus, Kotov, Riese, & Ormel, Reference Jeronimus, Kotov, Riese and Ormel2016; Keller, Neale, & Kendler, Reference Keller, Neale and Kendler2007; Kendler, Karkowski, & Prescott, Reference Kendler, Karkowski and Prescott1998; Monroe, Rohde, Seeley, & Lewinsohn, Reference Monroe, Rohde, Seeley and Lewinsohn1999; Vrshek-Schallhorn et al., Reference Vrshek-Schallhorn, Stroud, Mineka, Hammen, Zinbarg, Wolitzky-Taylor and Craske2015). Some studies support the possibility of moderation between neuroticism and stressful life events (Kendler, Kuhn, & Prescott, Reference Kendler, Kuhn and Prescott2004; Van Os & Jones, Reference Van Os and Jones1999; Vinkers et al., Reference Vinkers, Joëls, Milaneschi, Kahn, Penninx and Boks2014); others, however, do not (Goldstein, Perlman, Eaton, Kotov, & Klein, Reference Goldstein, Perlman, Eaton, Kotov and Klein2019; Mineka et al., Reference Mineka, Williams, Wolitzky-Taylor, Vrshek-Schallhorn, Craske, Hammen and Zinbarg2020; Spinhoven et al., Reference Spinhoven, Roelofs, Hovens, Elzinga, van Oppen, Zitman and Penninx2011). There is also evidence that stressful life events may mediate the relationship between neuroticism and depression in youth (Goldstein et al., Reference Goldstein, Perlman, Eaton, Kotov and Klein2019; Kercher, Rapee, & Schniering, Reference Kercher, Rapee and Schniering2009; Wetter & Hankin, Reference Wetter and Hankin2009); this is often referred to as the stress generation model (Hammen, Reference Hammen1991). Considering both sets of findings suggests the possibility that moderation and mediation may exist within a three-variable system. However, examining stressful life events as both a moderator and mediator of the neuroticism–depression relationship has rarely been reported within a single sample, let alone in the same analytic model (Goldstein et al., Reference Goldstein, Perlman, Eaton, Kotov and Klein2019; Spinhoven et al., Reference Spinhoven, Roelofs, Hovens, Elzinga, van Oppen, Zitman and Penninx2011). It is also noteworthy that neuroticism and depression exhibit varying degrees of stability across development (Roberts & DelVecchio, Reference Roberts and DelVecchio2000; Salk, Hyde, & Abramson, Reference Salk, Hyde and Abramson2017) – mediation by stressful life events has only been found in young samples, not in older adults (Ormel, Oldehinkel, & Brilman, Reference Ormel, Oldehinkel and Brilman2001). As a result, the developmental period in question is likely to be an important consideration when evaluating moderation and mediation in this three-variable system.
Myths about Moderation and Mediation
Despite the examples provided above, researchers may be skeptical about whether a variable can simultaneously serve as a moderator and a mediator in a three-variable system for several reasons, and this hesitancy may partially explain why this model is so rarely tested. The first issue is that, in the three-variable system, it may be unclear how the moderator is related to the indirect effect. The second issue involves assumptions for when to consider a variable as a moderator or a mediator. The third is concerns about statistical power.
Regarding the first issue, it is useful to compare the three-variable system to what we more typically see in moderation and mediation with four variables, which is illustrated in Figure 3. In four-variable models, the roles of moderator and mediator are represented by different constructs and there are two variants of this model that are particularly relevant. In one version, the moderator can influence the magnitude of the indirect effect by its effect on the X to M path (path a); this is called first-stage moderated mediation. The second version is when the moderator influences the M to Y path (path b); this is called second-stage moderated mediation (Edwards & Lambert, Reference Edwards and Lambert2007). In these four-variable models, the way the moderator and mediator relate to X and Y is unambiguous. In the three-variable system, the moderating and mediating effects are less intuitive. Up until now, we have largely discussed the M variable as influencing the indirect effect in the three-variable system. However, in the case of the three-variable system, it may at first glance appear that the moderator does not influence the indirect effect so it cannot be a valid model. The M variable cannot be moderating the effect that X has on path a (first-stage moderated mediation) as the product of X and M predicts Y, not M. Moreover, we are accustomed to conceptual diagrams of the simple case of moderation, such as the upper panel of Figure 1b, which gives the impression that M is acting upon the X to Y path. Therefore, one might be tempted to conclude that the moderation effect in the three-variable system is unrelated to the indirect effect because M appears to be moderating the direct effect (X to Y path) in the model rather than the M to Y path.
In our view, this issue is related to an important misconception of moderating effects as “acting” on pathways. Methodologists have long noted that the only difference between a moderator and a “primary predictor” is conceptual and that, from a mathematical standpoint, the moderator and primary predictor are equivalent (e.g., Bauer & Curran, Reference Bauer and Curran2005; Finsaas & Goldstein, Reference Finsaas and Goldstein2021). Figure 1b shows how it may be arbitrary to label M or X as a moderator. The top panel of Figure 1b illustrates the theoretical model in which the moderator is depicted as acting on the X and Y paths; however, what is actually estimated in the statistical model appears in the lower panel, where it is clear that there is no mathematical difference between X and Z.Footnote 3 There is nothing novel about this statistical depiction of moderation – it is the very same diagram that is shown in the classic paper by Baron and Kenny (Reference Baron and Kenny1986) – but, perhaps because the conceptual model is what is often used to depict moderation, moderators are incorrectly believed to be acting on a pathway. This issue is compounded by how researchers typically present moderation analyses by including simple slope follow-up tests at specific values of the moderator. While there is nothing incorrect about these follow-up tests, they carry interpretive value and are often written in a way that reinforces the perception that there is something special about the moderating variable when, in reality, simple slopes can be calculated with either M or X as the moderator (Finsaas & Goldstein, Reference Finsaas and Goldstein2021).
This is relevant to the three-variable system because if the selection of a moderator is arbitrary from a mathematical standpoint, we can view the X variable as the moderator in just the same way as the M variable. The equations to estimate the model are identical using either conceptual framework. Instead of viewing the variable that is assessed in the intermediate period (labeled M in Figure 2) as the mediator and moderator (e.g., parenting practices, stressful life events), we can instead view M as only a mediator and the initial predictor, risk factor X (e.g., parental externalizing or neuroticism), as the moderator. Others writing about a three-variable system with moderation and mediation have referred to this model as an “independent variable as moderator” model or a model with an “Exposure × Mediator interaction” (independent variable and exposure meaning the initial vulnerability factor) (Hayes, Reference Hayes2015; Muthén, Muthen, & Asparouhov, Reference Muthén, Muthen and Asparouhov2016). If we consider X to be the moderator, the three-variable system more closely resembles a variant of second-stage moderated mediation (Edwards & Lambert, Reference Edwards and Lambert2007), since the moderator X influences the M to Y path (path b in Figure 2). Stated using the more traditional language of moderated mediation, in the case of the three-variable system, the indirect effect of X to Y via M will vary in magnitude as a function of X.
Instead, we argue that neither M nor X needs to hold the label of moderator, and that the term “moderated mediation” may confuse more than clarify in this instance. We prefer to view this model as a case in which moderation and mediation effects are both present, and the inclusion of both best explains the relationships among the three variables. Stated another way, by including mediation and moderation, we are able to provide the fullest account of the outcome variable Y (in statistical terms, by including moderation within a mediation model when both types of effects are present will lead to a greater reduction in the unexplained variance of Y). In addition, we can take an agnostic view in how to think of this model by describing it as a three-variable system with mediation and an interaction, as described similarly by Muthén et al. (Reference Muthén, Muthen and Asparouhov2016). This word choice highlights that, in moderation analyses, we need not be concerned with whether M influences the direct effect (size of X and Y) per se, but instead can think of an interaction as reflecting that two variables multiplicatively combine to predict an outcome. In other words, moderation analysis tests whether the effect of two variables on the outcome is greater than the sum of their independent contributions, which is consistent with the notion that allowing the interaction of X and M will reveal how their joint contribution on the indirect effect is not uniform at all levels of X or M.
A second source of skepticism about this model is related to confusion about the independence and timing criteria for when to assess moderators versus mediators. With regard to timing variables, methodologists widely agree that a mediator should occur after the initial vulnerability factor X, but before the outcome Y (Fairchild & MacKinnon, Reference Fairchild and MacKinnon2009; Kraemer et al., Reference Kraemer, Kiernan, Essex and Kupfer2008; MacKinnon, Lockwood, Hoffman, West, & Sheets, Reference MacKinnon, Lockwood, Hoffman, West and Sheets2002); however, the independence and timing of moderators have been debated. Baron and Kenny (Reference Baron and Kenny1986) did not provide requirements or address the timing of moderators, although they suggested that it might be desirable for a moderator to be independent of vulnerability factor X. In contrast, a second widely utilized framework – the MacArthur approach, developed by Kraemer et al. (Reference Kraemer, Wilson, Fairburn and Agras2002) – specifies that a moderator must be measured before the vulnerability factor and that the moderator should be independent of initial variable X (Kraemer et al., Reference Kraemer, Kiernan, Essex and Kupfer2008). The issue of independence matters because if X and M must be independent, then the a path in mediation would be zero. Because of the issue of variable timing and the requirement of independence between X and M in moderation, Kraemer et al. (Reference Kraemer, Wilson, Fairburn and Agras2002, Reference Kraemer, Kiernan, Essex and Kupfer2008) go so far as to state that a variable cannot be both a moderator and mediator at the same time. There are certainly some scenarios where temporal ordering and independence of the moderator could be appropriate and may even be advantageous. However, this is likely to be the case primarily in randomized clinical trials, which is the context in which Kraemer et al. (Reference Kraemer, Kiernan, Essex and Kupfer2008) developed their guidelines (Karazsia & Berlin, Reference Karazsia and Berlin2018). In clinical trials, temporal ordering of moderators is important if researchers want to examine whether symptom severity prior to treatment is a moderator of outcome. In addition, in treatment research, variables assessed after the first session(s) or doses could theoretically be influenced by the differential effectiveness of the treatment group (path a) and therefore might be more appropriate to be treated as a mediator rather than a moderator. Requiring that a moderator be strictly independent of the other predictor is also important in clinical trials as a way of ensuring proper random assignment (Kraemer et al., Reference Kraemer, Kiernan, Essex and Kupfer2008). For instance, in clinical trials, some of the moderators of interest may be time invariant (e.g., race), which would not logically be assumed to be mediators.Footnote 4
However, in observational research, vulnerability factors are almost never orthogonal (Walker, Downey, & Nightingale, Reference Walker, Downey and Nightingale1989); hence, this requirement is neither necessary nor feasible. As it is often impossible (and unethical) to randomly assign participants to exposure to vulnerability factors and because the temporal ordering of vulnerability factors is not always entirely clear, our position is that claiming the moderator must be first in time is not useful. We mention the issues of timing and independence of moderators in clinical research to highlight that there are some approaches that make stricter assumptions about moderators and it is possible that this might partially explain why researchers are hesitant to consider a variable as both a moderator and mediator; however, these assumptions are by no means mandatory requirements for all types of research or disciplines interested in vulnerability factor research.
However, if one wanted to conserve the rule that a moderator must temporally precede the independent variable, it is important to recall that assigning a variable to be the moderator can be somewhat arbitrary, as discussed above. By calling the initial vulnerability factor X the moderator in this model we are, in essence, preserving the temporal criteria of having a moderator be measured first, since X must come before M according to the stricter MacArthur rules of moderation. Similarly, we also described how the term “interaction” can be used to help highlight that the vulnerability factor and the third variable are actually on an even footing with one another, consistent with how this model has been referred to as a mediation model with an Exposure × Mediator interaction (Muthén et al., Reference Muthén, Muthen and Asparouhov2016; Valeri & VanderWeele, Reference Valeri and VanderWeele2013).
In summary, a researcher can consider M to function as both a mediator and a moderator (Figure 2). At the same time, a researcher could treat X as the moderator of the M to Y relationship and M as the mediator of X to Y. Whether M is conceptually viewed as both a moderator and mediator, or if X is viewed as the moderator instead of M, is a matter of theory and the researcher's specific question. Both are valid ways of considering this model and neither conceptualization would change the way the model is estimated.
A third concern about this model relates to power. Researchers may assume that because more paths are estimated within the model, the power to detect effects is lower. However, the power to detect significant relationships in the three-variable system is identical to the power to detect a significant interaction with correlated predictors. In moderation-only models, the correlation between X and M is factored into the model's power and impacts the variance explained in Y, even if that correlation is not reported (Asparouhov & Muthén, Reference Asparouhov and Muthén2020). In the three-variable system, the correlation of X and M is derived as a regression coefficient (to capture path a in the X to M relationship). The correlation is the same as the standardized regression coefficient, which is why the power for the three-variable system is the same as in a moderation-only model. As a result, the sample size requirements for moderation and mediation in a three-variable system are no different from those of a moderation-only model with correlated predictors (for details, see Asparouhov & Muthén, Reference Asparouhov and Muthén2020), though we acknowledge that moderation models in general require large samples, especially when the interaction effect is small.
Recommendations for Simultaneously Testing Moderation and Mediation in a Three-Variable System
What then are our recommendations regarding the special case of a three-variable system with moderation and mediation? Because selecting a de facto moderator is not always sensible in developmental psychopathology, we do not believe that the timing of the “moderator” is a critical determinant in applying this model. Even if one still held to the “moderator must be measured” first criterion, we would argue that because the primary predictor (e.g., parental externalizing, reward processing) and the moderating variable (e.g., parenting, stress) are treated the same mathematically, the vulnerability factor can instead be thought of as moderating the relationship between the mediating variable and the outcome in the three-variable system, as discussed by others (Hayes, Reference Hayes2015; Muthén et al., Reference Muthén, Muthen and Asparouhov2016; Valeri & VanderWeele, Reference Valeri and VanderWeele2013).
Like others, we agree that the mediator should occur temporally after the initial vulnerability factor and before the outcome because temporal ordering is critical for demonstrating mediation. If we want to claim that vulnerability factor X contributes to mediator M, then that vulnerability factor must temporally precede the mediator. However, there can be greater flexibility regarding the timing of the mediator relative to the outcome. While it is ideal for the mediator to be assessed before the outcome (to clearly establish that the mediator is a prospective predictor of the psychopathology outcome), there may be many cases where the mediator can be measured at the same time as the outcome. This would only be allowed if the temporal ordering of the mediator and outcome variables can be reliably determined despite assessing them contemporaneously. For instance, a well-validated stressful life event interview that assesses events occurring in the period between the vulnerability factor and outcome could be administered at the same assessment wave as the outcome variable under two conditions. First, the outcome measure (e.g., symptoms) must focus on a very brief window (e.g., the past 2 weeks), and the level of that outcome in the previous wave can be included as a covariate. Second, researchers must be able to clearly date the occurrence of the mediator (e.g., a stressful life event or life transition) or clearly describe the timescale properties of the mediator.
Another consideration for developmental psychopathologists is whether the a path should be significant. Strictly speaking, from a statistical perspective, this path does not need to be significant in order to infer mediation (e.g., Hayes & Rockwood, Reference Hayes and Rockwood2017; MacKinnon et al., Reference MacKinnon, Lockwood, Hoffman, West and Sheets2002; Rucker, Preacher, Tormala, & Petty, Reference Rucker, Preacher, Tormala and Petty2011). However, we believe that the a path should be significant if a theory makes explicit hypotheses about this path, and that this path is conceptually relevant to many theories in developmental psychopathology. Specifically, one of the appeals of this model is that the vulnerability factor X can be thought of as relevant to the development of M, which is supported when the a path is significant (as was the case in our examples described above; i.e., stress generation explicitly states that neuroticism should predict stress – an a path). Moreover, the conceptual roots of this model are based on G × E and rGE effects, with the latter relationship being defined by a significant a path. In extending this analytic model more broadly to developmental psychopathology, we think it is sensible for the a path to be significant in many cases, although it is not strictly required. Instead, we urge developmental psychopathology researchers to consider the theoretical implications of this model when the a path is or is not significant.
Lastly, we suggest and outline a specific set of steps for researchers to follow when deciding between whether their data are best represented by a regular regression model, a moderation-only or mediation-only model, or the three-variable system model. Researchers should fit the three-variable system model to the data and examine the significance of the X to M mediation path and the XM to Y interaction path (see section Guides for Statistical Implementation for details). If these paths are significant, there is a high likelihood that it is the correct model. If the product of these paths (ab 2) is significant then the three-variable system model is clearly superior to models that exclude these paths. However, if only the moderation effect is significant, but X to M is not (and ab 2 is not), researchers can then test a moderation-only model. Similarly, if the first leg of mediation is significant, but XM to Y is not, researchers could then test mediation alone. When neither of these paths are significant and the product ab 2 is not significant, it is likely that the best model would be a regression model with only direct, independent predictive effects (X to Y and M to Y).
Overall, our recommendations build upon existing guidance about moderation and mediation. For the three-variable system these recommendations can be summarized as follows.
(a) A moderation and mediation three-variable system should be measured with a minimum of three time points (although, for an exception see point (c)).
(b) The sequence of when to measure the components in the model is as follows: the primary vulnerability factor at Time 1, the third variable M at Time 2, and the outcome at Time 3.
(c) In some circumstances, two time points may be sufficient. In these cases, the primary vulnerability factor must be assessed at Time 1 and both the third variable M and the outcome are assessed at Time 2. However, this depends on whether it is possible to establish with confidence that the third variable temporally preceded the outcome variable (e.g., M occurred or measures a process that takes place during the interval between Time 1 and Time 2).
(d) When a given theory explicitly hypothesizes that vulnerability factor X may contribute to the development of vulnerability factor M, there should be a significant predictive association from vulnerability factor X to vulnerability factor M (a significant a path).
(e) When the comprehensive moderation and mediation in a three-variable system model is fit, the indirect effect (mediation), and interaction effect (moderation) are significantly different from zero it can generally be concluded that the moderation and mediation model is superior to either a regular regression, moderation-only, or mediation-only model. While there are several ways for reporting and evaluating this model, we encourage users to test the index of moderated mediation (Hayes, Reference Hayes2015), the product ab 2, which is a straightforward and simple metric (illustrated in greater detail in the next section).
Guides for Statistical Implementation
Our goal here was not to develop a new statistical model, but rather to call attention to the underappreciation and potential value of considering moderation and mediation in a three-variable system. This model has been mentioned for over a decade in widely cited papers (Kraemer et al., Reference Kraemer, Kiernan, Essex and Kupfer2008, Reference Kraemer, Wilson, Fairburn and Agras2002) and there are several papers discussing this model, with practical guides, by methodologists and statisticians (Hayes, Reference Hayes2015; Muthén et al., Reference Muthén, Muthen and Asparouhov2016; Valeri & VanderWeele, Reference Valeri and VanderWeele2013). However, it has received little attention in developmental psychopathology research.
When interpreting effects in a model that contains significant moderation and mediation examined simultaneously, they should be interpreted together rather than as distinct effects. Researchers can report findings following the approach of Hayes (Reference Hayes2015), referred to as the index of moderated mediation. Briefly, this index characterizes how the magnitude of the indirect effect changes depending on the value of a moderator. For the index of moderated mediation in the three-variable system, the initial vulnerability factor is treated as the moderator of the indirect effect of X to Y through M. In the case we have described, it is calculated as follows, based on the path coefficient labels in Figure 1, where X indicates the value of vulnerability factor X:
In this equation, the index of the moderated mediation effect is the coefficient ab 2 as it represents how much the indirect effect will change; details about the index of moderated mediation, statistical significance, and graphical depiction are provided by Hayes (Reference Hayes2015). There are several other ways of evaluating this model. One approach, which may be familiar to more researchers, is similar to the simple slopes approach, but in this case the “simple slopes” are conducted with respect to the indirect effect at small, medium, and large values of the moderator (Stride, Gardner, Catley, & Thomas, Reference Stride, Gardner, Catley and Thomas2015). This approach gives the same interpretative information as the index of moderated mediation and is not necessary to include if the index of moderated mediation is also reported.
Another approach for interpretation that is gaining traction as a possible best practice is the derivation of counterfactuals (Valeri & VanderWeele, Reference Valeri and VanderWeele2013; VanderWeele, Reference VanderWeele2014). Counterfactuals may not be as well known to developmental psychopathologists, but they have the advantage of isolating the components of the effect on the outcome variable so researchers can report the proportion of the effect that is attributable to the interaction/moderation effect alone, the mediation effect alone, and the combination of the moderation and mediation effect. A counterfactual relevant to the interaction plus indirect effect will be quite similar to the index of moderated mediation.
Fortunately, for researchers interested in applying this model, there are statistical tools for doing so. Moderation and mediation can be tested in three-variable systems using macros for SAS, SPSS (Valeri & VanderWeele, Reference Valeri and VanderWeele2013), and R packages lavaan (Rosseel, Reference Rosseel2012) or Mplus (Muthén & Muthén, Reference Muthén and Muthén2017). The SAS and SPSS macros have been developed for testing these models with a counterfactual approach by Valeri and VanderWeele (Reference Valeri and VanderWeele2013). Mplus syntax for this model has been posted online (Stride et al., Reference Stride, Gardner, Catley and Thomas2015) and is also described in detail by Muthén et al. (Reference Muthén, Muthen and Asparouhov2016). Mplus also includes features for implementing the counterfactual approach (Muthén & Asparouhov, Reference Muthén and Asparouhov2015). Given the existing guidance on how to evaluate three-variable system models statistically and the availability of tools to do so, there are few roadblocks for implementing this model. Calculating the index of moderated mediation is also relatively straightforward, and syntax has been developed for that approach as well (Hayes, Reference Hayes2015).
Important Considerations, Limitations, and Alternative Models
We have argued that, in many situations, researchers should consider testing moderation and mediation in a single analytic model. However, we should emphasize the caveat that this model should only be considered when there are reasonable theoretical reasons to do so. Furthermore, there are several issues that require additional discussion, including mediation analysis in nonexperimental contexts, the often low replicability of interactions, theoretical considerations of developmental processes, and advanced analytic models that may be more appropriate for a given research question.
It is important to acknowledge some challenges in mediation analyses in general as they are also inherently present in testing mediation and moderation in a three-variable system. First, studies of mediation should not be equated with causation or mechanism (Tryon, Reference Tryon2018). It may be tempting to claim that because X comes before and predicts M, it must be causing M, but this is an oversimplification. Instead, mediation should be thought of as a necessary, but not sufficient, statistical technique to demonstrate that something may be a mechanism. Causation is better supported by experimental study designs, and statistical tests cannot substitute for random assignment and manipulation of variables. However, making causal claims about mediators can be difficult even when experimental designs are used (Bullock, Green, & Ha, Reference Bullock, Green and Ha2010). For instance, if experimental manipulation of mediator M unintentionally impacts another mediator M 2 (even one that is not observed in the model), it is unclear whether the manipulated change in M or M 2 is contributing to a change in Y, and this unintentional manipulation could bias the indirect effect through M. Second, even if we can manipulate the mediator, it may not always be possible to manipulate all study participants in the same way (e.g., if the study involves a negative mood induction, some participants might not be impacted by the manipulation). As a result of not knowing which subjects are actually changing due to the manipulation, there is a good likelihood that the estimation of the indirect effect will be biased (e.g., those assigned to the manipulation who respond in an atypical fashion will alter estimation of the indirect effect). Regardless of whether the study is observational or experimental, failure to include additional mediator variables when they are present can lead to an overestimation of the indirect effect when testing a mediation model (Fritz, Kenny, & MacKinnon, Reference Fritz, Kenny and MacKinnon2016; Muthén et al., Reference Muthén, Muthen and Asparouhov2016). Third, measurement error can lead to biases in the indirect effect (Fritz et al., Reference Fritz, Kenny and MacKinnon2016).
Just like with mediation, many of the concerns with interactions are also present when examining moderation and mediation in a three-variable system. Importantly, interactions are difficult to replicate and replicate less often than main effects (Amir & Sharon, Reference Amir and Sharon1990; Open Science Collaboration, 2015). As noted previously, sample size consideration for moderation and mediation in a three-variable system is largely determined by the power to detect interactions. Estimation of the power to detect an interaction is more complex than the power for main effects in traditional regression analyses and these complexities, such as the correlation between predictors, may not always be properly considered when planning a study, leading to inadequate sample size (Aguinis & Stone-Romero, Reference Aguinis and Stone-Romero1997; McClelland & Judd, Reference McClelland and Judd1993). At the same time, effect sizes for interactions may be smaller than anticipated, which could lead researchers to select samples that are underpowered, leading to spurious type II errors (Aguinis, Beaty, Boik, & Pierce, Reference Aguinis, Beaty, Boik and Pierce2005). Similar to the issues discussed with mediation, measurement error is also an issue for interactions (Altmejd et al., Reference Altmejd, Dreber, Forsell, Huber, Imai, Johannesson and Camerer2019; McClelland & Judd, Reference McClelland and Judd1993). In addition, lower variance of X and M (or restrictions of their variance) can substantially lower power. Lastly, and in part as a result of the issues just mentioned, interactions that are tested in observational studies are less likely to be detected than in experimental studies (McClelland & Judd, Reference McClelland and Judd1993). Given that it is often impossible to use experimental designs in developmental psychopathology, researchers should pay even greater attention to sample size, select reliable measures, and – when possible – sample from the population in a way that increases the variances of X and Z.
Perhaps most importantly, developmental theory must be carefully considered when applying the three-variable system or any statistical model (Selig & Preacher, Reference Selig and Preacher2009). The three-variable system that we have described is not inherently a developmental model. What makes a model developmental depends on the nature of the question and the constructs investigated, how they are measured, and when they are measured. Selig and Preacher (Reference Selig and Preacher2009) describe several issues and provide guidance when testing mediation in longitudinal models. For instance, researchers must consider the lag between time points so as not to miss a true effect. If the lag is too short, a process may not have had sufficient time to unfold. If the lag is too long, the effects may have diminished. In addition, researchers need to consider whether or how their constructs change. The simple linear models that we have used as examples throughout this paper are not designed to capture change. While moderation and mediation in a three-variable system can be tested with as few as two time points, the use of three or more time points allows for testing the possibility that moderation and mediation influence change over time or are specific to one or multiple periods of time, which may be informative. Another consideration is the direction of effects. Each of the examples and figures presented here to illustrate moderation and mediation in a three-variable system have been unidirectional (vulnerability factor, third variable, and outcome), but it is possible that bidirectional effects may be present. Furthermore, when researchers already have good evidence to suggest that additional variables may play a moderating or mediating role they should also be included in the model (e.g., a hybrid of Figures 2 and 3; for other examples see Hayes, Reference Hayes2015, Reference Hayes2018).
Finally, simultaneous testing of moderation and mediation may be incorporated into more complex developmental models. While we have focused on testing the three-variable system in Figure 2, we also encourage researchers to test moderation and mediation simultaneously in a family of developmental models that are receiving increased attention for their ability to examine two or more co-developmental processes. Some of these models are well known, such as the parallel growth curve model (Cheong, MacKinnon, & Khoo, Reference Cheong, MacKinnon and Khoo2003; Muthén & Curran, Reference Muthén and Curran1997) and the trait–state–occasion model (Cole, Martin, & Steiger, Reference Cole, Martin and Steiger2005). Others are perhaps less widely used, such as the random-intercept cross-lagged panel design (the successor to traditional cross-lagged models), which may have a wide range of applications for researchers interested in teasing apart between-person and longitudinal within-person change and would fit well as an extension of the path model shown in Figure 2 (Berry & Willoughby, Reference Berry and Willoughby2017; Hamaker, Kuiper, & Grasman, Reference Hamaker, Kuiper and Grasman2015). Another hybrid model is the autoregressive latent trajectory model with structured residuals, which includes elements of linear growth and cross-lagged models (Grimm, Ram, & Estabrook, Reference Grimm, Ram and Estabrook2016; Mund & Nestler, Reference Mund and Nestler2019). Still others, such as the dual change score model, can be particularly flexible and capable of testing a variety of development relationships (Grimm et al., Reference Grimm, Ram and Estabrook2016; Mund & Nestler, Reference Mund and Nestler2019). While it is well beyond the scope of a single article to describe the full range of models that apply to developmental research, our point is that testing moderation and mediation simultaneously can be incorporated into many developmental analyses and researchers should not feel constrained to testing only the simple version that we used for illustration.
Conclusions
We have made the case for testing an integrated moderation–mediation model when working with three-variable systems in developmental psychopathology. This model is intuitive and can be applied to a variety of vulnerability factor–psychopathology relationships. This model is not a new concept, but is underappreciated and underutilized. We believe that this model holds promise for advancing developmental psychopathology by providing a framework that more comprehensively tests the three-variable vulnerability factor and outcome scenarios. As we have described, the three-variable system avoids a piecemeal approach to theory building about vulnerability factor–disorder relationships by testing the three common types of relationships simultaneously. In addition, this approach can speed up the process by which we are able to determine the replicability of moderation and mediation effects in vulnerability factor research.
We have illustrated the conceptual value of this model by giving a hypothetical example with genetics as well as describing several instances from diverse corners of the research literature where this model might help clarify discrepancies and improve theory. This integrative framework is of value for furthering progress in research domains that are well established and accelerating knowledge in emerging areas. We have also provided suggestions for testing this model using longitudinal designs. Ideally, there should be at least three waves of assessments, but in some situations this model can be applied with only two time points. However, we caution that this model should not be applied cavalierly, but that researchers should consider the nature and timing of the data they include in these models as well as the use of more advanced models that can accommodate testing moderation and mediation in a single analysis.
Overall, we believe that on conceptual and statistical grounds there is good reason to apply moderation and mediation simultaneously using a three-variable system when conducting developmental psychopathology research. Choosing to test only moderation or only mediation, in the absence of a compelling rationale, may limit theory development and slow research progress. Instead, when rather simple assessment timing criteria are met, simultaneously testing moderation and mediation in three-variable risk (or resilience) factor–outcomes systems should be considered as a first-line research strategy rather than an afterthought.
Funding Statement
This work was supported by National Institute of Mental Health training grant T32-MH-013043 (MCF) and National Institute of Mental Health grants R01-MH-107495 (TMO), R01-MH-093479 (RK), and U01-MH-113390 (DJG).
Conflicts of Interest
None.