Predicting relapse or recurrence of depression: systematic review of prognostic models

Andrew S. Moriarty*: Affiliation:
Mental Health and Addiction Research Group, Department of Health Sciences, University of York, UK and Hull York Medical School, University of York, UK
Nicholas Meader: Affiliation:
Centre for Reviews and Dissemination, University of York, UK and Cochrane Common Mental Disorders, University of York, UK
Kym I. E. Snell: Affiliation:
Centre for Prognosis Research, School of Medicine, Keele University, UK
Richard D. Riley: Affiliation:
Centre for Prognosis Research, School of Medicine, Keele University, UK
Lewis W. Paton: Affiliation:
Mental Health and Addiction Research Group, Department of Health Sciences, University of York, UK
Sarah Dawson: Affiliation:
Cochrane Common Mental Disorders, University of York, UK and Bristol Medical School, University of Bristol, UK
Jessica Hendon: Affiliation:
Centre for Reviews and Dissemination, University of York, UK and Cochrane Common Mental Disorders, University of York, UK
Carolyn A. Chew-Graham: Affiliation:
School of Medicine, Keele University, UK
Simon Gilbody: Affiliation:
Mental Health and Addiction Research Group, Department of Health Sciences, University of York, UK and Hull York Medical School, University of York, UK
Rachel Churchill: Affiliation:
Centre for Reviews and Dissemination, University of York, UK and Cochrane Common Mental Disorders, University of York, UK
Robert S. Phillips: Affiliation:
Centre for Reviews and Dissemination, University of York, UK
Shehzad Ali: Affiliation:
Mental Health and Addiction Research Group, Department of Health Sciences, University of York, UK and Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, Canada
Dean McMillan: Affiliation:
Mental Health and Addiction Research Group, Department of Health Sciences, University of York, UK and Hull York Medical School, University of York, UK
*: Correspondence: Andrew S. Moriarty. Email: [email protected]

Article contents

Abstract
Background
Aims
Method
Results
Conclusions
Method
Results
Discussion
References

Rights & Permissions

Abstract

Background

Relapse and recurrence of depression are common, contributing to the overall burden of depression globally. Accurate prediction of relapse or recurrence while patients are well would allow the identification of high-risk individuals and may effectively guide the allocation of interventions to prevent relapse and recurrence.

Aims

To review prognostic models developed to predict the risk of relapse, recurrence, sustained remission, or recovery in adults with remitted major depressive disorder.

Method

We searched the Cochrane Library (current issue); Ovid MEDLINE (1946 onwards); Ovid Embase (1980 onwards); Ovid PsycINFO (1806 onwards); and Web of Science (1900 onwards) up to May 2021. We included development and external validation studies of multivariable prognostic models. We assessed risk of bias of included studies using the Prediction model risk of bias assessment tool (PROBAST).

Results

We identified 12 eligible prognostic model studies (11 unique prognostic models): 8 model development-only studies, 3 model development and external validation studies and 1 external validation-only study. Multiple estimates of performance measures were not available and meta-analysis was therefore not necessary. Eleven out of the 12 included studies were assessed as being at high overall risk of bias and none examined clinical utility.

Conclusions

Due to high risk of bias of the included studies, poor predictive performance and limited external validation of the models identified, presently available clinical prediction models for relapse and recurrence of depression are not yet sufficiently developed for deploying in clinical settings. There is a need for improved prognosis research in this clinical area and future studies should conform to best practice methodological and reporting guidelines.

Keywords

Depressive disorders epidemiology statistical methodology risk assessment primary care

Type: Review
Information: The British Journal of Psychiatry , Volume 221 , Issue 2 , August 2022 , pp. 448 - 458

DOI: https://doi.org/10.1192/bjp.2021.218 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists

Background

Depression is the leading cause of disability worldwide.¹ After a first episode of depression, approximately half of patients will experience a relapse or recurrence (re-emergence of depressive symptoms after an initial improvement),^{Reference Beshai, Dobson, Bockting and Quigley2} and most do so within the first 6 months.^{Reference Ali, Rhodes, Moreea, McMillan, Gilbody and Leach3} Those who experience a relapse or recurrence are more likely to relapse again in the future compared with those who do not.^{Reference Burcusa and Iacono4} There is evidence to suggest that relapse or recurrence of depression result in an increased risk of subsequent relapse^{Reference Burcusa and Iacono4} and, possibly, increased treatment resistance.^{Reference Post5} Reliable prediction of individuals’ risk of relapse and recurrence might enable a precision medicine approach to relapse prevention, personalising the allocation and potentially type of relapse prevention interventions offered to ensure maximum benefit. Prognostic factors are variables that are associated with an outcome of interest, although are not necessarily causal, and overall prognosis can be estimated within groups defined by the values of a prognostic factor. These are differentiated from prescriptive factors, which are associated with outcomes and also moderate treatment effects. Prognostic factors associated with relapse and recurrence include childhood maltreatment, history of recurrent depression and presence of residual depressive symptoms, among others, whereas evidence for prescriptive factors remains limited.^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6} Multivariable prognostic models combine information about multiple prognostic factors for a particular person to provide individualised risk predictions.^{Reference Riley, van der Windt, Croft and Moons7} There have been an increasing number of attempts to derive and validate prognostic models to predict depression-related outcomes.^{Reference Bone, Simmonds-Buckley, Thwaites, Sandford, Merzhvynska and Rubel8–11} There has been no previous systematic review to identify all prognostic models designed to predict relapse or recurrence of depression.

Objectives

To identify and critically appraise prognostic model development and validation studies aimed at predicting relapse, recurrence, sustained remission or recovery in adults with major depressive disorder who meet the criteria for remission or recovery. In addition, we planned to summarise and meta-analyse their predictive performance, to describe the characteristics of the models identified, and to review the clinical utility (net benefit) of the identified models, where possible.

Method

The protocol was preregistered in the Cochrane Database of Systematic Reviews (CD013491)^{Reference Moriarty, Meader, Gilbody, Chew-Graham, Churchill and Ali12,Reference Moriarty, Meader, Snell, Riley, Paton and Chew-Graham13} and is reported in line with the Preferred Reported Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline.^{Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow14}

Eligibility criteria

We specified the following inclusion criteria (see the Appendix for PICOTS criteria):^{Reference Debray, Damen, Snell, Ensor, Hooft and Reitsma15}

(a) adult population (18 years and over) with major depressive disorder (defined using validated diagnostic criteria) who met criteria for remission or recovery (i.e. no longer meeting diagnostic criteria for major depressive episode) at the point of prediction;
(b) any setting (primary, secondary, or community care);
(c) all multivariable prognostic models developed to predict individual risk of relapse, recurrence, sustained remission, or recovery of depression over any time period.

Remission and recovery are terms used to describe an improvement in depressive symptoms; remission meaning improved but still ‘in episode’ and recovery being the resolution of the underlying episode (usually after 6 to 12 months of remission).^{Reference Bockting, Hollon, Jarrett, Kuyken and Dobson16} Relapse occurs following some level of remission but precedes recovery, whereas recurrence is the onset of a new episode of depression following recovery.^{Reference Frank, Prien, Jarrett, Keller, Kupfer and Lavori17,Reference Rush, Kraemer, Sackeim, Fava, Trivedi and Frank18} Sustained remission can be thought of as the inverse, or opposite of relapse; and recovery as the inverse of recurrence. Both of these hold potentially valuable prognostic information pertinent to relapse risk prediction models in depression, and are therefore included as outcomes in this review. The precise temporal cut-offs of these terms have not been robustly validated empirically and are inconsistently operationalised in the literature.^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6} For this reason, we accepted all definitions of these terms, as operationalised by the authors of the primary studies.

We included all three types of prognostic model study:

(a) development studies with internal validation (which derive a model for individualised prediction and quantify predictive performance in the development data-set);
(b) development with external validation (which develop a model and then quantify the performance in data external to the development set); and
(c) external validation only (attempt to externally validate an existing model).^{Reference Wolff, Moons, Riley, Whiting, Westwood and Collins19}

External validation did not include randomly splitting the development data-set to produce two separate data-sets (an approach more appropriately considered an inefficient form of internal validation),^{Reference Riley, van der Windt, Croft and Moons7} but did include studies where a validation data-set was produced by a non-random split, for example, participants from the same institution but at different time points (temporal validation) or by location (geographical validation).^{Reference Collins, De Groot, Dutton, Omar, Shanyinde and Tajar20}

We excluded models developed in populations with comorbid severe mental illness (for example, schizophrenia and bipolar affective disorder), as these patients typically receive more intensive psychiatric input and the results would be less generalisable. We excluded studies where the intention was not to provide individualised risk predictions (for example those aimed at quantifying the adjusted effects of prognostic factors).

Information sources and search strategy

We searched the Cochrane Library (current issue); Ovid MEDLINE (1946 onwards); Ovid Embase (1980 onwards); and Ovid PsycINFO (1806 onwards) up to May 2021, using relevant subject headings (controlled vocabularies) and search syntax, appropriate to each resource. We also searched several grey literature resources primarily for dissertations and theses (Open Grey (www.opengrey.eu); ProQuest Dissertations & Theses Global (www.proquest.com/products-services/pqdtglobal.html); DART-Europe E-theses Portal (www.dart-europe.eu); EThOS - the British Libraries e-theses online service (ethos.bl.uk); Open Access Theses and Dissertations (oatd.org)), also up to May 2021. We applied no restrictions by date, language or publication status. We checked the reference lists of all included articles and conducted forward citation searches on the Web of Science (12 March 2021 and 19 May 2021), to identify additional studies missed from the original electronic searches (for example unpublished or in-press citations). We contacted corresponding authors for information on unpublished or ongoing studies.

Selection of studies

Two review authors (A.S.M. and N.M.) independently reviewed the titles and abstracts of studies identified by the search strategy. We excluded prognostic model studies that clearly did not meet our inclusion criteria at the title and abstract screening stage. For any studies where there was uncertainty, we undertook a full-text review. We resolved disagreement in judgements through discussion or, if necessary, by referral to a third review author (K.I.E.S. or D.M.).

Data collection

Two independent review authors (A.S.M. and N.M.) conducted the data extraction, commencing 1 September 2020. The Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS), which has been specifically designed for systematic reviews of prognostic models, was used to guide data extraction.^{Reference Debray, Damen, Snell, Ensor, Hooft and Reitsma15} This included the following measures of predictive performance, where available:

calibration, which measures the extent to which risk predictions and observed outcomes are in agreement (measures extracted included calibration slope, ratio of observed (O) to expected (E) events (O:E ratio), calibration plots); and
discrimination, the model's ability to separate patients who develop the outcome of interest and those who do not (usually measured using the Concordance (C)-statistic or area under the receiver operator curve (AUC)).

Where these measures were not available directly, we planned to calculate them from other information available with reference to recent guidance.^{Reference Debray, Damen, Riley, Snell, Reitsma and Hooft21} We also planned to extract information on clinical utility, where available. Clinical utility is important to consider when a model's predicted risks are to be used to inform decision-making. It can be measured by the net benefit at a particular risk threshold, and by plotting decision curves of the net benefit across a range of relevant thresholds.^{Reference Vickers, Van Calster and Steyerberg22}

Data synthesis and meta-analysis approaches

If a sufficient number of external validation studies were identified for a particular model, we planned to conduct random-effects meta-analyses to summarise the performance of prognostic models, as the data were likely to be highly heterogeneous. In the absence of sufficient data for a meta-analysis, we have used a narrative synthesis instead.

Risk of bias assessment in included studies

Two independent review authors (A.S.M. and N.M.) assessed risk of bias (ROB) using the Prediction model risk of bias assessment tool (PROBAST), which assesses ROB (low, high or unclear) over four domains (participants, predictors, outcomes and analysis) and applicability (concerns about applicability; also low, high, or unclear) in the first three of the domains.^{Reference Riley, van der Windt, Croft and Moons7,Reference Wolff, Moons, Riley, Whiting, Westwood and Collins19,Reference Moons, Wolff, Riley, Whiting, Westwood and Collins23}

For the ‘Analysis’ domain, when determining whether an appropriate sample size was used, we adhered to PROBAST recommendations, which use the rule of thumb using events per candidate predictor parameter (EPP). The PROBAST guidance suggests an EPP of 20 and over for development studies (although those between 10 and 20 EPP can be rated ‘probably yes’ or ‘probably no’, depending on outcome frequency, overall model performance and distribution of predictors in the model) and 100 participants with the outcome and 100 without the outcome for external validation studies. For handling of missing data, multiple imputation is considered the most appropriate method when data are missing at random^{Reference Riley, van der Windt, Croft and Moons7} and is recommended by PROBAST.^{Reference Moons, Wolff, Riley, Whiting, Westwood and Collins23} The PROBAST tool has been developed primarily for studies that used a more traditional regression method and guidance on best practice for machine learning models is less widely available. In the case of any machine learning models identified, we applied the PROBAST guidance as described for traditional regression techniques, but judgements should be interpreted with these limitations in mind.

Results

Results of the search

We identified a total of 8694 studies initially, with one study located through a forward citation search performed on 12 March 2021.^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} Deduplicated records (n = 5777) records underwent title and abstract screening by two independent review authors (A.S.M. and N.M.), 51 underwent full-text screening and 12 studies were included in the final review (2 full-text articles required referral to K.I.E.S. and were excluded following this referral). These included 11 unique prognostic models; 1 of the studies^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} externally validated a model developed elsewhere.^{Reference Van Loo, Aggen, Gardner and Kendler25} Studies excluded after full-text screening (n = 37) fell into two categories: not meeting study design criteria (i.e. model not intended for prediction) or not meeting participant population criteria. Two studies (awaiting further information) were conference proceedings; we were unable to obtain further information on these studies and so did not include them in the review^{Reference Trivedi, Morrison, Daly, Singh, Fedgchin and Jamieson26,Reference Cohen, DeRubeis, Hayes, Watkins, Lewis and Byng27} (Fig. 1).

Fig. 1 PRISMA Flow Diagram.

Description of studies

Of the included studies (Table 1), three were development and external validation studies,^{Reference Klein, Holtman, Bockting, Heymans and Burger28–Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30} eight were development-only studies^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Backs-Dermott, Dobson and Jones31–Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} and one^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} was an external validation study. Three^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Mocking, Naviaux, Li, Wang, Monk and Bright35,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} of the development-only studies reported internal validation. No prognostic model was externally validated in more than one included study and, therefore, a meta-analysis was not necessary. All included studies used prospectively gathered data for developing the prognostic models. Four of the models were developed in secondary care,^{Reference Berlanga, Heinze, Torres, Apiquián and Caballero32–Reference Judd, Schettler and Rush34,Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} whereas the other seven were developed in primary care^{Reference Klein, Holtman, Bockting, Heymans and Burger28,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} or community settings.^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference van Loo, Aggen, Gardner and Kendler29–Reference Backs-Dermott, Dobson and Jones31,Reference Mocking, Naviaux, Li, Wang, Monk and Bright35} Van Loo et al (2020) used a data-set drawn from primary care, secondary care and community settings (the Netherlands Study of Depression and Anxiety (NESDA)) for external validation.^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} Further details of the studies can be found in Supplementary Table S1 (available at https://doi.org/10.1192/bjp.2021.218).

Table 1 Characteristics of included studies

MDE, major depressive episode; NA, not applicable; SCID, Structured Clinical Interview for DSM-IV; RCT, randomised controlled trial; MDD, major depressive disorders; SCL-90, Symptom Checklist 90; SEM, standard error of the mean IQR, interquartile range.

The Appendix summarises the specific outcome definitions used. The included studies covered a wide range of predictors (Table S2 outlines the different predictors included in the final models and how they were measured for the individual studies). Most commonly, these were disease-related characteristics and demographic factors. Some studies explored some less common predictors such as: neuropsychological predictors (emotional categorisation, emotional memory, and facial expression recognition);^{Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} personality characteristics such as neuroticism;^{Reference Berlanga, Heinze, Torres, Apiquián and Caballero32} psychosocial predictors such as life stress and interpersonal difficulties;^{Reference Backs-Dermott, Dobson and Jones31} biochemical predictors such as results from the corticotrophin-releasing factor test;^{Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} peripheral blood metabolomic markers;^{Reference Mocking, Naviaux, Li, Wang, Monk and Bright35} and combinations of items from the Symptom Checklist (SCL-90).^{Reference Judd, Schettler and Rush34}

Of the 11 development studies, nine used regression analysis (five used logistic regression^{Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30,Reference Berlanga, Heinze, Torres, Apiquián and Caballero32–Reference Judd, Schettler and Rush34,Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} and four used Cox proportional hazards regression to study time to recurrence.^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Klein, Holtman, Bockting, Heymans and Burger28,Reference van Loo, Aggen, Gardner and Kendler29,Reference Mocking, Naviaux, Li, Wang, Monk and Bright35} Of the remaining two included development studies, one used a machine learning support vector machine model to predict recurrence over a median period of 233 days^{Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} and the other used discriminant function analysis (DFA), a statistical method to identify which continuous variables (predictors) best discriminate between two or more groups (in this case, relapse or stable remission).^{Reference Backs-Dermott, Dobson and Jones31}

Predictive performance of prognostic models

The predictive performance of all included models is summarised in Table S2. Six of the model development studies identified^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Klein, Holtman, Bockting, Heymans and Burger28–Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30,Reference Mocking, Naviaux, Li, Wang, Monk and Bright35,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} reported internal validation to account for overfitting and optimism within the developed model. Three also reported external validation, using a data-set separate from the training data-set to give a truer reflection of model performance and generalisability.^{Reference Klein, Holtman, Bockting, Heymans and Burger28–Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30} Van Loo (2020)^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} presented the external validation of the model developed in Van Loo (2018).^{Reference Van Loo, Aggen, Gardner and Kendler25}

Klein (2018)^{Reference Klein, Holtman, Bockting, Heymans and Burger28} used a randomized controlled trial data-set separate from that used for development for external validation and presented a calibration slope of 0.56 (0.81 on internal validation) and a Harrell's C-statistic of 0.59 (0.56 on internal validation). Van Loo (2015)^{Reference van Loo, Aggen, Gardner and Kendler29} used a temporal cut-off to define their development and validation samples (temporal validation). They presented ‘comparable’ Kaplan–Meier curves as evidence that their prognostic model was well calibrated for people at lower risk of relapse but less so for higher-risk participants, and an AUC of 0.61 on external validation (0.79 on internal validation). Wang et al (2014)³¹ used data from the same source but from a different geographical region (geographical validation) to define development and external validation data-sets. The authors presented a C-statistic of 0.72, indicating good discrimination, and presented the result of the Hosmer–Lemeshow goodness-of-fit test (3.51, P = 0.9) as evidence of ‘excellent calibration’.

Van Loo et al (2020)^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} presented the results of the developed model in two ‘test’ sets. One of these, the Virginia Adult Twin Study of Psychiatric and Substance Use Disorder (VATSPSUD), was data from the same sample used in Van Loo et al (2018)²⁶ for model development and we have therefore classified this as an internal validation. The second test sample (NESDA) is separate from the development data-set and we have focused on this as the external validation. Discrimination was reported as good (AUC = 0.68 (95% CI 0.66–0.71) predicting recurrence over 0 to 2 years; AUC = 0.72 (95% CI 0.69–0.75) predicting recurrence over 0 to 9 years); calibration was not reported. Of the external validations included in this review, only Van Loo et al (2020)²⁵ included 95% CI for measures of predictive performance.

Klein et al (2018)²⁹ was the only included study to present all of the regression coefficients for the predictors included in the final model as well as the intercept and associated 95% CI. This model could therefore be used based on the information provided in the primary source. None of the included studies explored net benefit analysis (clinical utility) with respect to the developed models.

ROB and applicability assessment of included studies

We rated 11 of the 12 included studies as being at high overall ROB (see Fig. 2(a) and Supplementary Table 3). Only one study, Klein et al (2018),²⁹ was assessed to be at low ROB in all four domains. ROB was generally assessed as being low for most studies in the domains of participants and predictors. ROB was unclear for 8 out of 12 of the studies in the domain of outcomes, because the studies did not state that outcomes were determined masked to the predictor information. For the fourth domain (analysis), there was variable quality for the reported methods and some weaknesses and potential sources of bias were identified in this domain for 11 of the 12 included studies.

Fig. 2 (a): Risk of bias assessment (Prediction model risk of bias assessment tool (PROBAST)); (b): applicability assessment (PROBAST).

The most common weakness related to sample size or number of events, or both, a lack of which seriously and adversely impairs the ability of a statistical model in the real world because of a significant risk of overfitting.^{Reference Riley, Ensor, Snell, Harrell, Martin and Reitsma38} Most studies did not describe how the sample size was determined. Only one study^{Reference Klein, Holtman, Bockting, Heymans and Burger28} reported sufficient EPP for model development (104 recurrences for eight candidate predictor parameters). All other regression models^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference van Loo, Aggen, Gardner and Kendler29,Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30,Reference Berlanga, Heinze, Torres, Apiquián and Caballero32–Reference Mocking, Naviaux, Li, Wang, Monk and Bright35,Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} had inadequate sample size, according to PROBAST (see Method). The sample size determination used by Backs-Dermott et al (2010),^{Reference Backs-Dermott, Dobson and Jones31} which used DFA, appeared to be appropriate according to their reported methods.

Ruhe et al (2019)³⁷ used a machine learning approach for model development.^{Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} Formal guidance is lacking to aid sample size determinations for prognostic model studies using machine learning techniques. The guidance and literature that does exist suggests that we should demand, if anything, significantly larger sample sizes when using a machine learning approach to prognostic model development, with one paper estimating that one would need more than ten times the EPP required for regression models to achieve a stable AUC and small optimism.^{Reference Van Der Ploeg, Austin and Steyerberg39} This study did not have an adequate sample size according to any of the existing guidance and recommendations. For Van Loo et al (2020),^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24} although it was not explicitly stated, we made the assessment that the sample size probably met PROBAST requirements for external validation (at least 100 events).

Another limitation of the majority of the included studies (n = 8) was their handling of missing data. Multiple imputation was used to handle missing data in only four of the identified studies.^{Reference van Loo, Bigdeli, Milaneschi, Aggen and Kendler24,Reference Van Loo, Aggen, Gardner and Kendler25,Reference Klein, Holtman, Bockting, Heymans and Burger28,Reference Judd, Schettler and Rush34} The remaining studies either did not report their approach^{Reference Backs-Dermott, Dobson and Jones31–Reference Johansson, Lundh and Bjärehed33,Reference Pintor, Torres, Navarro, Martinez de Osaba, Matrai and Gastó37} or used non-PROBAST recommended approaches for handling missing data, such as imputing the mean^{Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} or single imputation.^{Reference van Loo, Aggen, Gardner and Kendler29,Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30} Finally, most studies (n = 11) did not present appropriate performance statistics. The PROBAST guidance recommends that, as a minimum, a calibration plot and discrimination statistics (C-statistic for binary and time-to-event outcome models) are presented as relevant performance measures for a prognostic model study.^{Reference Wolff, Moons, Riley, Whiting, Westwood and Collins19} Classification measures, such as sensitivity and specificity, can be presented in addition to calibration and discrimination statistics, but they have the drawback of loss of information and of requiring risk thresholds to be specified, often based on the data rather than on meaningful, clinical grounds. One study^{Reference Klein, Holtman, Bockting, Heymans and Burger28} presented both a calibration plot and C-statistic in line with minimum best practice.

We had low concern about applicability for all included studies except for one,^{Reference Berlanga, Heinze, Torres, Apiquián and Caballero32} which was rated at an unclear level of concern (Fig. 2(b)). It was unclear whether all participants had reached remission and it appears that a proportion of participants would have met the criteria for depression according to the Hamilton Rating Scale for Depression.

Discussion

This is the first systematic review looking at prognostic models predicting relapse and recurrence of depression. We have identified 11 unique models, across 12 included studies. None of the models underwent independent external validation (i.e. by researchers not involved in the original model development) or net benefit analysis to assess clinical utility. Only one of the included models was found to be at overall low ROB^{Reference Klein, Holtman, Bockting, Heymans and Burger28} and the discrimination and calibration of this model were poor on external validation. We were guided by the recent prognosis literature and guidance in developing our review methods, searches and in critically appraising the included studies. Our planned meta-analysis was not necessary because of an insufficient number of studies reporting performance statistics for the same model.

Comparison with the previous literature

The findings from this review align with previous prognosis research in this area, the majority of which has focused on prognostic factors. In contrast to prognostic models, which provide individualised risk prediction of particular outcomes conditional on multiple factors, prognostic factor studies focus on the factors themselves and whether they add (causal or prognostic) value over existing factors. Two recent systematic reviews and meta-analyses have explored prognostic factors associated with relapse and recurrence of depression.^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6,Reference Wojnarowski, Firth, Finegan and Delgadillo40} There is ‘strong evidence’ that residual depressive symptoms are prognostic for relapse and recurrence, and ‘good’ evidence that the number of previous episodes are associated with increased risk of relapse and recurrence.^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6} In addition, the following factors are associated with relapse and recurrence: childhood maltreatment, comorbid anxiety, neuroticism, age at first onset, rumination,^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6} experiencing a higher number of dependent chronic stressors, or a severe independent life event post-treatment.^{Reference Wojnarowski, Firth, Finegan and Delgadillo40}

Individual participant data meta-analyses have also been used to explore prognostic and prescriptive factors^{Reference Kuyken, Warren, Taylor, Whalley, Crane and Bondolfi41,Reference Breedvelt, Warren, Segal, Kuyken and Bockting42} and have been broadly in agreement, finding that younger age at onset, residual symptoms and a shorter duration of remission are associated with an increased risk of relapse. The prescriptive value of these factors remains uncertain. Previous research has also found a higher odds of recurrence associated with both psychosocial impairment and poor coping skills, and that avoidant coping style and ‘daily hassles/life events’ were predictive of recurrence.^{Reference Beshai, Dobson, Bockting and Quigley2,Reference Hardeveld, Spijker, De Graaf, Nolen and Beekman43}

The number of previous episodes was the most common included predictor across the models identified in this review (n = 6).^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Klein, Holtman, Bockting, Heymans and Burger28–Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30,Reference Johansson, Lundh and Bjärehed33,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} The presence of residual symptoms was used as a predictor only in one developed model.^{Reference Klein, Holtman, Bockting, Heymans and Burger28} Childhood maltreatment was included as a predictor in four of our included studies,^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference van Loo, Aggen, Gardner and Kendler29,Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} comorbid anxiety in three,^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference van Loo, Aggen, Gardner and Kendler29,Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30} neuroticism in one^{Reference Berlanga, Heinze, Torres, Apiquián and Caballero32} and age of onset in two models.^{Reference Van Loo, Aggen, Gardner and Kendler25,Reference Ruhe, Mocking, Figueroa, Seeverens, Ikani and Tyborowska36} Notably, rumination was not explored as a predictor in any of the included prognostic models, despite good evidence that this is associated with increased risk of relapse.^{Reference Buckman, Underwood, Clarke, Saunders, Hollon and Fearon6,Reference Hardeveld, Spijker, De Graaf, Nolen and Beekman43}

Wang et al (2014)^{Reference Wang, Patten, Sareen, Bolton, Schmitz and MacQueen30} found that marital status ‘contributed to’ the prediction of recurrence, whereas Johansson et al (2015)^{Reference Johansson, Lundh and Bjärehed33} included having a partner or not as one of the two predictors in their final model (odds ratio of 0.12 (95% CI 0.02–0.64), P = 0.01). The extant literature does not support marital status as a predictor of recurrence^{Reference Burcusa and Iacono4,Reference Evans, Hollon and DeRubeis44} and weaknesses in the methodology of the prognostic model studies mean that we cannot make conclusive statements about this but, given the strength of the association presented,^{Reference Johansson, Lundh and Bjärehed33} the prognostic significance of ‘having a partner or not’ warrants further investigation. The model development study by Van Loo et al (2018)^{Reference Van Loo, Aggen, Gardner and Kendler25} supports the findings of earlier research suggesting that gender is unlikely to be predictive of relapse.

There have been some previous attempts to derive and validate multivariable prognostic models to predict depression-related outcomes other than relapse and recurrence. Existing prognostic models for depression outcomes include a model (the Depression Outcomes Calculator-Six Items, (DOC-6©)) to predict remission (C-statistic (AUC) of 0.62, 95% CI 0.57–0.66) or persistent depressive symptoms (C-statistic (AUC) of 0.67, 95% CI 0.61–0.72) at 6 months’ post-diagnosis;¹¹ a model to predict persistent symptoms at six months (C-statistic not reported; R ² of 0.40 in the development sample and 0.27 in the validation sample);^{Reference Rubenstein L, Rayburn, Keeler, Ford, Rost and Sherbourne45} and a model to predict onset of depression in general practice attendees who did not currently have depression (C-statistic of 0.79, 95% CI 0.77–0.81).¹¹ The studies in this review present predictive performance statistics broadly in line with these, suggesting that successful individualised prediction might be possible for depression outcomes, but better quality studies and potentially different combinations of predictors are needed to explore this further.

Implications for clinical practice and research

Relapse and recurrence occur in a significant proportion of people with remitted depression and are a source of considerable morbidity. The economic burden of depression is higher in those who experience relapse or recurrence than in those who do not^{Reference Gauthier, Mucha, Shi and Guerin46} and, although interventions to prevent relapse or recurrence of depression (including pharmacological and psychological approaches) can be resource-intensive, they are effective^{Reference Clarke, Mayo-Wilson, Kenny and Pilling47–Reference Breedvelt, Brouwer, Harrer, Semkovska, Ebert and Cuijpers49} and cost-effective.^{Reference Klein, Wijnen, Lokkerbol, Buskens, Elgersma and van Rijsbergen50} Implementation research is needed to ensure that such interventions can be made available to a greater number of patients in a scalable and feasible way.

A potentially effective way of ensuring efficient allocation of relapse prevention interventions is by risk-stratifying patients according to risk of relapse and recurrence. Interventions can then be provided to those most likely to benefit from them. The aetiology of depression and depressive relapse is multifaceted, and multivariable models are likely to be a more helpful approach to predicting outcomes than relying on the presence or absence of single prognostic factors. None of the prognostic models identified in this review had sufficiently high-performance metrics to enable a personalised approach to relapse prevention for depression at present.

We reported some key methodological weaknesses in the studies identified in this review, particularly with respect to sample size. Unless the sample size is adequate, there will be limitations to how far we can trust the predictive performance statistics presented by the model development study as overfitting is likely. Going forward, it might be that data from multiple sources should be combined and harmonised to increase the available sample size for model development. A further consideration is that the data in the included studies were taken from samples collected for other purposes, for example randomised controlled trials and longitudinal cohort studies. Although these are considered acceptable and feasible sources of data for prognostic model studies,^{Reference Pajouheshnia, Groenwold, Peelen, Reitsma and Moons51} there may be advantages to prospectively gathering data (in a pre-designed prospective cohort study) with the explicit purpose of prognostic model development.^{Reference Riley, van der Windt, Croft and Moons7} A benefit of this is that researchers can control the collection and ensure standardised measurement of predictor and outcome information, but such an approach is more costly and time-consuming than the secondary analysis of pre-existing data and would require a commitment to resource and fund such work. The International Taskforce for relapse prevention of depression (ITFRA) (www.itfra.org) have begun to address these issues by bringing together data from trials of existing relapse prevention interventions and aiming to harmonise predictor and outcome measurement to improve personalised medicine in this area. Work is also underway aiming to move beyond stratification to provide more robust evidence for treatment moderators and prescriptive factors in relapse prevention.^{Reference Breedvelt, Warren, Brouwer, Karyotaki, Kuyken and Cuijpers52}

Most of the included predictors in the studies identified in this review were clinical or demographic variables. It is possible that including a greater number of biomarkers or genetic information may help move towards such a precision medicine approach, as has been shown promising in a number of other areas, including diagnosing mood disorders.^{Reference Le-Niculescu, Roseberry, Gill, Levey, Phalen and Mullen53} Nevertheless, such an approach may not be clinically feasible, and an important consideration for researchers is the context and setting in which a prognostic model is intended to be used. Models intended for a primary care setting, for example, may need to focus on a different set of predictors than those intended for use within a specialist service. Primary care-based models would ideally need to include predictors that were available and routinely collected in primary care, such as demographics, socioeconomic information, comorbidities and depression history characteristics.

This review has highlighted a range of statistical approaches to prognostic model development, from ‘traditional’ regression-based techniques to those using machine learning. Machine learning approaches offer the potential of greater predictive performances than more traditional approaches.^{Reference Tiffin and Paton54} However, this not always the case, as some studies^{Reference Tate, McCabe, Larsson, Lundström, Lichtenstein and Kuja-Halkola55} have shown. The technique can also be criticised for lack of interpretability, and variable reporting standards, although the forthcoming TRIPOD-AI may encourage greater consistency in this regard. When designing future prognosis research, researchers should be mindful of the relative benefits and disadvantages associated with different methodological approaches. Prognosis research has grown as an area over recent years^{Reference Riley, van der Windt, Croft and Moons7} and, with the development of the PROGRESS initiative, there are now standards and guidelines for conducting,^{Reference Steyerberg, Moons, van der Windt, Hayden and Perel56} reporting^{Reference Moons, Altman, Reitsma, Ioannidis, Macaskill and Steyerberg57} and appraising^{Reference Wolff, Moons, Riley, Whiting, Westwood and Collins19} prognostic model studies. Future studies looking to develop prognostic models for relapse and recurrence of depression should follow best practice guidance when designing methodology, and should be reported in line with the TRIPOD statement.^{Reference Moons, Altman, Reitsma, Ioannidis, Macaskill and Steyerberg57}

In conclusion, this review identified 11 prognostic models developed to predict the risk of relapse or recurrence in people with remitted depression. The models were developed in a variety of clinical settings and patient populations and with a range of included predictors. We are not yet at the point where we can reliably predict outcomes for a given person with remitted depression based on their demographic, clinical and disease-level characteristics. This review suggests that this might be possible, although the studies identified here were limited by their high ROB because of methodological weaknesses. Researchers should conform to best practice when developing prognostic models in future. Beyond this, any such prognostic models will require good-quality external validation, assessment of clinical utility and evaluation of implementation before they can successfully be translated into clinical practice.

Supplementary material

Supplementary material is available online at https://doi.org/10.1192/bjp.2021.218.

Acknowledgements

This article is based on a Cochrane review published in the Cochrane Database of Systematic Reviews (CDSR) 2021, Issue 5, DOI: 10.1002/14651858.CD013491.pub2 (see www.cochranelibrary.com for information).¹³ Cochrane Reviews are regularly updated as new evidence emerges and in response to feedback, and the CDSR should be consulted for the most recent version of the review. We thank the Cochrane Prognosis Methods Group for providing guidance and the editorial team of the Cochrane Common Mental Disorders (CCMD) Group. The authors are grateful to the following Patient Advisory Group members who contributed to and provided constructive feedback on the final review: Gregory Ball, Joanne Castleton, Gillian Payne, Sue Penn and Emma Williams. The authors thank Professor Trevor Sheldon and Professor Paul Tiffin, who have provided comments and advice on drafts of this review through their roles as Thesis Advisory Panel members. Thanks to Johanna Damen (Cochrane Prognosis Methods Group), Professor Patty Chondros (Department of General Practice, University of Melbourne) and Karen Morley (Cochrane Consumer) who provided peer review on the original Cochrane review.

Author contribution

A.S.M.: lead author of the review. Responsible for screening and selection of studies, data extraction, ‘Characteristics of studies’ tables, ROB and applicability assessment. N.M.: contributed to the write-up of the review. Responsible for screening and selection of studies, data extraction, ‘Characteristics of studies’ tables, ROB and applicability assessment. K.I.E.S.: third review author in screening of references and selection of studies and ‘Risk of bias’ assessment. Contributed to the write-up of the review. Methodological expertise. R.D.R.: contributed to the write-up of the review. Methodological expertise. L.W.P.: contributed to the write-up of the review. S.D.: developed and conduction information searching strategy. J.H.: contributed to review and write-up of manuscript. S.G.: contributed to the conception of the review. Content expertise. C.A.C.G.: contributed to the conception and write-up of the review. Content expertise. R.C.: contributed to the write-up of the review. R.S.P.: commented on the final draft and provided methodological expertise. S.A.: contributed to the conception of the review and commented on the final draft. D.M.: contributed to the conception of the review and commented on the final draft. Content expertise.

Funding

A.S.M. is funded by a NIHR Doctoral Research Fellowship for this research project (NIHR Doctoral Research Fellowship, Dr Andrew Moriarty, DRF-2018-11-ST2-044). K.I.E.S. is funded by the NIHR School for Primary Care Research (SPCR Launching Fellowship). This publication presents independent research funded by the NIHR. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care.

Declaration of interest

None.

Appendix

PICOTS criteria

References

World Health Organization. Depression. WHO, 2018 (https://www.who.int/news-room/fact-sheets/detail/depression%0D).Google Scholar

Beshai, S, Dobson, KS, Bockting, CLH, Quigley, L. Relapse and recurrence prevention in depression: Current research and future prospects. Clin Psychol Rev 2011; 31: 1349–60.CrossRef Google Scholar PubMed

Ali, S, Rhodes, L, Moreea, O, McMillan, D, Gilbody, S, Leach, C, et al. How durable is the effect of low intensity CBT for depression and anxiety? Remission and relapse in a longitudinal cohort study. Behav Res Ther 2017; 94: 1–8.CrossRef Google Scholar

Burcusa, SL, Iacono, WG. Risk for recurrence in depression. Clin Psychol Rev 2007; 27: 959–85.CrossRef Google Scholar PubMed

Post, M. Transduction of psychosocial stress into the neurobiology of recurrent affective disorder. Depression 1992; 149: 999–1010.Google Scholar PubMed

Buckman, JEJ, Underwood, A, Clarke, K, Saunders, R, Hollon, SD, Fearon, P, et al. Risk factors for relapse and recurrence of depression in adults and how they operate: a four-phase systematic review and meta-synthesis. Clin Psychol Rev 2018; 64: 13–38.CrossRef Google Scholar PubMed

Riley, RD, van der Windt, D, Croft, P, Moons, K. Prognosis Research in Healthcare: Concepts, Methods, and Impact (1st edn). Oxford University Press, 2019.CrossRef Google Scholar

Bone, C, Simmonds-Buckley, M, Thwaites, R, Sandford, D, Merzhvynska, M, Rubel, J, et al. Dynamic prediction of psychological treatment outcomes: development and validation of a prediction model using routinely collected symptom data. Lancet Digit Heal 2021; 3: e231–40.CrossRef Google Scholar PubMed

van Bronswijk, SC, Lemmens, LHJM, Keefe, JR, Huibers, MJH, DeRubeis, RJ, Peeters, FPML. A prognostic index for long-term outcome after successful acute phase cognitive therapy and interpersonal psychotherapy for major depressive disorder. Depress Anxiety 2019; 36: 252–61.CrossRef Google Scholar PubMed

Angstman, KB, Garrison, GM, Gonzalez, CA, Cozine, DW, Cozine, EW, Katzelnick, DJ. Prediction of primary care depression outcomes at six months: validation of DOC-6 ©. J Am Board Fam Med 2017; 30: 281–7.CrossRef Google Scholar PubMed

King M, Walker C, Levy G, Bottomley C, Royston P, Weich S, et al. Development and validation of an international risk prediction algorithm for episodes of major depression in general practice attendees: the PredictD study. Arch Gen Psychiatry 2008; 65: 1368–76.CrossRef Google Scholar

Moriarty, AS, Meader, N, Gilbody, S, Chew-Graham, CA, Churchill, R, Ali, S, et al. Prognostic models for predicting relapse or recurrence of depression. Cochrane Database Syst Rev 2019; 12: CD013491 (https://www.cochranelibrary.com/cdsr/doi/10.1002/14651858.CD013491/full).Google Scholar

Moriarty, AS, Meader, N, Snell, KIE, Riley, RD, Paton, LW, Chew-Graham, CA, et al. Prognostic models for predicting relapse or recurrence of major depressive disorder in adults. Cochrane Database Syst Rev 2021; 5: CD013491 (https://www.cochranelibrary.com/cdsr/doi/10.1002/14651858.CD013491.pub2/full).Google Scholar PubMed

Page, MJ, McKenzie, JE, Bossuyt, PM, Boutron, I, Hoffmann, TC, Mulrow, CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021; 372: n71.Google Scholar PubMed

Debray, TPA, Damen, JAAG, Snell, KIE, Ensor, J, Hooft, L, Reitsma, JB, et al. A guide to systematic review and meta-analysis of prediction model performance. BMJ 2017; 356: i6460.Google Scholar

Bockting, CL, Hollon, SD, Jarrett, RB, Kuyken, W, Dobson, K. A lifetime approach to major depressive disorder: The contributions of psychological interventions in preventing relapse and recurrence. Clin Psychol Rev 2015; 41: 16–26.CrossRef Google Scholar PubMed

Frank, E, Prien, RF, Jarrett, RB, Keller, MB, Kupfer, DJ, Lavori, PW, et al. Conceptualization and rationale for consensus definitions of terms in major depressive disorder: remission, recovery, relapse, and recurrence. JAMA Psychiatry 1991; 48: 851–5.Google Scholar PubMed

Rush, AJ, Kraemer, HC, Sackeim, HA, Fava, M, Trivedi, MH, Frank, E, et al. Report by the ACNP Task Force on response and remission in major depressive disorder. Neuropsychopharmacology 2006; 31: 1841–53.CrossRef Google Scholar PubMed

Wolff, RF, Moons, KGM, Riley, RD, Whiting, PF, Westwood, M, Collins, GS, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med 2019; 170: 51.CrossRef Google Scholar PubMed

Collins, GS, De Groot, JA, Dutton, S, Omar, O, Shanyinde, M, Tajar, A, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol 2014; 14: 1–11.CrossRef Google Scholar PubMed

Debray, TPA, Damen, JAAG, Riley, RD, Snell, K, Reitsma, JB, Hooft, L, et al. A framework for meta-analysis of prediction model studies with binary and time-to-event outcomes. Stat Methods Med Res 2019; 28(9): 2768–86.CrossRef Google Scholar PubMed

Vickers, AJ, Van Calster, B, Steyerberg, EW. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ 2016; 352: 3–7.Google Scholar

Moons, KGM, Wolff, RF, Riley, RD, Whiting, PF, Westwood, M, Collins, GS, et al. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: Explanation and elaboration. Ann Intern Med 2019; 170: W1–33.CrossRef Google Scholar PubMed

van Loo, HM, Bigdeli, TB, Milaneschi, Y, Aggen, SH, Kendler, KS. Data mining algorithm predicts a range of adverse outcomes in major depression. J Affect Disord 2020; 276: 945–53.CrossRef Google Scholar PubMed

Van Loo, HM, Aggen, SH, Gardner, CO, Kendler, KS. Sex similarities and differences in risk factors for recurrence of major depression. Psychol Med 2018; 48: 1685–93.CrossRef Google Scholar PubMed

Trivedi, M, Morrison, R, Daly, E, Singh, JB, Fedgchin, M, Jamieson, C, et al. Biobehavioral prediction of relapse in major depression: a prospective, multicenter, observational study. Neuropsychopharmacology 2016; 41 (Suppl 1): S517–8.Google Scholar

Cohen, Z, DeRubeis, R, Hayes, R, Watkins, E, Lewis, G, Byng, R, et al. The development and internal evaluation of a predictive model to identify for whom mindfulness-based cognitive therapy offers superior relapse prevention for recurrent depression versus maintenance antidepressant medication. Biol Psychiatry 2021; 89 (9 Supp): S36–S37.CrossRef Google Scholar

Klein, NS, Holtman, GA, Bockting, CLH, Heymans, MW, Burger, H. Development and validation of a clinical prediction tool to estimate the individual risk of depressive relapse or recurrence in individuals with recurrent depression. J Psychiatr Res 2018; 104: 1–7.CrossRef Google Scholar PubMed

van Loo, HM, Aggen, SH, Gardner, CO, Kendler, KS. Multiple risk factors predict recurrence of major depressive disorder in women. J Affect Disord 2015; 180: 52–61.CrossRef Google Scholar PubMed

Wang, JL, Patten, S, Sareen, J, Bolton, J, Schmitz, N, MacQueen, G. Development and validation of a prediction algorithm for use by health professionals in prediction of recurrence of major depression. Depress Anxiety 2014; 31: 451–7.CrossRef Google Scholar PubMed

Backs-Dermott, BJ, Dobson, KS, Jones, SL. An evaluation of an integrated model of relapse in depression. J Affect Disord 2010; 124: 60–7.CrossRef Google Scholar PubMed

Berlanga, C, Heinze, G, Torres, M, Apiquián, R, Caballero, A. Personality and clinical predictors of recurrence of depression. Psychiatr Serv 1999; 50: 376–80.CrossRef Google Scholar PubMed

Johansson, O, Lundh, LG, Bjärehed, J. 12-Month outcome and predictors of recurrence in psychiatric treatment of depression: a retrospective study. Psychiatr Q 2015; 86: 407–17.CrossRef Google Scholar PubMed

Judd, LL, Schettler, PJ, Rush, AJ. A brief clinical tool to estimate individual patients’ risk of depressive relapse following remission: proof of concept. Am J Psychiatry 2016; 173: 1140–6.CrossRef Google Scholar PubMed

Mocking, RJT, Naviaux, JC, Li, K, Wang, L, Monk, JM, Bright, AT, et al. Metabolic features of recurrent major depressive disorder in remission, and the risk of future recurrence. Transl Psychiatry 2021; 11: 37.CrossRef Google Scholar PubMed

Ruhe, HG, Mocking, RJT, Figueroa, CA, Seeverens, PWJ, Ikani, N, Tyborowska, A, et al. Emotional biases and recurrence in major depressive disorder. Results of 2.5 years follow-up of drug-free cohort vulnerable for recurrence. Front Psychiatry 2019; 10: 1–18.CrossRef Google Scholar PubMed

Pintor, L, Torres, X, Navarro, V, Martinez de Osaba, MJ, Matrai, S, Gastó, C. Prediction of relapse in melancholic depressive patients in a 2-year follow-up study with corticotropin releasing factor test. Prog Neuro-Psychopharmacology Biol Psychiatry 2009; 33: 463–9.CrossRef Google Scholar

Riley, RD, Ensor, J, Snell, KIE, Harrell, FE, Martin, GP, Reitsma, JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ 2020: m441.CrossRef Google Scholar PubMed

Van Der Ploeg, T, Austin, PC, Steyerberg, EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol 2014; 14: 1–13.CrossRef Google Scholar PubMed

Wojnarowski, C, Firth, N, Finegan, M, Delgadillo, J. Predictors of depression relapse and recurrence after cognitive behavioural therapy: a systematic review and meta-analysis. Behav Cogn Psychother 2019; 47: 514–29.CrossRef Google Scholar PubMed

Kuyken, W, Warren, FC, Taylor, RS, Whalley, B, Crane, C, Bondolfi, G, et al. Efficacy of mindfulness-based cognitive therapy in prevention of depressive relapse an individual patient data meta-analysis from randomized trials. JAMA Psychiatry 2016; 73: 565–74.CrossRef Google Scholar PubMed

Breedvelt, JJF, Warren, FC, Segal, Z, Kuyken, W, Bockting, CL. Continuation of antidepressants vs sequential psychological interventions to prevent relapse in depression: an individual participant data meta-analysis. JAMA Psychiatry 2021; 78: 868–75.CrossRef Google Scholar PubMed

Hardeveld, F, Spijker, J, De Graaf, R, Nolen, WA, Beekman, ATF. Prevalence and predictors of recurrence of major depressive disorder in the adult population. Acta Psychiatr Scand 2010; 122: 184–91.CrossRef Google Scholar PubMed

Evans, MD, Hollon, SD, DeRubeis, RJ, et al. differential relapse following cognitive therapy and pharmacotherapy for depression. Arch Gen Psychiatry 1992; 49: 802–808.CrossRef Google Scholar PubMed

Rubenstein L, V, Rayburn, NR, Keeler, EB, Ford, DE, Rost, KM, Sherbourne, CD. Predicting outcomes of primary care patients with major depression: development of a depression prognosis index. Psychiatr Serv 2007; 58: 1049–56.CrossRef Google Scholar PubMed

Gauthier, G, Mucha, L, Shi, S, Guerin, A. Economic burden of relapse/recurrence in patients with major depressive disorder. J Drug Assess 2019; 8: 97–103.CrossRef Google Scholar PubMed

Clarke, K, Mayo-Wilson, E, Kenny, J, Pilling, S. Can non-pharmacological interventions prevent relapse in adults who have recovered from depression? A systematic review and meta-analysis of randomised controlled trials. Clin Psychol Rev 2015; 39: 58–70.CrossRef Google Scholar PubMed

Geddes, JR, Carney, SM, Davies, C, Furukawa, TA, Kupfer, DJ, Frank, E. Relapse prevention with antidepressant drug treatment in depressive disorders: A systematic review. Lancet 2003; 361: 653–61.CrossRef Google Scholar PubMed

Breedvelt, JJF, Brouwer, ME, Harrer, M, Semkovska, M, Ebert, DD, Cuijpers, P, et al. Psychological interventions as an alternative and add-on to antidepressant medication to prevent depressive relapse: systematic review and meta-analysis. Br J Psychiatry 2021; 219: 538–45.CrossRef Google Scholar PubMed

Klein, NS, Wijnen, BFM, Lokkerbol, J, Buskens, E, Elgersma, HJ, van Rijsbergen, GD, et al. Cost-effectiveness, cost-utility and the budget impact of antidepressants versus preventive cognitive therapy with or without tapering of antidepressants. BJPsych Open 2019; 5: 1–9.CrossRef Google Scholar PubMed

Pajouheshnia, R, Groenwold, RHH, Peelen, LM, Reitsma, JB, Moons, KGM. When and how to use data from randomised trials to develop or validate prognostic models. BMJ 2019; 365: l2154.CrossRef Google Scholar PubMed

Breedvelt, JJF, Warren, FCW, Brouwer, MEB, Karyotaki, E, Kuyken, W, Cuijpers, P, et al. Individual participant data (IPD) meta-analysis of psychological relapse prevention interventions versus control for patients in remission from depression: a protocol. BMJ Open 2020; 10: 1–8.CrossRef Google Scholar PubMed

Le-Niculescu, H, Roseberry, K, Gill, SS, Levey, DF, Phalen, PL, Mullen, J, et al. Precision medicine for mood disorders: objective assessment, risk prediction, pharmacogenomics, and repurposed drugs. Mol Psychiatry 2021; 26: 2776–2804.CrossRef Google Scholar PubMed

Tiffin, PA, Paton, LW. Rise of the machines? Machine learning approaches and mental health: opportunities and challenges. Br J Psychiatry 2018; 213: 509–10.CrossRef Google Scholar PubMed

Tate, AE, McCabe, RC, Larsson, H, Lundström, S, Lichtenstein, P, Kuja-Halkola, R. Predicting mental health problems in adolescence using machine learning techniques. PLoS One 2020; 14: e0230389.CrossRef Google Scholar

Steyerberg, EW, Moons, KGM, van der Windt, DA, Hayden, JA, Perel, P. Prognosis research strategy (PROGRESS) 3: prognostic model research. PLoS Med 2013; 10: e10.CrossRef Google Scholar PubMed

Moons, KGM, Altman, DG, Reitsma, JB, Ioannidis, JPA, Macaskill, P, Steyerberg, EW, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): Explanation and elaboration. Ann Intern Med 2015; 162: W1–73.CrossRef Google Scholar PubMed

Fig. 1 PRISMA Flow Diagram.

Table 1 Characteristics of included studies

Fig. 2 (a): Risk of bias assessment (Prediction model risk of bias assessment tool (PROBAST)); (b): applicability assessment (PROBAST).

Moriarty et al. supplementary material

Moriarty et al. supplementary material 1

File 64.8 KB

Moriarty et al. supplementary material

Moriarty et al. supplementary material 2

File 116.1 KB

Moriarty et al. supplementary material

Moriarty et al. supplementary material 3

File 15 KB

Submit a response

eLetters

No eLetters have been published for this article.

Article contents

Predicting relapse or recurrence of depression: systematic review of prognostic models

Abstract

Keywords

Background

Objectives

Method

Eligibility criteria

Information sources and search strategy

Selection of studies

Data collection

Data synthesis and meta-analysis approaches

Risk of bias assessment in included studies

Results

Results of the search

Description of studies

Predictive performance of prognostic models

ROB and applicability assessment of included studies

Discussion

Comparison with the previous literature

Implications for clinical practice and research

Supplementary material

Acknowledgements

Author contribution

Funding

Declaration of interest

Appendix

PICOTS criteria

References

Moriarty et al. supplementary material

Moriarty et al. supplementary material

Moriarty et al. supplementary material

eLetters

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests