Factual and counterfactual learning in major adolescent depressive disorder, evidence from an instrumental learning study

Qiang Shen; Shiguang Fu; Xiaoying Jiang; Xiaoyu Huang; Doudou Lin; Qingyan Xiao; Sitti Khadijah; Yaping Yan; Xiaoxing Xiong; Jia Jin; Richard P. Ebstein; Ting Xu; Yiquan Wang; Jun Feng

doi:10.1017/S0033291723001307

Factual and counterfactual learning in major adolescent depressive disorder, evidence from an instrumental learning study

Published online by Cambridge University Press: 10 May 2023

Qiang Shen ,

Doudou Lin ,

Yaping Yan ,

Xiaoxing Xiong and

Jia Jin

...Show all authors

Show author details

Qiang Shen: Affiliation:
Shanghai Key Laboratory of Brain-Machine Intelligence for Information Behavior (Ministry of Education), 201620, Shanghai, China School of Business and Management, Shanghai International Studies University, 201620, Shanghai, China Joint Lab of Finance and Business Intelligence, Guangdong Institute of Intelligence Science and Technology, 519031, Zhuhai, China
Shiguang Fu: Affiliation:
Shanghai Key Laboratory of Brain-Machine Intelligence for Information Behavior (Ministry of Education), 201620, Shanghai, China School of Business and Management, Shanghai International Studies University, 201620, Shanghai, China Joint Lab of Finance and Business Intelligence, Guangdong Institute of Intelligence Science and Technology, 519031, Zhuhai, China
Xiaoying Jiang: Affiliation:
Hangzhou Mental Health Center of Children and Adolescents, Hangzhou Seventh People's Hospital, 310006, Hangzhou, China
Xiaoyu Huang: Affiliation:
Hangzhou Mental Health Center of Children and Adolescents, Hangzhou Seventh People's Hospital, 310006, Hangzhou, China
Doudou Lin: Affiliation:
School of Management, Zhejiang University of Technology, 310023, Hangzhou, China
Qingyan Xiao: Affiliation:
Shanghai Key Laboratory of Brain-Machine Intelligence for Information Behavior (Ministry of Education), 201620, Shanghai, China School of Business and Management, Shanghai International Studies University, 201620, Shanghai, China Joint Lab of Finance and Business Intelligence, Guangdong Institute of Intelligence Science and Technology, 519031, Zhuhai, China
Sitti Khadijah: Affiliation:
School of Management, Zhejiang University of Technology, 310023, Hangzhou, China
Yaping Yan: Affiliation:
Department of Neurology, The Second Affiliated Hospital of Zhejiang University, 310009, Hangzhou, China
Xiaoxing Xiong: Affiliation:
Department of Neurosurgery, Renmin Hospital of Wuhan University, 430060, Wuhan, China
Jia Jin: Affiliation:
Shanghai Key Laboratory of Brain-Machine Intelligence for Information Behavior (Ministry of Education), 201620, Shanghai, China School of Business and Management, Shanghai International Studies University, 201620, Shanghai, China Joint Lab of Finance and Business Intelligence, Guangdong Institute of Intelligence Science and Technology, 519031, Zhuhai, China
Richard P. Ebstein: Affiliation:
China Center for Behavioral Economics and Finance, Southwestern University of Finance & Economics, 611130, Chengdu, China
Ting Xu: Affiliation:
School of Business, University of Ningbo, 315210, Ningbo, China
Yiquan Wang*: Affiliation:
Hangzhou Mental Health Center of Children and Adolescents, Hangzhou Seventh People's Hospital, 310006, Hangzhou, China
Jun Feng*: Affiliation:
School of Economics, Hefei University of Technology, 230601, Hefei, China
*: Corresponding author: Yiquan Wang; Email: [email protected]; Jun Feng; Email: [email protected]
Corresponding author: Yiquan Wang; Email: [email protected]; Jun Feng; Email: [email protected]

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

Background

The incidence of adolescent depressive disorder is globally skyrocketing in recent decades, albeit the causes and the decision deficits depression incurs has yet to be well-examined. With an instrumental learning task, the aim of the current study is to investigate the extent to which learning behavior deviates from that observed in healthy adolescent controls and track the underlying mechanistic channel for such a deviation.

Methods

We recruited a group of adolescents with major depression and age-matched healthy control subjects to carry out the learning task with either gain or loss outcome and applied a reinforcement learning model that dissociates valence (positive v. negative) of reward prediction error and selection (chosen v. unchosen).

Results

The results demonstrated that adolescent depressive patients performed significantly less well than the control group. Learning rates suggested that the optimistic bias that overall characterizes healthy adolescent subjects was absent for the depressive adolescent patients. Moreover, depressed adolescents exhibited an increased pessimistic bias for the counterfactual outcome. Lastly, individual difference analysis suggested that these observed biases, which significantly deviated from that observed in normal controls, were linked with the severity of depressive symoptoms as measured by HAMD scores.

Conclusions

By leveraging an incentivized instrumental learning task with computational modeling within a reinforcement learning framework, the current study reveals a mechanistic decision-making deficit in adolescent depressive disorder. These findings, which have implications for the identification of behavioral markers in depression, could support the clinical evaluation, including both diagnosis and prognosis of this disorder.

Keywords

Choice bias depression reinforcement learning reward prediction error

Type: Original Article
Information: Psychological Medicine , Volume 54 , Issue 2 , January 2024 , pp. 256 - 266

DOI: https://doi.org/10.1017/S0033291723001307 [Opens in a new window]
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Footnotes

These authors contributed equally to this work.

References

Auerbach, R. P., Pagliaccio, D., & Pizzagalli, D. A. (2019). Toward an improved understanding of anhedonia. JAMA Psychiatry, 76(6), 571–573. doi:10.1001/jamapsychiatry.2018.4600.CrossRef Google Scholar PubMed

Bakic, J., Pourtois, G., Jepma, M., Duprat, R., De Raedt, R., & Baeken, C. (2017). Spared internal but impaired external reward prediction error signals in major depressive disorder during reinforcement learning. Depression and Anxiety, 34(1), 89–96. doi:10.1002/da.22576.CrossRef Google Scholar PubMed

Bao, H. W. S. (2022). bruceR: Broadly useful convenien and efficient R functions. R package version 0.8.x. Retrieved from https://CRAN.R-project.org/package=bruceR.Google Scholar

Bavard, S., Rustichini, A., & Palminteri, S. (2021). Two sides of the same coin: Beneficial and detrimental consequences of range adaptation in human reinforcement learning. Science Advances, 7(14), eabe0340. doi:10.1126/sciadv.abe0340.CrossRef Google Scholar PubMed

Berwian, I. M., Wenzel, J. G., Collins, A. G., Seifritz, E., Stephan, K. E., Walter, H., & Huys, Q. J. (2020). Computational mechanisms of effort and reward decisions in patients with depression and their association with relapse after antidepressant discontinuation. JAMA Psychiatry, 77(5), 513–522. doi:10.1001/jamapsychiatry.2019.4971.CrossRef Google Scholar PubMed

Bishop, S. J., & Gagne, C. (2018). Anxiety, depression, and decision making: A computational perspective. Annual Review of Neuroscience, 41, 371–388. doi:10.1146/annurev-neuro-080317-062007.CrossRef Google Scholar PubMed

Brolsma, S. C., Vassena, E., Vrijsen, J. N., Sescousse, G., Collard, R. M., van Eijndhoven, P. F., & …Cools, R. (2021). Negative learning bias in depression revisited: Enhanced neural response to surprising reward across psychiatric disorders. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 6(3), 280–289. doi:10.1016/j.bpsc.2020.08.011.Google Scholar PubMed

Bromberg-Martin, E. S., & Sharot, T. (2020). The value of beliefs. Neuron, 106(4), 561–565. doi:10.1016/j.neuron.2020.05.001.CrossRef Google Scholar PubMed

Broomhall, A. G., Phillips, W. J., Hine, D. W., & Loi, N. M. (2017). Upward counterfactual thinking and depression: A meta-analysis. Clinical Psychology Review, 55, 56–73. doi:10.1016/j.cpr.2017.04.010.CrossRef Google Scholar PubMed

Chambon, V., Thero, H., Vidal, M., Vandendriessche, H., Haggard, P., & Palminteri, S. (2020). Information about action outcomes differentially affects learning from self-determined versus imposed choices. Nature Human Behaviour, 4(10), 1067–1079. doi:10.1038/s41562-020-0919-5.CrossRef Google Scholar PubMed

Chase, H. W., Frank, M. J., Michael, A., Bullmore, E. T., Sahakian, B. J., & Robbins, T. W. (2010). Approach and avoidance learning in patients with major depression and healthy controls: Relation to anhedonia. Psychological Medicine, 40(3), 433–440. doi:10.1017/S0033291709990468.CrossRef Google Scholar PubMed

Clayborne, Z. M., Varin, M., & Colman, I. (2019). Systematic review and meta-analysis: Adolescent depression and long-term psychosocial outcomes. Journal of the American Academy of Child & Adolescent Psychiatry, 58(1), 72–79. doi:10.1016/j.jaac.2018.07.896.CrossRef Google Scholar PubMed

Fontanesi, L., Gluth, S., Spektor, M. S., & Rieskamp, J. (2019). A reinforcement learning diffusion decision model for value-based decisions. Psychonomic Bulletin & Review, 26(4), 1099–1121. doi:10.3758/s13423-018-1554-2.CrossRef Google Scholar PubMed

Frank, M. J., Seeberger, L. C., & O'reilly, R. C. (2004). By carrot or by stick: Cognitive reinforcement learning in Parkinsonism. Science (New York, N.Y.), 306(5703), 1940–1943. doi:10.1126/science.1102941.CrossRef Google Scholar PubMed

Frank, R. H. (2016). Success and luck. Princeton: Princeton University Press.Google Scholar

Gaure, S. (2013). lfe: Linear group fixed effects. The R Journal, 5(2), 104–116. doi:10.32614/RJ-2013-031.CrossRef Google Scholar

Gillan, C. M., Otto, A. R., Phelps, E. A., & Daw, N. D. (2015). Model-based learning protects against forming habits. Cognitive, Affective, & Behavioral Neuroscience, 15(3), 523–536. doi:10.3758/s13415-015-0347-6.CrossRef Google Scholar PubMed

Harrell, F. E. Jr. (2021). rms: Regression Modeling Strategies. R package version 6.2-0. Retrieved from https://CRAN.R-project.org/package=rms.Google Scholar

Hartzmark, S. M., Hirshman, S. D., & Imas, A. (2021). Ownership, learning, and beliefs. The Quarterly Journal of Economics, 136(3), 1665–1717. doi:10.1093/qje/qjab010.CrossRef Google Scholar

Kappes, A., Harvey, A. H., Lohrenz, T., Montague, P. R., & Sharot, T. (2019). Confirmation bias in the utilization of others’ opinion strength. Nature Neuroscience, 23(1), 130–137. doi:10.1038/s41593-019-0549-2.CrossRef Google Scholar PubMed

Korn, C. W., Sharot, T., Walter, H., Heekeren, H. R., & Dolan, R. J. (2014). Depression is related to an absence of optimistically biased belief updating about future life events. Psychological Medicine, 44(3), 579–592. doi:10.1017/S0033291713001074.CrossRef Google Scholar

Kraines, M. A., Krug, C. P., & Wells, T. T. (2017). Decision justification theory in depression: Regret and self-blame. Cognitive Therapy and Research, 41(4), 556–561. doi:10.1007/s10608-017-9836-y.CrossRef Google Scholar

Kube, T., Schwarting, R., Rozenkrantz, L., Glombiewski, J. A., & Rief, W. (2020). Distorted cognitive processes in major depression: A predictive processing perspective. Biological Psychiatry, 87(5), 388–398. doi:10.1016/j.biopsych.2019.07.017.CrossRef Google Scholar PubMed

Kumar, P., Goer, F., Murray, L., Dillon, D. G., Beltzer, M. L., Cohen, A. L., … Pizzagalli, D. A. (2018). Impaired reward prediction error encoding and striatal-midbrain connectivity in depression. Neuropsychopharmacology, 43(7), 1581–1588. doi:10.1038/s41386-018-0032-x.CrossRef Google Scholar PubMed

Lefebvre, G., Lebreton, M., Meyniel, F., Bourgeois-Gironde, S., & Palminteri, S. (2017). Behavioural and neural characterization of optimistic reinforcement learning. Nature Human Behaviour, 1(4), 1–9. doi:10.1038/s41562-017-0067.CrossRef Google Scholar

Lefebvre, G., Summerfield, C., & Bogacz, R. (2021). A normative account of confirmation bias during reinforcement learning. Neural Computation, 34(2), 1–31. doi:10.1162/neco_a_01455.Google Scholar

Lu, W. (2019). Adolescent depression: National trends, risk factors, and healthcare disparities. American Journal of Health Behavior, 43(1), 181–194. doi:10.5993/AJHB.43.1.15.CrossRef Google Scholar PubMed

Ma, Y., Li, S., Wang, C., Liu, Y., Li, W., Yan, X., … Han, S. (2016). Distinct oxytocin effects on belief updating in response to desirable and undesirable feedback. Proceedings of the National Academy of Sciences, 113(33), 9256–9261. doi:10.1073/pnas.1604285113.CrossRef Google Scholar PubMed

McFadden, D. (1973). Conditional logit analysis of qualitative choice behavior. In Zarembka, P. (Ed.), Frontiers in econometrics (pp. 105–142). New York: Academic Press.Google Scholar

Miletić, S., Boag, R. J., Trutti, A. C., Stevenson, N., Forstmann, B. U., & Heathcote, A. (2021). A new model of decision processing in instrumental learning tasks. Elife, 10, e63055. doi:10.7554/eLife.63055.CrossRef Google Scholar PubMed

Miller, L., & Campo, J. V. (2021). Depression in adolescents. New England Journal of Medicine, 385(5), 445–449. doi:10.1056/NEJMra2033475.CrossRef Google Scholar PubMed

Montague, P. R., Dolan, R. J., Friston, K. J., & Dayan, P. (2012). Computational psychiatry. Trends in Cognitive Sciences, 16(1), 72–80. doi:10.1016/j.tics.2011.11.018.CrossRef Google Scholar PubMed

Mukherjee, D., Filipowicz, A. L. S., Vo, K., Satterthwaite, T. D., & Kable, J. W. (2020). Reward and punishment reversal-learning in major depressive disorder. Journal of Abnormal Psychology, 129(8), 810–823. doi:10.1037/abn0000641.CrossRef Google Scholar PubMed

Mullen, K., Ardia, D., Gil, D. L., Windover, D., & Cline, J. (2011). DEoptim: An R package for global optimization by differential evolution. Journal of Statistical Software, 40(6), 1–26. doi:10.18637/jss.v040.i06.CrossRef Google Scholar

Ng, T. H., Alloy, L. B., & Smith, D. V. (2019). Meta-analysis of reward processing in major depressive disorder reveals distinct abnormalities within the reward circuit. Translational Psychiatry, 9(1), 293. doi:10.1038/s41398-019-0644-x.CrossRef Google Scholar PubMed

Nielson, D. M., Keren, H., O'Callaghan, G., Jackson, S. M., Douka, I., Vidal-Ribas, P., … Stringaris, A. (2021). Great expectations: A critical review of and suggestions for the study of reward processing as a cause and predictor of depression. Biological Psychiatry, 89(2), 134–143. doi:10.1016/j.biopsych.2020.06.012.CrossRef Google Scholar

Niv, Y., Edlund, J. A., Dayan, P., & O'Doherty, J. P. (2012). Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain. Journal of Neuroscience, 32(2), 551–562. doi:10.1523/JNEUROSCI.5498-10.2012.CrossRef Google Scholar PubMed

Palminteri, S., Kilford, E. J., Coricelli, G., & Blakemore, S. J. (2016). The computational development of reinforcement learning during adolescence. PLoS Computational Biology, 12(6), e1004953. doi:10.1371/journal.pcbi.1004953.CrossRef Google Scholar PubMed

Palminteri, S., Lefebvre, G., Kilford, E. J., & Blakemore, S. J. (2017). Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing. PLoS Computational Biology, 13(8), e1005684. doi:10.1371/journal.pcbi.1005684.CrossRef Google Scholar PubMed

Paus, T., Keshavan, M., & Giedd, J. N. (2008). Why do many psychiatric disorders emerge during adolescence? Nature Reviews Neuroscience, 9(12), 947–957. doi:10.1038/nrn2513.CrossRef Google Scholar PubMed

Pedersen, M. L., & Frank, M. J. (2020). Simultaneous hierarchical Bayesian parameter estimation for reinforcement learning and drift diffusion models: A tutorial and links to neural data. Computational Brain & Behavior, 3(4), 458–471. doi:10.1007/s42113-020-00084-w.CrossRef Google Scholar PubMed

Pedersen, M. L., Frank, M. J., & Biele, G. (2017). The drift diffusion model as the choice rule in reinforcement learning. Psychonomic Bulletin & Review, 24(4), 1234–1251. doi:10.3758/s13423-016-1199-y.CrossRef Google Scholar PubMed

Pizzagalli, D. A., Iosifescu, D., Hallett, L. A., Ratner, K. G., & Fava, M. (2008). Reduced hedonic capacity in major depressive disorder: Evidence from a probabilistic reward task. Journal of Psychiatric Research, 43(1), 76–87. doi:10.1016/j.jpsychires.2008.03.001.CrossRef Google Scholar PubMed

Raio, C. M., Hartley, C. A., Orederu, T. A., Li, J., & Phelps, E. A. (2017). Stress attenuates the flexible updating of aversive value. Proceedings of the National Academy of Sciences, 114(42), 11241–11246. doi:10.1073/pnas.1702565114.CrossRef Google Scholar PubMed

Raven, J. C., & Court, J. H. (1998). Raven's progressive matrices and vocabulary scales (pp. 223–237). Oxford: Oxford Pyschologists Press.Google Scholar

Roese, N. J., Epstude, K. A. I., Fessel, F., Morrison, M., Smallman, R., Summerville, A., … Segerstrom, S. (2009). Repetitive regret, depression, and anxiety: Findings from a nationally representative survey. Journal of Social and Clinical Psychology, 28(6), 671–688. doi:10.1521/jscp.2009.28.6.671.CrossRef Google Scholar

Santomauro, D. F., Herrera, A. M. M., Shadid, J., Zheng, P., Ashbaugh, C., Pigott, D. M., … Ferrari, A. J. (2021). Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. The Lancet, 398(10312), 1700–1712. doi:10.1016/s0140-6736(21)02143-7.CrossRef Google Scholar

Seidel, E. M., Satterthwaite, T. D., Eickhoff, S. B., Schneider, F., Gur, R. C., Wolf, D. H., … Derntl, B. (2012). Neural correlates of depressive realism—An fMRI study on causal attribution in depression. Journal of Affective Disorders, 138(3), 268–276. doi:10.1016/j.jad.2012.01.041.CrossRef Google Scholar

Sharot, T. (2011). The optimism bias. Current Biology, 21(23), R941–R945. doi:10.1016/j.cub.2011.10.030.CrossRef Google Scholar PubMed

Sharot, T., & Garrett, N. (2016). Forming beliefs: Why valence matters. Trends in Cognitive Sciences, 20(1), 25–33. doi:10.1016/j.tics.2015.11.002.CrossRef Google Scholar PubMed

Sharot, T., Riccardi, A. M., Raio, C. M., & Phelps, E. A. (2007). Neural mechanisms mediating optimism bias. Nature, 450(7166), 102–105. doi:10.1016/j.cub.2011.10.030.CrossRef Google Scholar PubMed

Sharot, T., Velasquez, C. M., & Dolan, R. J. (2010). Do decisions shape preference? Evidence from blind choice. Psychological Science, 21(9), 1231–1235. doi:10.1177/0956797610379235.CrossRef Google Scholar PubMed

Stevanovic, D., Jancic, J., & Lakic, A. (2011). The impact of depression and anxiety disorder symptoms on the health-related quality of life of children and adolescents with epilepsy. Epilepsia, 52(8), e75–e78. doi:10.1111/j.1528-1167.2011.03133.x.CrossRef Google Scholar PubMed

Stringaris, A., Vidal-Ribas Belil, P., Artiges, E., Lemaitre, H., Gollier-Briant, F., & Wolke, S., … IMAGEN Consortium. (2015). The brain's response to reward anticipation and depression in adolescence: Dimensionality, specificity, and longitudinal predictions in a community-based sample. American Journal Psychiatry, 172(12), 1215–1223. doi:10.1176/appi.ajp.2015.14101298.CrossRef Google Scholar

Sugawara, M., & Katahira, K. (2021). Dissociation between asymmetric value updating and perseverance in human reinforcement learning. Scientific Reports, 11(1), 3574. doi:10.1038/s41598-020-80593-7.CrossRef Google Scholar PubMed

Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction. Boston: MIT press.Google Scholar

Tarantola, T. O., Folke, T., Boldt, A., Perez, O. D., & De Martino, B. (2021). Confirmation bias optimizes reward learning. bioRxiv, 2021-02. doi:10.1101/2021.02.27.433214.Google Scholar

Twenge, J. M., Cooper, A. B., Joiner, T. E., Duffy, M. E., & Binau, S. G. (2019). Age, period, and cohort trends in mood disorder indicators and suicide-related outcomes in a nationally representative dataset, 2005–2017. Journal of Abnormal Psychology, 128(3), 185–199. doi:10.1287/mnsc.2017.2931.CrossRef Google Scholar

Webb, R. (2019). The (neural) dynamics of stochastic choice. Management Science, 65(1), 230–255. doi:10.1287/mnsc.2017.2931.CrossRef Google Scholar

Wiehler, A., Chakroun, K., & Peters, J. (2021). Attenuated directed exploration during reinforcement learning in gambling disorder. Journal of Neuroscience, 41(11), 2512–2522. doi:10.1523/JNEUROSCI.1607-20.2021.CrossRef Google Scholar PubMed

Zhang, M., & He, Y. (2015). Psychiatric rating scale manual. Changsha: Hunan Science and Technology Press.Google Scholar

Shen et al. supplementary material

File 18.7 MB

Article contents

Factual and counterfactual learning in major adolescent depressive disorder, evidence from an instrumental learning study

Abstract

Keywords

Access options

Article purchase

Temporarily unavailable

Footnotes

References

Shen et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests