Hostname: page-component-586b7cd67f-dsjbd Total loading time: 0 Render date: 2024-11-30T21:06:59.972Z Has data issue: false hasContentIssue false

Building causal knowledge in behavior genetics without racial/ethnic diversity will result in weak causal knowledge

Published online by Cambridge University Press:  11 September 2023

Moin Syed*
Affiliation:
Department of Psychology, University of Minnesota, Minneapolis, MN, USA [email protected] https://cla.umn.edu/about/directory/profile/moin

Abstract

Behavior genetics often emphasizes methods over the underlying quality of the psychological information to which the methods are applied. A core aspect of this quality is the demographic diversity of the samples. Building causal genetic models based only on European-ancestry samples compromises their generalizability. Behavior genetics researchers must spend additional time and resources diversifying their samples before emphasizing causation.

Type
Open Peer Commentary
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press

Madole & Harden (M&H) propose that within-family genetic effects are analogous to randomized controlled trials (RCTs), and therefore can help identify potential genetic causes of psychological phenomena. In doing so, the authors repeat the frequent trope of RCTs as the “gold standard” (target article, sect. 1.1, para. 2) for establishing causal claims. Although it is clear that RCTs are optimally set up for such a task, using the language of “gold standard” has been criticized for obscuring the many limitations of the design (see Deaton & Cartwright [Reference Deaton and Cartwright2018] for a thorough treatment of the subject and Jones & Podolsky [Reference Jones and Podolsky2015] for a historical discussion). The uncritical use of RCTs as an analogue for genetic effects opens their arguments to many criticisms, ones that are already prominent in work on behavior genetics. Among others, these include insufficient attention to conceptualization (Nguyen, Syed, & McGue, Reference Nguyen, Syed and McGue2021) and measurement (Pelt, Schwabe, & Bartels, Reference Pelt, Schwabe and Bartels2022) of the target phenotype and lack of sample diversity when seeking to establish generalizable claims (Holden, Haughbrook, & Hart, Reference Holden, Haughbrook and Hart2022). By not incorporating these major concerns in their paper, M&H unfortunately compound, rather than resolve, these issues in their otherwise productive set of arguments.

The more general criticism levied toward both RCTs and behavior genetics is the over-emphasis on method over substance. M&H continue in this tradition, devoting careful attention to establishing and calibrating causal claims. This is clearly a useful and much-needed approach to thinking about causation, especially in psychology, but we must not overlook the quality of the substantive information about which we seek to make causal claims. Here, I highlight just one aspect of such quality, the racial/ethnic diversity of samples included in behavior genetic studies, and why such a consideration must be central to any effort to build generalizable causal knowledge.

It is a fact of the design that RCTs sacrifice external validity for the sake of internal validity, being high in efficacy, showing promising results in trials, but low in effectiveness, or lack of results when translated to real-life conditions (Flay et al., Reference Flay, Biglan, Boruch, Castro, Gottfredson, Kellam and Ji2005). RCTs have been further criticized by researchers in multicultural psychology and culturally adapted treatments for their lack of inclusion of racial/ethnic minorities and thus limited generalizability (Bernal & Scharrón-del-Río, Reference Bernal and Scharrón-del-Río2001; Castro, Barrera, & Holleran Steiker, Reference Castro, Barrera and Holleran Steiker2010; Whaley & Davis, Reference Whaley and Davis2007). Consistent with the mainstream view in psychology, the arguments made by M&H all assume a universal, replaceable person; it does not matter where the information comes from, so long as the proper modeling is applied (see also Yarkoni, Reference Yarkoni2022). But, of course, strong design features cannot overcome selection issues that can lead to misspecified models (Bradley et al., Reference Bradley, Kuriwaki, Isakov, Sejdinovic, Meng and Flaxman2021).

The research on lithium as a treatment for manic symptoms of bipolar disorder, highlighted by the authors, illustrates their lack of attention to sample diversity. The second-generation studies cited by M&H – those that seek to identify specific causal mechanisms beyond an average treatment effect – relied entirely on data from White men (Mertens et al., Reference Mertens, Wang, Kim, Yu, Pham, Yang and Yao2015; Santos et al., Reference Santos, Linker, Stern, Mendes, Shokhirev, Erikson and Gage2021; Stern et al., Reference Stern, Santos, Marchetto, Mendes, Rouleau, Biesmans and Gage2018). It may well be that the identified mechanisms are generalizable beyond this very narrow group, but it seems prudent to investigate this question prior to broadly claiming generalizable causal knowledge. Moreover, the authors frame this kind of second-generation investigation as addressing the problem of lack of generalizability beyond the first-generation average treatment effect, but it does so only with respect to individual differences, and not demographic/population heterogeneity.

An argument could be made to justify sample homogeneity in first-generation studies, when putative causal factors are initially identified, and explore generalizability as part of the second-generation process. This is related to the issue of “portability” of findings from genetic studies (or “effectiveness” with RCTs), which refers to the fact that average effects identified through first-generation behavior genetics studies may not generalize to new contexts or populations. Indeed, the impressive Lee et al. (Reference Lee, Wedow, Okbay, Kong, Maghzian, Zacher and Cesarini2018) study of educational attainment of 1.1 million individuals included only participants with European ancestry. When the researchers attempted to “port” the polygenic scores derived from the European-ancestry group to a sample of Black Americans, the 10.6% R 2 attenuated by 85%, a result that they indicated was “typical of what has been reported in other studies” (p. 1115). M&H fail to mention the degree of this problem, let alone what the implications are for building generalizable causal knowledge. For example, the omnigenic model highlights the need for diversity in discovery samples to identify and separate both core and peripheral variants (Mathieson, Reference Mathieson2021; see also Wojcik et al., Reference Wojcik, Graff, Nishimura, Tao, Haessler, Gignoux and Carlson2019). Thus, diversity is central to first-generation studies.

Moreover, in discussions of portability, left unsaid is from whom to whom the results are being ported, which is nearly always from White/European samples to other ancestry or racial groups. Rarely do we seek to, for example, generalize results from African samples to other groups (Adetula, Forscher, Basnight-Brown, Azouaghe, & IJzerman, Reference Adetula, Forscher, Basnight-Brown, Azouaghe and IJzerman2022). This dynamic sets up a standard in which the White/European results serve as the basis for the first-generation causal knowledge, and any deviations from it are problems to be solved or, more often, swept under the rug. Such a perspective is consistent with the deficit model that has for decades dominated psychological research on diversity (Cauce, Coronado, & Watson, Reference Cauce, Coronado, Watson, Hernandez and Mareasa1998), and unreasonably constrains the context of discovery.

The issues mentioned here are reminiscent of the aphorism “garbage in, garbage out” in the context of meta-analyses. That is, no degree of sophisticated analyses will save your substantive conclusions if the studies included therein are weak or uninformative. To be fair, M&H clearly know this, but the issue is treated more in passing rather than as a central concern in their quest to build stronger causal knowledge, a quest which I greatly support. They use a catch phrase in similar structure to “garbage in, garbage out” when discussing causal reasoning, “no causes in, no causes out,” which, fittingly, pertains to the reasoning and not the quality of the information within it. Combining these two, I might say, “no diversity in, no causes at all.”

Financial support

This research received no specific grant from any funding agency, commercial, or not-for-profit sectors.

Competing interest

None.

References

Adetula, A., Forscher, P. S., Basnight-Brown, D., Azouaghe, S., & IJzerman, H. (2022). Psychology should generalize from – not just to – Africa. Nature Reviews Psychology, 1, 370371. https://doi.org/10.1038/s44159-022-00070-yCrossRefGoogle Scholar
Bernal, G., & Scharrón-del-Río, M. R. (2001). Are empirically supported treatments valid for ethnic minorities? Toward an alternative approach for treatment research. Cultural Diversity and Ethnic Minority Psychology, 7(4), 328342. https://psycnet.apa.org/doi/10.1037/1099-9809.7.4.328CrossRefGoogle ScholarPubMed
Bradley, V. C., Kuriwaki, S., Isakov, M., Sejdinovic, D., Meng, X. L., & Flaxman, S. (2021). Unrepresentative big surveys significantly overestimated US vaccine uptake. Nature, 600(7890), 695700. https://doi.org/10.1038/s41586-021-04198-4CrossRefGoogle ScholarPubMed
Castro, F. G., Barrera, M. Jr, & Holleran Steiker, L. K. (2010). Issues and challenges in the design of culturally adapted evidence-based interventions. Annual Review of Clinical Psychology, 6, 213239. https://doi.org/10.1146/annurev-clinpsy-033109-132032CrossRefGoogle ScholarPubMed
Cauce, A. M., Coronado, N., & Watson, J. (1998). Conceptual, methodological, and statistical issues in culturally competent research. In Hernandez, M. & Mareasa, R. (Eds.), Promoting cultural competence in children's mental health services (pp. 305329). Paul Brooks.Google Scholar
Deaton, A., & Cartwright, N. (2018). Understanding and misunderstanding randomized controlled trials. Social Science & Medicine, 210, 221. https://doi.org/10.1016/j.socscimed.2017.12.005CrossRefGoogle ScholarPubMed
Flay, B. R., Biglan, A., Boruch, R. F., Castro, F. G., Gottfredson, D., Kellam, S., … Ji, P. (2005). Standards of evidence: Criteria for efficacy, effectiveness and dissemination. Prevention Science, 6(3), 151175. https://doi.org/10.1007/s11121-005-5553-yCrossRefGoogle ScholarPubMed
Holden, L. R., Haughbrook, R., & Hart, S. A. (2022). Developmental behavioral genetics research on school achievement is missing vulnerable children, to our detriment. New Directions for Child and Adolescent Development, 183-184, 4755. https://doi.org/10.1002/cad.20485.CrossRefGoogle ScholarPubMed
Jones, D. S., & Podolsky, S. H. (2015). The history and fate of the gold standard. The Lancet, 385(9977), 15021503. https://doi.org/10.1016/S0140-6736(15)60742-5CrossRefGoogle ScholarPubMed
Lee, J. J., Wedow, R., Okbay, A., Kong, E., Maghzian, O., Zacher, M., … Cesarini, D. (2018). Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nature Genetics, 50(8), 11121121. https://doi.org/10.1038/s41588-018-0147-3CrossRefGoogle ScholarPubMed
Mathieson, I. (2021). The omnigenic model and polygenic prediction of complex traits. The American Journal of Human Genetics, 108(9), 15581563. https://doi.org/10.1016/j.ajhg.2021.07.003CrossRefGoogle ScholarPubMed
Mertens, J., Wang, Q. W., Kim, Y., Yu, D. X., Pham, S., Yang, B., … Yao, J. (2015). Differential responses to lithium in hyperexcitable neurons from patients with bipolar disorder. Nature, 527(7576), 9599. https://doi.org/10.1038/nature15526CrossRefGoogle ScholarPubMed
Nguyen, P. L. L., Syed, M., & McGue, M. (2021). Behavior genetics research on personality: Moving beyond traits to examine characteristic adaptations. Social and Personality Psychology Compass, 15, e12628. https://doi.org/10.1111/spc3.12628CrossRefGoogle Scholar
Pelt, D., Schwabe, I., & Bartels, M. (2022). Using item response theory modeling to address bias in genetic findings for happiness, life satisfaction, and social support. PsyArXiv. https://doi.org/10.31234/osf.io/7uz9yGoogle Scholar
Santos, R., Linker, S. B., Stern, S., Mendes, A. P., Shokhirev, M. N., Erikson, G., … Gage, F. H. (2021). Deficient LEF1 expression is associated with lithium resistance and hyperexcitability in neurons derived from bipolar disorder patients. Molecular Psychiatry, 26(6), 24402456. https://doi.org/10.1038/s41380-020-00981-3CrossRefGoogle ScholarPubMed
Stern, S., Santos, R., Marchetto, M. C., Mendes, A. P. D., Rouleau, G. A., Biesmans, S., … Gage, F. H. (2018). Neurons derived from patients with bipolar disorder divide into intrinsically different sub-populations of neurons, predicting the patients’ responsiveness to lithium. Molecular Psychiatry, 23(6), 14531465. https://doi.org/10.1038/mp.2016.260CrossRefGoogle Scholar
Whaley, A. L., & Davis, K. E. (2007). Cultural competence and evidence-based practice in mental health services: A complementary perspective. American Psychologist, 62(6), 563574. https://doi.org/10.1037/0003-066X.62.6.563CrossRefGoogle ScholarPubMed
Wojcik, G. L., Graff, M., Nishimura, K. K., Tao, R., Haessler, J., Gignoux, C. R., … Carlson, C. S. (2019). Genetic analyses of diverse populations improves discovery for complex traits. Nature, 570(7762), 514518. https://doi.org/10.1038/s41586-019-1310-4CrossRefGoogle ScholarPubMed
Yarkoni, T. (2022). The generalizability crisis. Behavioral and Brain Sciences, 45, E1. https://doi.org/10.1017/S0140525X20001685.CrossRefGoogle Scholar