INTRODUCTION
The neuroscientific research of Damasio and his colleagues has greatly advanced our understanding of why “intuition” seems to guide so many aspects of human behavior (e.g., Bechara et al., Reference Bechara, Damasio and Damasio2000; Damasio, Reference Damasio1994). The principal tenet of their popular theory of decision-making—known as the Somatic Marker Hypothesis (SMH)—is that our ability to choose present options, which favorably influence future outcomes, depends crucially on an accumulated “emotional memory” of the consequences of our past interactions with similar events. In other words, we form a probabilistic impression of how a particular action will turn out in the future from an emotionally-biasing “gut” feeling, or emotional imprint, which was generated when that action caused either a positive or a negative reaction in the past. The SMH has garnered considerable empirical support, especially from case-control studies of those with impairment in the neural circuitry involved in judgment and memory—viz. patients with bilateral damage to the ventromedial prefrontal cortex (VMPFC)—and those with substance dependence.
The Iowa Gambling Task (IGT) is a cleverly designed research tool (Bechara et al., Reference Bechara, Damasio, Damasio and Anderson1994), which mimics the real-life contingencies required to choose advantageously in situations that involve either immediate or delayed rewards, and where the outcome of one's selection is probabilistic and uncertain. It is also the most frequently used neuropsychological measure of emotionally-based decision making, and the major plank of behavioral support for the SMH. While this task has strengths as an index of decision making, legitimate concerns have been raised about its ecological validity. For instance, most studies show that a substantial proportion of neurologically-normal (control) participants display a response pattern indistinguishable from patients with VMPFC lesions (see Dunn et al., Reference Dunn, Dalgleish and Lawrence2006). On the one hand, this may simply mirror the natural human diversity that exists in prudent decision-making ability. On the other hand, it casts doubt on the discriminative utility of the IGT as an index of impairment, and argues for the identification of individual differences that influence IGT performance irrespective of, or in addition to, clinical classifications (Glicksohn et al., Reference Glicksohn, Naor-Ziv and Leshem2007).
While some IGT studies have reported demographic data such as age, gender, and education level, or controlled for their influence, few have examined the effects of these factors in a systematic way. Educational attainment is a particularly relevant variable because it is strongly linked to intelligence (e.g., Bacete & Remirez, Reference Bacete and Remirez2001; Ganzach, Reference Ganzach2000), and because both factors are positively associated with novel problem solving and effective decision making (see Rindermann, Reference Rindermann2008). Of interest, Bechara and his colleagues have stated that performance on the IGT does not “seem to depend on education” (Bechara et al., Reference Bechara, Damasio and Damasio2000). More recently, Bechara (Reference Bechara2007) published normative data for the IGT and reported that education had only a “minor” effect on IGT scores. What is still unclear, however, is whether, and if so how, education level affects learning across trials on the IGT. We suspect it will have a significant moderating effect for the reasons mentioned above.
To date, we are aware of only one study that has undertaken a formal investigation of the education-IGT link in healthy participants (Evans et al., Reference Evans, Kemish and Turnbull2004). These authors reasoned that university-level education should be inversely correlated with performance on the IGT. While their findings appear to support this claim, they are rather counterintuitive in our view. Additionally, the study is limited by a very small sample, and by the fact that all participants were under the age of 25 years. The latter is especially relevant because the human prefrontal cortex (the locus of “executive function”) is still developing until adults are in their early to mid 20s (Gogtay et al., Reference Gogtay, Giedd, Lusk, Hayashi, Greenstein, Vaituzis, Nugent, Herman, Clasen, Toga, Rapoport and Thompson2004). The 25-year participant age limit also restricts the range of possible educational attainment.
Other IGT studies, in a variety of contexts, have included education level as a secondary factor in their analyses. Collectively, the results have been inconclusive, and none has replicated the findings of Evans et al. (Reference Evans, Kemish and Turnbull2004). For example, Fein et al. (Reference Fein, McGillivray and Finn2007) demonstrated a significant—albeit weak—positive relationship between years of education and IGT performance. However, the strength of this relationship was almost certainly attenuated by a restricted range in the education variable. A few other studies of nonrepresentative groups such as senior adults (Denburg et al., Reference Denburg, Recknor, Bechara and Tranel2006) and alcoholics (Dom et al., Reference Dom, De Wilde, Hulstijn, van den Brink and Sabbe2006) have also found a positive association between education and IGT performance. By contrast, no effect of education level was found in other special groups such as those with substance-dependence (Verdejo-Garcia et al., Reference Verdejo-Garcia, Vilar-Lopez, Perez-Garcia, Podell and Goldberg2006), suicide attempters (Jollant et al., Reference Jollant, Buresi, Guillaume, Jaussent, Bellivier, Leboyer, Castelnau, Malafosse and Courtet2007), and women with bulimia nervosa (Boeka & Lokken, Reference Boeka and Lokken2006).
Over the past 3 years, we have obtained IGT and demographic information, including education level, age, gender, and race, from a large sample of healthy adult women and men on two different versions of the IGT. The data provide an excellent opportunity to examine statistically the simultaneous influence of these variables on IGT performance, and to test our specific hypothesis that education level moderates learning across trials.
METHODS
Participants
Adult women (n = 285) and men (n = 167) between the ages of 25 and 50 years were enrolled in the study. Participants were recruited from advertisements placed at public institutions and in local newspapers, asking for “healthy volunteers to take part in a personality psychology study.” There was no difference in the mean age of the women (M age = 34.31 years; SD = 7.33) and the men (M age = 34.51 years; SD = 7.7). Seventy-two percent of the sample was Caucasian, and the racial distribution was not different between women and men (p = .378), nor among educational categories (p = .372). Given the high proportion of Caucasians in our sample, race will be treated as a dichotomous variable (Caucasians vs. non-Caucasians) in the subsequent data analyses.
As identified by a structured telephone screening before the testing appointment, participants were excluded if they had a serious medical condition or were being treated for any psychiatric condition including addiction disorders. This study was approved by the Human Research Ethics Board at the first author's institution.
Measures
Decision-Making was assessed by two computerized versions of the IGT. In the original task [IGT1] (Bechara et al., Reference Bechara, Damasio, Damasio and Anderson1994), participants were asked to choose among four decks of cards, two (A′& B′) yielding high immediate gain, but larger future losses; and two (C′& D′) yielding lower immediate gain but smaller future losses. They were told that the goal of the game is to accumulate as much money as possible by picking one card at a time, and that to win money, they should know that some decks are better than others. Scores on both tasks are typically calculated in 5 blocks of 20 trials (see Bechara & Damasio, Reference Bechara and Damasio2002), whereby the number of cards from the disadvantageous decks (A′& B′) are subtracted from the advantageous decks (C′& D′). Therefore, a positive score reflects a tendency to make good decisions. In the variant version of the task [IGT2], the contingencies are reversed—loss occurs on every trial while gain is delayed and intermittent (see Bechara et al., Reference Bechara, Dolan and Hindes2002). The advantageous decks (E′& G′) yield high immediate punishment, but higher future reward. The disadvantageous decks (F′& H′) yield low immediate punishment but lower future reward.
Education level
During a face-to-face interview, subjects were asked to report their highest level of education according to seven categories: 1 = some high school; 2 = completion of high school; 3 = some technical college; 4 = technical college diploma; 5 = some university; 6 = university undergraduate degree; 7 = current postgraduate training or postgraduate degree.
Procedure
After giving informed consent, the demographic variables were obtained from each participant. Approximately half the participants completed the ABCD task first while the remainder did the tasks in reverse order. The verbal instructions given to participants before the start of each gambling task were from a standard script provided by Bechara and his colleagues. To increase the probability that participants would play the games seriously, they were told that their payment for participation in the study would “depend on their performance on the computer tasks.” In fact, all participants were given the same cash payment at the end of the study.
Statistical Analyses
We used the lme4 package in R (an updated version of the software described in Pinheiro & Bates, Reference Pinheiro and Bates2000) to fit standard linear mixed-effects models to the data, assuming normally distributed errors and random effects. The traditional scoring of the gambling tasks results in a transformed proportion: 20 times the proportion of advantageous choices minus the proportion of disadvantageous choices. Tukey (Reference Tukey1977, page 498) calls this kind of variable a “plurality,” which, as a general matter, is not a favorable scale on which to analyze data by linear models. In this instance, the scores are negatively skewed, with approximately 13% of scores at the upper boundary of 20 and approximately 2% at the lower boundary of −20. We instead used the empirical logit (Cox & Snell, Reference Cox and Snell1989, page 32) of advantageous choices, defined as the log of the number of advantageous choices plus one-half, divided by the number of disadvantageous choices plus one-half. This transformation does not entirely remove floor and ceiling effects, but it does make the distribution of the dependent variable much more symmetric. To display the results of our analysis, we apply the inverse-logit transformation, to express the results on the traditional −20 to 20 plurality scale. As it turns out, we obtained nearly identical results whether we analyzed the plurality or the empirical-logit of advantageous choices, but we prefer the latter for the reasons stated.
RESULTS
Table 1 presents the net score means and standard deviations for the five blocks of trials for the two IGTs.
Table 2 displays the results of fitting several linear mixed-effects models to the data. In each case, we show the fixed and random effects, and the degrees of freedom for the model (i.e., its number parameters, including both fixed-effect coefficients and random-effect variance and covariance components). The deviance for each model is twice the negative of the maximized log-likelihood under the model and is a building block (analogous to the residual sum of squares for a traditional fixed-effects linear model) for likelihood-ratio tests of terms in the model. To test both fixed-effect parameters and variance and covariance components by likelihood-ratio tests, we used full maximum-likelihood (ML), as opposed to restricted maximum-likelihood (REML). Estimates of fixed effects for our final model (model 11 in Table 2) are essentially identical for the ML and REML estimates.
Model 1 is the most complex that we fit to the data, including fixed main effects of the “demographic” control variables sex, race, and age, interactions among education, task, and trial blocks (represented as a fourth-order polynomial), and random effects representing the interaction between task and trial blocks (again as a fourth-order polynomial). Because there are five trial blocks, a fourth-order polynomial will capture any pattern of change over their course. We checked first whether we could simplify the trend over trial blocks, by a step-down test, representing blocks in the fixed and random parts of the model by a third-order polynomial, producing model 2. The likelihood ratio test contrasting models 1 and 2, given in the first line of the lower panel of Table 2, shows that the quartic term in highly significant. Models 3 through 6 allow us to test the fixed effects of sex, race, age (represented as a second-order polynomial, because age effects are often quadratic), and the three-way interactions among education, task, and trial blocks. These tests are given in the lower panel of Table 2.
The race effect is just significant at the 5% level, and the sex effect just nonsignificant. Neither the age effect nor the interaction among education, task, and trial blocks proved to be statistically significant. Beginning with model 6 in Table 2, which eliminates the interaction among education, task, and trial blocks, we exclude each of the two-way interactions among these factors to test the corresponding fixed effects. Both the education-by-blocks and task-by-blocks interactions proved to be highly statistically significant, while the education-by-task interaction is not. Finally, contrasting models 10 and 1, we tested whether the variance and covariance components for the random effects of task by trial blocks are required—that is, whether there is significant inter-subject variation in trajectories. These variance and covariance components are highly statistically significant. Model 11 eliminates the nonsignificant terms, with the exception of the fixed effect of sex, which was nearly significant. As the last line of the lower panel in Table 2 indicates, this model provides a good fit to the data.
The substantive focus of our data analysis is on the interactions between education and task and between education and trial blocks. These are displayed in the “effect plot” in Figure 1, which is constructed using model 11 (see Fox, Reference Fox1987). The dummy regressors for sex and race, which enter the model additively, are each set to 0.5, representing a group composed equally of males and females, and of Caucasians and non-Caucasians. Then education, task, and trial blocks are allowed to range over all combinations of their values, computing the fitted value of the advantageous minus disadvantageous choices within each such combination; these fitted values are represented by the dots in the graph, connected by solid lines. Finally, the broken lines indicate plus-and-minus one standard error around the fit, and thus represent approximate point-wise 68% confidence intervals for the fitted values.
DISCUSSION
Our results demonstrated that learning on the IGT (as indicated by trial block scores) was moderated both by the type of task and by education level. Notably, however, we did not find a 3-way interaction among these variables. Nor was there a significant 2-way interaction between education and task. As seen in Figure 1, subjects' performance tended to improve more rapidly, and reach a higher eventual positive score, as the level of education increased. For task 1, the increase was roughly linear. On task 2, there was a tendency for performance to rise initially and then to decline—a pattern that was consistent even at higher levels of education. The task 2 results are particularly puzzling. Unfortunately, comparison with other data is not possible because this version of the IGT has been included in so little research. One possible explanation, given the higher mean scores on the first four blocks of task 2 compared with task 1, is that some subjects ran out of cards from the “good decks” and had to resort to choices from the “bad decks” near the end of the game.Footnote 1
The absence of a significant 3-way interaction in the analysis provides us with a replication of the IGT-education link in both tasks, and strengthens our confidence in the reliability of the findings. However, the causal direction of this relationship is unclear. For instance, there is evidence that both innate intelligence and accrued knowledge contribute to rational decision making and improved problem solving (see Rindermann, Reference Rindermann2008). Consequently, there may be cognitive skills that improve with schooling and thereby aid in IGT performance. On the other hand, it is also reasonable to presume that some causally-prior variable(s) contributes both to good IGT performance and to educational attainment. One possibility is that the cognitive ability required to make rational decisions, to accurately assess risks, and to demonstrate appropriate precaution—basic tenets of the SMH—is a dimensional, largely biologically-based, human trait that influences the degree of one's success in many aspects of life. Relevant, for example, is the finding that 53% of homeless people had suffered a traumatic brain injury, and of these, 72% had their first episode before the onset of homelessness (Hwang et al., Reference Hwang, Colantonio, Chiu, Tolomiczenko, Kiss, Cowan, Redelmeier and Levinson2007). Such results raise the distinct possibility that their injury may have caused cognitive difficulties that contributed to their poor life success.
Similar to Bechara (Reference Bechara2007), we found nonsignificant effects for age and gender. There was, however, a main effect for race indicating that Caucasians had slightly better IGT performance that their non-Caucasian counterparts. Interpretation of racial effects is very difficult in a multi-racial society because race is correlated with other potentially influential factors such as immigration status, economic status, and type of education.
Our study has several strengths including the large sample of fully adult participants, and the inclusion of several demographic variables in our analyses. The use of a categorical education-level variable also has some advantages because the more commonly used years-of-schooling typically has nonlinear and indeed discontinuous effects. For example, the difference between 11th grade and 12th grade is more important for income than the difference between 10th and 11th grade. On the other hand, a limitation of our study is the uneven frequency of observations in each of the education categories. Because the subjects in our study are relatively highly educated, we are able to estimate effects more precisely at higher levels of education, and this is reflected, for example, in the standard errors displayed in Figure 1. Nevertheless, we are able to demonstrate a statistically reliable and interpretable interaction between level of education and subjects' trajectories over trial blocks.
In summary, our results challenge the claim by Bechara (Reference Bechara2007) that education has only a “minor” influence on IGT performance; instead our data demonstrate a pronounced and systematic influence on the pattern of learning across trial blocks on two versions of the IGT. As such, it would behoove future researchers to stratify samples according to education level when the IGT is used in clinical case-control research paradigms, or to match cases and controls on this and other variables. However, it is also important to recognize that matching on quantity of formal education does not equate quality of education. There is good evidence, for example, that different educational experiences foster differences in problem-solving strategies, knowledge, and opportunities for practice—all of which can influence test performance (Manly & Echemendia, Reference Manly and Echemendia2007).
It will also be of interest for future researchers to untangle more clearly the complex nature of the relationship between IGT performance and education level. Does one acquire particular skills from higher education that foster better IGT performance, or is there a set of traits that predisposes one to make probabilistically advantageous choices in real-life as in neuropsychological tests like the IGT? Finally, if there are predisposing factors that influence cognitive development and subsequent test performance, to what degree are these innate, or developed through early (preschool) educational experiences and learning opportunities in the home (see Byrd et al., Reference Byrd, Miller, Reilly, Weber, Wall and Heaton2006)?
ACKNOWLEDGMENT
This research was funded by a grant to the first author (2005–2008) from the Social Sciences and Humanities Research Council of Canada (# 410-2005-2076)..