1. Introduction
Evidence demonstrates that—compared to the general population—both men and women with schizophrenia spectrum disorders (SSD) have an elevated risk of being convicted of non-violent criminal offences, a higher risk of being convicted of violent criminal offences, and an even higher risk of being convicted of homicide [Reference Fazel, Gulati, Linsell, Geddes and Grann1, Reference Wallace, Mullen and Burgess2]. Nonetheless, as a group, offenders suffering from SSD seem to be very heterogeneous. Efforts have been made to structure this heterogeneity [Reference Volavka and Citrome3], since this could lead to the identification of different patient pathways to criminal offending, and thereby open up new avenues for prevention and treatment.
One of these approaches is Hodgins’ [Reference Hodgins, Piatosa and Schiffer4] theoretical framework. This is a typology investigating offender patients suffering from “severe mental illness” (SMI) [Reference Tengström, Hodgins and Kullgren5, Reference Hodgins, Cree, Alderton and Mak6]. Although this typology of three subgroups has been reviewed in a multitude of recent studies (see Table 1), results have been inconsistent. Earlier research has provided evidence on two of the three subgroups of offenders affected by SMI in general as well as SSD in particular [5–13]. It distinguished “early starters” (ES) from “late starters” (LS) operationalizing ES quite differently as either committing an offence before age 18 [Reference Tengström, Hodgins and Kullgren5, Reference Laajasalo and Hakkanen9, Reference Pedersen, Rasmussen, Elsass and Hougaard11], or being diagnosed with conduct disorder before age 15 [Reference Mathieu and Côté10, Reference Sánchez-SanSegundo, Ferrer-Cascales, Herranz-Bellido and Pastor-Bravo12], or as offending before first evidence for an SMI [Reference Jones, Van den Bree, Ferriter and Taylor7, Reference Kooyman, Walsh, Stevens, Burns, Tyrer and Tattan8, Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16]. LS were operationalized as either committing an offence after age 18 [Reference Tengström, Hodgins and Kullgren5, Reference Laajasalo and Hakkanen9, Reference Pedersen, Rasmussen, Elsass and Hougaard11], or not being diagnosed with conduct disorder before age 15 [Reference Mathieu and Côté10, Reference Sánchez-SanSegundo, Ferrer-Cascales, Herranz-Bellido and Pastor-Bravo12], or offending after evidence of an SMI had been reported [Reference Jones, Van den Bree, Ferriter and Taylor7, Reference Kooyman, Walsh, Stevens, Burns, Tyrer and Tattan8, Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16]. Whereas such differences in operationalization of the ES-LS typology complicate any synthesis and comparison of results [Reference Simpson, Grimbos, Chan and Penney14, Reference Penney, Prosser and Simpson17], ES were generally found to have grown up more frequently in deprived families or separated from their biological parents, to have experienced physical abuse, to perform poorly in school, to have conduct problems, to use alcohol and illegal substances, and to commit a greater variety and number of crimes before being diagnosed with SMI (often in addition to a personality disorder).
More recent research on offenders with SSD [Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16] was also able to provide evidence for the third subgroup of offenders proposed by Hodgins [Reference Hodgins18] termed “late late starters” (LLS) or “first offenders” (FO). These are a small group of predominantly male offenders in their late 30 s with chronic schizophrenia, but without any prior history of aggressive or antisocial behaviour, who typically engage in (actual or attempted) homicide of those caring for them. Despite being described as suffering from SSD [Reference Hodgins18], LLS/FO were also identified as a separate subgroup in a large sample of 1800 male and female offenders with SMI (including almost 25% mood disorders in addition to schizophrenia spectrum disorders) in Canada [Reference Crocker, Martin, Leclair, Nicholls and Seto19]. Yet, in contrast, the application of a different statistical approach to a smaller sample of 429 male and 78 female offenders with SMI in the same country resulted in a model with only two subgroups providing the best model fit [Reference Penney, Prosser and Simpson17].
Note: LLS = late late starter, FO = first offender, LS = late starter, ES = early starter.
The present study is designed to resolve the above inconsistencies, including those concerning whether there are either two or three subgroups of offenders with SSD (Objective 1) and the different operationalizations of Hodgins’ typology having been applied [Reference Hodgins18] (Objective 2). It aims to utilize a new methodology (Objective 3), and to explore different variables (Objective 4). It is to assess the number of subgroups in a previously unexplored sample of 370 offenders with SSD in Switzerland by means of an LCA of all the variables discussed most frequently in the literature (see Table 2).
2. Methodology
2.1. Source of data
A total of 370 offenders with schizophreniform disorders including all types of schizophrenia, schizoaffective disorder, and delusional disorder were identified among 1694 patients admitted to the Centre for Inpatient Forensic Therapies at the Zurich University Hospital of Psychiatry between 1982 and 2016. Their case files covering only one treatment period for each individual (the last, which was also always the longest admission) were included in the present study. They consisted of professionally documented anamneses, psychiatric inpatient and outpatient reports, police reports, testimonies, court proceedings, reports from social workers, and biannual reports from the nursing and care staff. Files were extensive and can be assumed to contain all relevant information on the health and biography of a patient due to the high medical and legal importance assigned to cases of forensic patients in Switzerland. Retrospective file analyses of these cases for the purposes of this study were approved by the Zurich Cantonal Ethics Committee.
2.2. Preliminary data processing and measures
A trained independent physician systematically reviewed all case files and conducted a directed qualitative content analysis [Reference Hsieh and Shannon20]. A second trained independent rater encoded a random subsample of 10% of cases to assess inter-rater reliability. Cohen’s Kappa [Reference Brennan and Hays21] was 0.78, which can be regarded as substantial [Reference Lambert and Bergin22]. The content analysis employed a questionnaire and rating protocol for coding based on the extended [Reference Habermeyer, Wolff, Gillner, Strohm and Kutscher23, Reference Kutscher, Schiffer and Seifert24] set of criteria proposed by Seifert [Reference Seifert25]. These criteria were augmented with those variables most frequently explored in current research on the topic (see Table 1). This ensured the inclusion of measures defined as being most similar to those explored in a large number of prior studies on Hodgins’ typology [Reference Hodgins, Piatosa and Schiffer4, Reference Hodgins18] (see Table 2 for definitions of operationalizations in this study and literature sources with similar operationalizations). Differences in operationalization were roughly as small as those between the available published studies on the subject (see Table 1). One example for this is the operationalization of difficulties in school (in contrast to not graduating from mandatory schooling). This was operationalized as repetition of a year in school. It seemed to be more objective than the self-reports of offenders [Reference Kooyman, Walsh, Stevens, Burns, Tyrer and Tattan8] or reports from schools on the allocation of additional teaching resources [Reference Tengström, Hodgins and Kullgren5] reported in prior research. Additionally, in contrast to previous research, this study differentiated more precisely between previous offences and index offences.
2.3. Data analysis
Data was assessed quantitatively using R Studio version 1.1.383. Before performing any statistical tests, multiple imputations [Reference Greenland and Finkle26, Reference Zhou, Eckert and Tierney27] by chained equations (MICE) [Reference Van Buuren, Boshuizen and Knook28] were executed to address missing values. MICE maintains the variability of missing data, and integrates the uncertainty caused by estimating them. It is considered to be one of the best methods for imputing missing values [Reference Ambler, Omar and Royston29]. The entire data set of 71 items was used to impute missing values with a total of 20 iterations of imputation.
Next, latent class analysis (LCA) was performed. LCA is a type of finite mixture model designed for analysing multivariate categorical data. It groups each observation probabilistically into an unobserved (= latent) nominal class, while minimizing the confusion between different observed items. LCA was conducted with the poLCA package implemented in R, which estimates the latent class model by maximizing the log-likelihood function using the expectation maximization (EM) algorithm.
Different numbers of classes (1 to 4) were evaluated to identify the most parsimonious model with the overall best fit representing the entire data set of 71 items and 370 observations (see Table 3). For a given number of classes, each latent class analysis was repeated 500 times with different starting values to avoid local extrema. In Class analyses 1 and 4, each item was assigned the same prior probability of belonging to a set class, given that no particular expectation regarding classification was available from the literature. In contrast, in Class analyses 2 and 3, the individual priors were allowed to vary depending on a covariate defined as the crime-schizophrenia-sequence variable. This variable classified patients into ES and LS (two-class model), or into ES, LS, and FO/LLS (three-class model) according to Hodgins’ [Reference Hodgins, Piatosa and Schiffer4] and related research [Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16, Reference Crocker, Martin, Leclair, Nicholls and Seto19] on her framework: ES had entries in the criminal registry prior to first symptoms of a SSD; LS had symptoms of a SSD prior to an entry in the criminal registry that had to be recorded for a crime committed before age 35; and LLS had symptoms of a SSD prior to an entry in the criminal registry that had to be recorded for a crime committed after age 35.
A set of different measures was computed to assess model fit and to compare results with the previous literature. These were the maximum log-likelihood, the Bayesian information criterion (BIC), Akaike’s information criterion (AIC), and entropy. Whereas maximum log-likelihood is exclusively a measure of goodness of model fit, BIC and AIC are measures of parsimony aiming to avoid over-fitting. Entropy is a measure of classification uncertainty [Reference Asparouhov and Muthén30], with values of > 0.8 indicating a good separation between classes. For a particular number of classes, the model with the lowest log-likelihood was selected. To subsequently compare models between different numbers of classes, information criteria were evaluated. BIC penalizes additional model parameters more strongly than AIC and hence can be considered more conservative in preventing over-fitting. As a consequence, AIC has been reported to overestimate the correct number of components in a finite mixture model [Reference Soromenho31], whereas BIC performs more adequately [Reference Roeder and Wasserman32]. For this reason, BIC was prioritized over AIC in selecting the best model fit. scBIC is a sample-size-corrected BIC value being computed for completeness. To further evaluate model fit with respect to the predicted classification in two- and three-class models, a chi-square test was performed to test for an association between the predicted typologies and the identified classes.
3. Results
The lowest BIC and scBIC values - indicating best model fit in terms of model complexity and parsimony - were observed in the two-class and three-class models (see Table 3).
Given that BIC criteria were close and ambiguous for both model fits, the relevance of the identified classes was further evaluated in order to decide which model better represented the data in terms of identifying patient subcategories. Fig. 1 shows the normalized probability of class membership (i.e. the posterior probability of class membership divided by the probability of random assignment to a class) for the different patient subcategories. Here, the two-class model showed the same trend in both patient categories: a lower probability of belonging to Class 1 compared to Class 2. In contrast, the three-class model had more similar maximum probabilities of class membership, whereas inter-categorical differences were larger. In addition, a chi-square test performed on the contingency tables presented in Table 4 evaluating the matching between predicted categories and model-identified classes, was only significant for the three-class model (p = 0.005), but not for the two-class model (p = 0.191). This indicates a significant association between the three-class model and Hodgins’ predicted subcategories. Hence, the three-class model could better capture previously described patient subcategories. Therefore, subsequent data interpretation was performed on this model. Table 5 presents the posterior probability of each item belonging to a specific class in the selected three-class model. Posterior probabilities, conditional on the observed manifest variable, were computed using Bayes formula as described earlier [Reference Linzer and Lewis33].
Note: AIC = Akaike’s information criterion; BIC = Bayesian information criterion; entropy = measure of classification uncertainty.
Note: There was a significant association between the three-class model fit and Hodgins’ typology (p = 0.005), but not between the two-class model fit and the early-starters and late-starters typology (p = 0.191).
ES = early starters; LS = late starters; LLS/FO = late late starters/first offenders.
For subsequent class-specific interpretation, only variables showing at least a 10% or larger difference in class membership probabilities among categories were considered(see Table 5).
3.1. Class 1
In comparison to the other two classes, patients in this subgroup seem to have the best clinical match to Hodgins’ [Reference Hodgins, Piatosa and Schiffer4] description of the early starters (ES) and are estimated to include 39% of the total study population. They most probably face multiple challenges as minors, including disciplinary sanctions, being diagnosed with a conduct disorder, receiving mental health treatment, being the victim of physical abuse and emotional neglect, growing up separated from their biological parents, using legal and illegal substances, repeating a year in school, and not graduating from mandatory schooling. Their parents most probably also use illegal substances and alcohol. Patients in this subgroup are most probably less than 21 years old when first prodromal symptoms of an SSD are documented, an SSD is diagnosed, a first psychiatric inpatient treatment is given, a first criminal registry entry is recorded, and the index offence is committed. They most probably abuse alcohol and cannabis (but not illegal substances in general—for which they have the lowest probability). They have the highest probability of receiving more than five inpatient treatments and more than four criminal registry entries by the time they enter forensic inpatient treatment. Offences most probably include property crime, offences against the weapons act, and petty offences such as transgressions of traffic law or the controlled substances act. They are most probably single, unemployed, and homeless.
3.2. Class 2
Estimated characteristics of patients in this subgroup (estimated to compose 40% of the study population) seem to best resemble those described for late starters (LS). Compared to other subgroups, patients reveal the lowest probability of having experienced physical abuse as minors and the highest probability of refraining from the use of alcohol. Age of estimated illness onset, first diagnosis of an SSD, first psychiatric inpatient treatment, as well as first criminal registry entry is most probably between 21 and 35. Probabilities for all other variables differ less than 10% from those estimated for the other subgroups. Crimes committed seem to be similar to those committed by those in Class 1, but include fewer petty offences, fewer property crimes (especially such without violence), and fewer threats and coercion.
3.3. Class 3
Patient characteristics in Class 3 (21% of the sample) seem to best fit previous descriptions of offenders referred to as late late starters (LLS) or first offenders (FO). They have least probably been diagnosed with a conduct disorder as a minor. Although they most probably use illegal substances, these least probably include cannabis. Age of estimated illness onset, first diagnosis of an SSD, first psychiatric inpatient treatment, first criminal registry entry, and index offence are most probably after the age of 35. Previous offences are most likely to include sexual offences, and the index offence is most likely to involve threat or coercion. They are much more probably married and slightly more probably female.
As a final note pertaining to all three subgroups, the two most probable index offences are attempted or executed homicide and assault, confirming prior research noting there may be more differences in offender than offence characteristics [Reference Laajasalo and Hakkanen9].
Fig. 2 presents the distribution of classes in a set of selected items. Interestingly, individual age-related variables separated the classes surprisingly well.
4. Discussion
This study applied LCA as a case-centred analytic approach and focused on similarities and differences between classes of offenders with SSD instead of between the variables examined. Based on a set of variables similar to the one most frequently used in extant research (see Table 2), it analysed a previously unexplored sample of offenders with SSD. The study confirms the existence of the three subgroups of offenders proposed in prior research [Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16, Reference Crocker, Martin, Leclair, Nicholls and Seto19]. Therefore, as proposed in the introduction, the results of this study are able to reduce doubts raised by inconsistencies in prior research about the existence of three distinct subgroups (Objective 1), to combine different operationalizations of Hodgins’ [Reference Hodgins, Piatosa and Schiffer4, Reference Hodgins18] typology (Objective 2) and variables (Objective 4), while using a more applicable [Reference Penney, Prosser and Simpson17] novel statistical methodology (Objective 3).
Results (in Table 5) indicate each of the variables operationalizing Hodgins’ offender typology (18)—in terms of age at first inpatient treatment (Fig. 2, D), age at estimated illness onset (Fig. 2, C), or age at first diagnosis of a SSD (Fig. 2, B)—differentiate better between subgroups of offender patients than the crime-schizophrenia-sequence variable (Fig. 2, E) used as a covariate in this study and as a grouping variable in other most recent research on this issue [Reference Simpson, Grimbos, Chan and Penney14–Reference Van Dongen, Buck and van Marle16, Reference Crocker, Martin, Leclair, Nicholls and Seto19]. The overall rather poor ability of the latter to distinguish subgroups is also reflected in the contingency table (Table 4) for the three-class model.
Nonetheless, there are two major shortcomings when the sole parameter for subgrouping is age at either first symptoms, first inpatient treatment, or diagnosis. First, offending may delay treatment of SSD [Reference Jones, Van den Bree, Ferriter and Taylor7, Reference Van Dongen, Buck and van Marle16] and thus result in false subgroupings of offender patients. Second, whether women are analysed separately (sample size permitting), as proposed in recent research on psychoses in women [Reference Seeman34], or together with male patients may also have an impact, because it is estimated 20% of women will be diagnosed with schizophrenia after the age of 40 and generally four to six years later than men [Reference Riecher-Rössler35]. In conclusion, using age at first criminal registry entry (Fig. 2, A) may serve as a fair differentiator of offender patients into Hodgins’ subgroups for future research when no more than one variable is available for subgrouping.
While helping to resolve prior inconsistencies, this study also has some limitations. These include the exclusion of those offenders with SSD waiting in Swiss prisons to be transferred to treatment in a forensic hospital, which generally is a problem in Switzerland [Reference Steinau, Brackmann, Sternemann, Biller-Andorno and Habermeyer36]. This may indicate a selection bias.
Another methodological limitation pertains to the rather small difference between the BIC and scBIC values informing final model selection. Although this is a common issue, already encountered previously by the one other study to use a similar statistical approach [Reference Penney, Prosser and Simpson17], model-fit insecurity was addressed in this study by including a covariate defining the predicted typologies. This permitted a statistical comparison of the association with the identified model classes (Table 4). This analysis unfortunately is applicable only when a theoretically predicted classification variable is available. Therefore, the comparison could be performed only for the two- and three-class models. Nonetheless, given that the two- and three-class model fits did show the overall lowest BIC and scBIC values, it is reasonable to restrict statistical comparison to these two model fits. Despite very different sample characteristics (offenders with SMI instead of just SSD), a slightly different methodology (LPA instead of LCA, see introduction for critique on this choice), and separate analysis of male and female offenders, Penney et al. [Reference Penney, Prosser and Simpson17] observed similarly small differences between BIC values in their study. They did, however, use a different criterion for model selection (the Lo–Mendell–Rubin likelihood ratio test) favouring a two-class solution. This reflects limitations in the available statistical techniques for LCA model comparisons. At the same time, the number of subgroups of offenders with SSD may differ from that of offenders with SMI. Including a model-fit evaluation based on theoretically postulated patient typologies does enhance the validity of the three-class model identified in the current study.
Overall, it may be more important for prevention, early intervention, and treatment of offenders with SMI or SSD to provide further details on subgroups—including psychopathological variables and current treatment efficacy. Measuring the presence and extent of personality disorders and psychopathy [Reference Hare and Neumann37] in addition to SSD could be a first step in that direction. This study considered personality disorders diagnosed prior to forensic admission, but available data was insufficient on the diagnosis of personality disorders after admission and (the relatively novel) psychopathy scores for a sufficient number of patients, which may be due to forensic hospitalizations considered in this study dating back as far as 1982. Future research should address this shortcoming by analysing psychopathology in more detail in more recent patient data.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Declarations of interest
None.
Comments
No Comments have been published for this article.