Uncovering key predictive channels and clinical variables in the gamma band auditory steady-state response in early-stage psychosis: a longitudinal study

Kristina M. Holton; Amy Higgins; Austin J. Brockmeier; Mei-Hua Hall

doi:10.1017/neu.2024.60

Uncovering key predictive channels and clinical variables in the gamma band auditory steady-state response in early-stage psychosis: a longitudinal study

Published online by Cambridge University Press: 09 December 2024

Kristina M. Holton

Amy Higgins ,

Austin J. Brockmeier and

Mei-Hua Hall

Show author details

Kristina M. Holton*: Affiliation:
Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
Amy Higgins: Affiliation:
Psychosis Neurobiology Laboratory, McLean Hospital, Belmont, MA, USA
Austin J. Brockmeier: Affiliation:
Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA Department of Electrical and Computer Engineering, University of Delaware, Newark, DE, USA Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
Mei-Hua Hall*: Affiliation:
Psychosis Neurobiology Laboratory, McLean Hospital, Belmont, MA, USA Department of Psychiatry, Harvard Medical School, Boston, MA, USA Division of Psychotic Disorders, McLean Hospital, Belmont, MA, USA
*: Corresponding authors: Kristina M. Holton; Email: [email protected], Mei-Hua Hall; Email: [email protected]
Corresponding authors: Kristina M. Holton; Email: [email protected], Mei-Hua Hall; Email: [email protected]

Article contents

Abstract
Introduction
Methods
Results
Discussion
Supplementary material
Author contributions
Funding statement
Competing interests
Ethical standard
Footnotes
References

Rights & Permissions

Abstract

Objective:

Psychotic disorders are characterised by abnormalities in the synchronisation of neuronal responses. A 40 Hz gamma band deficit during auditory steady-state response (ASSR) measured by electroencephalogram (EEG) is a robust observation in psychosis and is associated with symptoms and functional deficits. However, the majority of ASSR studies focus on specific electrode sites, while whole scalp analysis using all channels, and the association with clinical symptoms, are rare.

Methods:

In this study, we use whole-scalp 40 Hz ASSR EEG measurements – power and phase-locking factor – to establish deficits in early-stage psychosis (ESP) subjects, classify ESP status using an ensemble of machine learning techniques, identify correlates with principal components obtained from clinical/demographic/functioning variables, and correlate functional outcome after a short-term follow-up.

Results:

We identified significant spatially-distributed group level differences for power and phase locking. The performance of different machine learning techniques and interpretation of the extracted feature importance indicate that phase locking has a more predictive and parsimonious pattern than power. Phase locking is also associated with principal components composed of measures of cognitive processes. Short-term functional outcome is associated with baseline 40 Hz ASSR signals from the FCz and other channels in both phase locking and power.

Conclusion:

This whole-scalp EEG study provides additional evidence to link deficits in 40 Hz ASSRs with cognition and functioning in ESP, and corroborates with prior studies of phase locking from a subset of EEG channels. Confirming 40 Hz ASSR deficits serves as a candidate phenotype to identify circuit dysfunctions and a biomarker for clinical outcomes in psychosis.

Keywords

Early stage psychosis electroencephalogram machine learning longitudinal schizophrenia spectrum disorder

Type: Original Article
Information: Acta Neuropsychiatrica , Volume 37 , 2025 , e1

DOI: https://doi.org/10.1017/neu.2024.60 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2024. Published by Cambridge University Press on behalf of Scandinavian College of Neuropsychopharmacology

Significant outcomes

Novel evidence of 40 Hz ASSR in ESP patients across whole-scalp EEG measures of phase locking and power.
Patterns in whole-scalp 40 Hz ASSR EEG channel measurements, especially phase locking, identified by machine learning, and focused on a subset of key EEG channels that serve as candidate biomarkers for early-stage psychosis.
Phase-locking factor is correlated with cognitive measures, and baseline 40 Hz ASSRs are correlated with short-term longitudinal functional outcomes.

Limitations

Reduced statistical power in longitudinal analysis due to limited sample size caused by the COVID-19 pandemic.
Analysis of early-stage psychosis case ignores the heterogeneity of SSD and AP diagnoses.
Network-level and source-level disruption of gamma oscillations in ESP are not investigated in the study.

Highlights

40 Hz ASSR deficits are already present in ESP, and PLF outperforms PWE in predictive and discriminative power through machine learning, while focusing on a subset of channels.
There is a strong correlation between cognitive measures and PLF.
Baseline 40 Hz ASSR is correlated with longitudinal functioning at a short-term follow-up.

Introduction

Psychotic disorders are characterised by aberrant sensory processing, cognitive deficits, and psychosocial functioning impairment. Within the sensory modality, the electroencephalogram (EEG) recording of auditory steady-state response (ASSR) reflects evoked oscillatory responses to modulated stimuli and induced oscillations peaking at 40 Hz to rapidly presented periodic auditory stimulation (Thune et al., Reference Thune, Recasens and Uhlhaas2016). For stimuli modulated at 40 Hz, ASSR is both evoked and induced. The 40 Hz ASSR is of particular importance because it lies in the range of gamma oscillations, which are robustly altered in major brain disorders, such as schizophrenia and psychosis (O’Donnell et al., Reference O’donnell, Vohs, Krishnan, Rass, Hetrick and Morzorati2013, Onitsuka et al., Reference Onitsuka, Tsuchimoto, Oribe, Spencer and Hirano2022). Disruptions in 40 Hz ASSR have been consistently observed in psychotic disorders including schizophrenia spectrum and affective psychosis as well as linked to cognitive function and clinical symptoms in these disorders, and are also considered as a potential biomarker for these conditions (Spencer et al., Reference Spencer, Salisbury, Shenton and Mccarley2008b; Uhlhaas and Singer, Reference Uhlhaas and Singer2010; Mulert et al., Reference Mulert, Kirsch, Pascual-Marqui, Mccarley and Spencer2011; Thune et al., Reference Thune, Recasens and Uhlhaas2016; Zhou et al., Reference Zhou, Mueller, Spencer, Mallya, Lewandowski, Norris, Levy, Cohen, Ongur and Hall2018; Onitsuka et al., Reference Onitsuka, Tsuchimoto, Oribe, Spencer and Hirano2022).

ASSR in the cerebral cortex are primarily generated by the reciprocal interactions between excitatory pyramidal cells and parvalbumin-expressing (PV+) inhibitory interneurons (basket cells) in local circuits (Buzsaki and Wang, Reference Buzsaki and Wang2012). It is proposed that the imbalance of inhibitory and excitatory (E/I) neural circuits is an underlying mechanism for psychosis including schizophrenia spectrum disorders (SSD) (Hirano and Uhlhaas, Reference Hirano and Uhlhaas2021; Onitsuka et al., Reference Onitsuka, Tsuchimoto, Oribe, Spencer and Hirano2022) and bipolar disorder with psychosis and major depressive disorder with psychotic features, also known as affective psychosis (AP) (Spencer et al., Reference Spencer, Niznikiewicz, Shenton and Mccarley2008a; Hall et al., Reference Hall, Smoller, Cook, Schulze, Hyoun Lee, Taylor, Bramon, Coleman, Murray, Salisbury and Levy2012; Johannesen et al., Reference Johannesen, O’donnell, Shekhar, Mcgrew and Hetrick2013). The imbalances of E/I circuits are already detectable in early-stage psychosis (ESP), as evidenced by deficits compared to healthy controls in the 40 Hz ASSR (Spencer et al., Reference Spencer, Salisbury, Shenton and Mccarley2008b, Grent-’t- Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021).

Although the 40 Hz ASSR is primarily generated in the primary auditory areas, other brain regions, including the frontal lobe (Koshiyama et al., Reference Koshiyama, Miyakoshi, Joshi, Molina, Tanaka-Koshiyama, Sprock, Braff, Swerdlow and Light2020; Tada et al., Reference Tada, Kirihara, Ishishita, Takasago, Kunii, Uka, Shimada, Ibayashi, Kawai, Saito, Koshiyama, Fujioka, Araki and Kasai2021; Koshiyama et al., Reference Koshiyama, Miyakoshi, Joshi, Molina, Tanaka-Koshiyama, Joyce, Braff, Swerdlow and Light2021a), thalamus (Steinmann and Gutschalk, Reference Steinmann and Gutschalk2011; Grent-’t-Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021), hippocampus (Grent-’t-Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021), and parietal cortex (Tada et al., Reference Tada, Kirihara, Ishishita, Takasago, Kunii, Uka, Shimada, Ibayashi, Kawai, Saito, Koshiyama, Fujioka, Araki and Kasai2021) also contribute to ASSR generation. Koshiyama et al. show that in patients with schizophrenia, localised deficits of the 40 Hz ASSR in the auditory cortex quickly propagate to other brain regions, especially the frontal lobe, suggesting a network-level disruption of gamma oscillations (Koshiyama et al., Reference Koshiyama, Miyakoshi, Joshi, Molina, Tanaka-Koshiyama, Joyce, Braff, Swerdlow and Light2021a). The inability of the frontal regions to engage properly is associated with wider cognitive and functional deficits.

Cognitive dysfunction and social occupational function impairment are core features of psychotic disorders (Addington and Addington, Reference Addington and Addington2000; Kahn and Keefe, Reference Kahn and Keefe2013; Kalin et al., Reference Kalin, Kaplan, Gould, Pinkham, Penn and Harvey2015; Koshiyama et al., Reference Koshiyama, Miyakoshi, Thomas, Joshi, Molina, Tanaka-Koshiyama, Sprock, Braff, Swerdlow and Light2021b). To assess impairments in cognitive processes, the Matrix Consensus Cognitive Battery (MCCB), a standardised assessment tool, evaluates various cognitive domains affected by psychosis, including working memory, processing speed, attention, problem solving, verbal and visual learning, and social cognition (August et al., Reference August, Kiwanuka, Mcmahon and Gold2012). To measure functioning, the Global Assessment of Functioning (GAF) (Endicott et al., Reference Endicott, Spitzer, Fleiss and Cohen1976) is widely used for assessing overall functional level, while the modified Multnomah Community Ability Scale (MCAS) is used for assessing day-to-day activities and social occupational functions (Hendryx et al., Reference Hendryx, Dyck, Mcbride and Whitbeck2001; Chan et al., Reference Chan, Brady, Lewandowski, Higgins, Öngür and Hall2021). In addition to these deficits, the severity of symptoms, both psychosis and mood, may fluctuate over time and can be measured through the Positive and Negative Symptoms Scale (PANSS) (Kay et al., Reference Kay, Fiszbein and Opler1987), the Montgomery and Asberg Depression Rating Scale (MADRS) (Montgomery and Asberg, Reference Montgomery and Asberg1979), and Young’s Mania Rating Scale (YMRS) (Young et al., Reference Young, Biggs, Ziegler and Meyer1978).

EEG recordings of ASSRs are quantified by two measurements: evoked power or power of waveform envelope (PWE) and inter-trial phase-coherence, which we refer to herein as phase-locking factor (PLF). These measurements are computed from the EEG recordings using time-frequency analysis methods, specifically the wavelet transformation. Evoked power measures the overall strength or amplitude of the oscillations in response to the auditory stimuli. Relatively higher evoked power means that a larger population of neurons are synchronously firing in response to the auditory stimuli (Spencer et al., Reference Spencer, Niznikiewicz, Shenton and Mccarley2008a). PLF measures the consistency in the phase of the oscillations across different trials. If the phase of the oscillations is consistently aligned to the onset of the auditory stimuli across different trials, this results in a high phase-locking factor (Spencer et al., Reference Spencer, Salisbury, Shenton and Mccarley2008b).

While 40 Hz ASSR deficits have been consistently observed in SSD and AP patients, studies have typically focused on a few channels on the frontal central sites of the EEG (Onitsuka et al., Reference Onitsuka, Tsuchimoto, Oribe, Spencer and Hirano2022). Analysing ASSR activity from the whole-scalp EEG channels offers an advantage over using a few EEG channels by providing a more comprehensive and accurate representation of brain activity, leading to more reliable and clinically relevant insights. Whole-scalp analysis captures the full spatial distribution of ASSR responses, enhancing the ability to detect subtle abnormalities and differentiate between localised versus widespread dysfunction. This approach reduces the risk of missing important neural signals that might be impaired in patients but undetected by a limited number of channels, offering a richer and more detailed picture of neural activity. While source-level analyses are valuable for pinpointing the spatial origin of ASSR deficits (Grent-’t-Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021; Koshiyama et al., Reference Koshiyama, Miyakoshi, Thomas, Joshi, Molina, Tanaka-Koshiyama, Sprock, Braff, Swerdlow and Light2021b), scalp-level ASSR provide clinically meaningful information despite being more global due to volume conductance (Nunez and Srinivasan, Reference Nunez and Srinivasan2006). It reflects the overall integrity of the auditory system, including how well the brain can synchronise with external auditory stimuli. In psychosis, disruptions in these processes are often detectable at the scalp level and are associated with broader cognitive and functional impairments. Thus, scalp-level ASSR can serve as a biomarker for identifying patients at risk of poorer outcomes, guiding interventions, and tailoring treatment plans (Javitt et al., Reference Javitt, Spencer, Thaker, Winterer and Hajos2008; Donde et al., Reference Donde, Kantrowitz, Medalia, Saperstein, Balla, Sehatpour, Martinez, O.’connell and Javitt2023).

Machine learning applied to these EEG data might provide a means to accurately predict disease status and trajectories (Barros et al., Reference Barros, Silva and Pinheiro2021), complementing clinical variables with EEG-based biomarkers. Machine learning methods can handle multivariate and complex data and extract or select representations of relevant data features for training classifiers (Saeidi et al., Reference Saeidi, Karwowski, Farahani, Fiok, Taiar, Hancock and Al-Juaid2021). The accuracy of the classifier depends on the choice of the classifier algorithm or functional form. Administering a battery of classifiers in both PLF and PWE can reveal which type of classifier performs the best with respect to each paradigm, while permutation analysis can establish the robustness of the classifiers. With some types of classifiers, the importance of each channel in either PLF or PWE can be assessed directly, but for other classifiers the importance is assessed by examining the classification accuracy with and without the channel. Taking this idea further, one can create ensemble rankings of each channel for PLF and PWE across the different classifiers revealing highly discriminatory channels. Through this machine learning approach, we assess EEG patterns in whole-scalp 40 Hz ASSR as potential biomarkers of psychosis. To date we are unaware of studies of ASSR applying machine learning methods on EEG with whole-scalp electrodes, and therefore, this study focusses on addressing this gap.

Additionally, studies have found associations that ASSR PLF or PWE deficits in psychosis are associated with clinical symptoms and functioning in patients (Mulert et al., Reference Mulert, Kirsch, Pascual-Marqui, Mccarley and Spencer2011; Zhou et al., Reference Zhou, Mueller, Spencer, Mallya, Lewandowski, Norris, Levy, Cohen, Ongur and Hall2018; Koshiyama et al., Reference Koshiyama, Miyakoshi, Thomas, Joshi, Molina, Tanaka-Koshiyama, Sprock, Braff, Swerdlow and Light2021b). However, testing clinical assessments individually is statistically suboptimal, given that many of these assessments are interrelated, either positively or negatively. Using principal component analysis (PCA) to group correlated clinical variables into components improves our ability to understand and interpret the relationships between clinical variables and channel-wise measurements of PLF and PWE.

Further, there is an urgent need to accurately characterise the progression of symptoms and understand the neurophysiological changes during the ESP period because early detection and intervention may likely slow functional decline and promote favourable outcomes (McGorry, Reference Mcgorry2002; Pantelis et al., Reference Pantelis, Yucel, Wood, Velakoulis, Sun, Berger, Stuart, Yung, Phillips and Mcgorry2005). Since the majority of studies of gamma oscillations report chronic schizophrenia patients (Thune et al., Reference Thune, Recasens and Uhlhaas2016), the potential to assess longitudinal functional outcomes in ESP subjects using 40 Hz ASSR is only recently being realised. Koshiyama and colleagues found that 40 Hz ASSR was significantly reduced in the recent onset schizophrenia and ultra-high-risk for psychosis groups and that the attenuated 40 Hz ASSR was correlated with future (12 month) global functioning level (GAF) in the schizophrenia group (Koshiyama et al., Reference Koshiyama, Kirihara, Tada, Nagai, Fujioka, Ichikawa, Ohta, Tani, Tsuchiya, Kanehara, Morita, Sawada, Matsuoka, Satomura, Koike, Suga, Araki and Kasai2018). However, this study only reported correlations for a single electrode site.

In this study, we apply machine learning and data science approaches to study whole-scalp 40 Hz ASSR in ESP patients. The dataset consists of a longitudinal cohort (both SSD and AP, N = 72) and age matched healthy controls (N = 58). We propose to i) examine whether 40 Hz ASSR PLF and PWE deficits are already present at early stage of illness, aligning with san earlier study (Spencer et al., Reference Spencer, Salisbury, Shenton and Mccarley2008b); ii) apply machine learning to construct classifiers using whole-scalp PLF and PWE channels to examine which channels are the most important to discriminate between ESP and HC and what type of classifier is optimal; iii) correlate PLF and PWE channels with the principal components that underlie clinical variables, in order to identify which components and channels are correlated; and iv) examine correlations between baseline PLF and PWE with clinical variables at one-year follow-up to gain predictive insights into short-term functional outcome. Our study explores different machine learning techniques and utilises either PLF or PWE to not only capture differences between ESP and HC, but also discover the channels most indicative of deficits and relevant to clinical variables. The results provide a framework for better understanding 40 Hz ASSR measurements as biomarkers.

Methods

Study sample

Subjects consisted of 130 participants, with a total of 29 schizophrenia spectrum disorder (SSD), 43 affective psychosis (AP), and 58 healthy controls (HC), under the approval of the McLean Hospital Institutional Review Board. The study schematic is shown in Fig. 1, and demographics are provided as Supplementary Table 1.

Figure 1. Study schematics.

Characteristics of this study cohort have been described in (Chan et al., Reference Chan, Brady, Hwang, Higgins, Nielsen, Ongur and Hall2020). Briefly, for inclusion criteria, patients were recruited from the First Episode Psychosis Clinic (McLean OnTrackTM) at McLean Hospital (Belmont, MA) and met DSM-IV criteria for Schizophrenia, Schizoaffective disorder, Bipolar I Disorder with psychotic features, or psychosis not otherwise specified, as assessed by the Structured Clinical Interview for DSM-IV (SCID) and review of medical records at their initial assessment. ESP at the time of entry to the clinic was defined as having an onset of psychotic symptoms within the past six years, and was not based on hospital admission or date or clinical intervention. Age-matched healthy controls (HCs) were recruited from the community and evaluated by SCID-NP. Exclusion criteria for ESP participants included psychotic symptoms that were attributable to acute intoxication or drug use, and illness duration over six years. Exclusion criteria of all subjects included: diagnosis of neurological disorders; history of head trauma with loss of consciousness; hearing impairments, blindness, or deafness; electroconvulsive therapy within the past 6 months; and IQ less than 70. Additional exclusion criteria for HCs were the following: no current or past history of psychotic or affective disorders, no current substance abuse or lifetime substance dependence, and no first-degree relative with a history of psychosis or bipolar disorder.

Clinical assessments

Clinical measures used in this study included: the Positive and Negative Syndrome Scale (PANSS) with subscales for Positive, Negative, and General symptoms (Kay et al., Reference Kay, Fiszbein and Opler1987), the MADRS (Montgomery and Asberg, Reference Montgomery and Asberg1979), Young Mania Rating Scale (YMRS) (Young et al., Reference Young, Biggs, Ziegler and Meyer1978), a modified Multnomah Community Ability Scale (MCAS) (Hendryx et al., Reference Hendryx, Dyck, Mcbride and Whitbeck2001; Chan et al., Reference Chan, Brady, Lewandowski, Higgins, Öngür and Hall2021), and the GAF (Endicott et al., Reference Endicott, Spitzer, Fleiss and Cohen1976). Cognition domains were measured using the MCCB (August et al., Reference August, Kiwanuka, Mcmahon and Gold2012). Medication information, including antipsychotics and lithium dosage, was collected at each assessment timepoint. Antipsychotics (96% second-generation) were converted into chlorpromazine equivalents based on the recommendations of Gardner et al., (Gardner et al., Reference Gardner, Murphy, O’donnell, Centorrino and Baldessarini2010).

Electrophysiological recording and processing

The electroencephalogram (EEG) was recorded using the BioSemi Active Two system (BioSemi Inc., Amsterdam, Netherlands) with a bandpass of DC–104 Hz at a sampling frequency of 512 Hz, and a Common Mode Sense as the reference (PO2 site) using either an 18- or 64-channel electrode cap. Electrooculogram (EOG) electrodes were placed below and at the outer canthi of the left eye. Fifteen subjects were recorded using an 18-channel BioSemi Active Two system. Their PLF and PWE values were imputed to 59 channels (PLF) and 63 channels (PWE) using a K-nearest neighbours approach, 10 neighbours, with optimal impute (iai v1.5.0).

The 40 Hz ASSR stimuli were presented through earphones in one block of stimuli (150/block) (Zhou et al., Reference Zhou, Mueller, Spencer, Mallya, Lewandowski, Norris, Levy, Cohen, Ongur and Hall2018). Stimuli consisted of trains of 1-ms white noise clicks (500-ms duration, 1100-ms stimulus onset asynchrony, 80-dB sound pressure level). Subjects were instructed to look at the fixation cross on the monitor and listen to the stimuli.

Signal processing was performed off-line using Brain Vision Analyzer software (Brain Products GmbH, Germany) and blind to group membership (Kozhemiako et al., Reference Kozhemiako, Wang, Jiang, Wang, Gai, Zou, Wang, Yu, Zhou, Li, Guo, Law, Coleman, Mylonas, Shen, Wang, Tan, Qin, Huang, Murphy, Stickgold, Manoach, Zhou, Zhu, Hal, Purcell and Pan2022). EEG data were downsampled to 256 Hz, re-referenced off-line to the linked mastoids, and filtered with a passband between 0.1 and 50 Hz. Single trial segments were extracted, baseline-corrected relative to the 500 ms pre-stimulus interval, eye-blink-corrected using Brain Vision Analyzer’s default setting (Gratton et al., Reference Gratton, Coles and Donchin1983) and artefact rejected if values exceeded>100 μV. Phase locking (PLF) and evoked power (PWE) at each site were calculated on wavelet coefficients obtained from Morlet wavelet transformation of the segmented data (representing the 1–50 Hz frequency range, with a total number of 50 frequency layers using a Morlet parameter of 10). The PLF quantifies consistency of oscillatory phase across individual trials, ranging from 0 (purely non-phase-locked activity) to 1 (fully phase-locked activity). Gamma-band (40 Hz) PLF and PWE were computed by averaging across the 36–46 Hz wavelet frequency layers in the 20–520 ms window post wherein both PLF and PWE were maximal for 40 Hz ASSR (Kozhemiako et al., Reference Kozhemiako, Wang, Jiang, Wang, Gai, Zou, Wang, Yu, Zhou, Li, Guo, Law, Coleman, Mylonas, Shen, Wang, Tan, Qin, Huang, Murphy, Stickgold, Manoach, Zhou, Zhu, Hal, Purcell and Pan2022). Averaged HC and ESP responses as time-frequency spectrograms for PLF and PWE were created from representative channels across the scalp using Brain Vision Analyzer software.

Assessing case-control differences

PLF and PWE channels were individually assessed for differences between ESP and HC using a one-sided (alternative=’greater’ for ESP compared to controls) using a Welch’s t-test (R version 4.1.2). Inverse log10 p-values were plotted on scalp maps using MATLAB script plot.topography.m (version 1.5) (Martínez-Cagigal, Reference Martínez-Cagigal2020) as scaled to the most extreme p-value (0.00138).

Modeling PLF and PWE channels with machine learning

PLF and PWE channels were used to predict ESP/HC by splitting subjects 80:20 into train and test sets stratified by case. Model training and hyperparameter selection was performed on the train set (80% of the subjects) and tested on the 26 held-out test subjects. Hyperparameter choices over a grid search of 100 bootstraps and final selection are reported in Supplementary Table 2. Random forest used 100 out-of-bag times, with mean Gini index reported (randomForest v4.7–1.1 (Liaw and Wiener, Reference Liaw and Wiener2002), caret v6.0–90 (Kuhn and Max, 2008)). Ridge (L2-penalized elastic net regression) underwent 100 bootstraps with importance reported (glmnet v4.1–4 (Friedman et al., Reference Friedman, Tibshirani and Hastie2010; Tay et al., Reference Tay, Narasimhan and Hastie2023), caret v6.0–90 (Kuhn and Max, 2008)). Gaussian process with a radial basis function (RBF) underwent 100 bootstraps, with feature importance reported as a measure of area under the curve (AUC) of the receiver operator characteristic curve (ROC) for each feature (kernlab v0.9–31 (Karatzoglou et al., Reference Karatzoglou, Smola, Hornik and Zeileis2004; Karatzoglou et al., Reference Karatzoglou, Smola and Hornik2023), caret v6.0–90 (Kuhn and Max, 2008)). Support vector machine (SVM) with a RBF kernel underwent 100 bootstraps, with importance reported as a measure of AUC of the ROC for each feature (kernlab v0.9–31 (Karatzoglou et al., Reference Karatzoglou, Smola, Hornik and Zeileis2004; Karatzoglou et al., Reference Karatzoglou, Smola and Hornik2023), caret v6.0–90 (Kuhn and Max, 2008)). Naive Bayes underwent 100 bootstraps, with feature importance reported as a measure of AUC of the ROC for each feature (naivebayes v0.9.7 (Majka, Reference Majka2024), caret v6.0–90 (Kuhn and Max, 2008)). For all algorithms, we calculated the test set F1, accuracy, accuracy upper/lower 95% confidence intervals, balanced accuracy, AUC, sensitivity, specificity, positive predictive value, negative predictive value, and root mean standard error (RMSE) (caret v6.0–90 (Kuhn and Max, 2008), ModelMetrics v1.2.2.2 (Hunt, Reference Hunt2020)).

We created an ensemble ranking of features across the ESP-HC deficits t-test and machine learning algorithms. Importance of clinical features from each algorithm was given an ordinal rank, and to ensemble, ranks were averaged across algorithms and sorted, assigning a new ordinal rank according to sorting. Plots were created with ggplot2 (v3.3.5) and RColorBrewer (v1.1–2).

To test the importance of correct feature labelling to the classifiers, we randomly scrambled the channel labels of the PLF and PWE data matrices 20 times, and calculated the test set performance measures’ means and standard deviations across the trials. Original performance and scrambled F1 performance were demonstrated via boxplot, created with ggplot2 (v3.3.5) and RColorBrewer (v1.1–2).

Assessing clinical variables correlated with channels’ responses

We decomposed the clinical variables into principal components. We first selected the subset of clinical variables with 80% complete data, then took only ESP patients with complete data across these features, which yielded 24 clinical variables and 46 ESP patients. We assessed the correlation of clinical variables via Pearson correlation and generated a corrplot (ggcorrplot 0.1.4). We performed PCA decomposition of the clinical variables, and evaluated the eigenvectors, retaining 9 principal components (PCs). The clinical variables driving each PC were thresholded by>10% contribution (factoextra 1.0.7). Each EEG channel’s PLF and PWE measurements were correlated with each of the coordinates for the 9 PCs using a Pearson correlation test, two-sided. P-values were corrected for multiple testing using Benjamini-Hochberg false discovery rate (FDR). The contribution of the clinical variables to the principal components were plotted via heatmap (pheatmap 1.0.12).

Assessing baseline PLF / PWE with clinical variables at baseline and one-year follow-up

Baseline and one-year follow-up clinical measures were correlated for the clinical assessments from 23 patients (8 SSD, 15 AP) via Pearson correlation, two-sided alternative hypothesis. Baseline PLF and PWE channels were correlated with baseline clinical measures and one-year follow-up clinical assessments from 23 patients (8 SSD, 15 AP) via Pearson correlation, one-sided alternative hypothesis as indicated per variable. Functioning (GAF, MCAS) range from high (less deficit) to low (more deficit) and have the alternative hypothesis of ‘greater’, whereas disease severity as measured by PANSS (all), YMRS, MADRS range from low (less severe) to high (most severe) and have the alternative hypothesis of ‘less’. (Supplementary Table 9, Supplementary Table 10). Correlation linear model plots were created with ggplot2 (v3.3.5) and ggpubr (v0.4.0).

Results

Comparing case-control ASSR

We compared the gamma band (40 Hz) ASSR response in HC versus ESP in both the PLF and PWE paradigms, demonstrated visually through representative channels as frequency spectrograms (Fig. 2A–B). We assessed these differences via a one-sided t-test, and found that deficits already exist at ESP (Fig. 2C–D). In PLF, a pattern of fronto-parietal deficits are present, but with the smallest p-values in the temporal T8 (p = 0.00137) and T7 (p = 0.0015) electrodes. PLF had 30 significant channels at a significance threshold (alpha) of p ≤ 0.05. In PWE, the pattern of deficits is centred around Fz and the frontal lobe, with 46 significant channels at a significance threshold (alpha) of p ≤ 0.05. The minimal p-value in PLF is 0.00138 for channel T8 whereas the minimal p-value for PWE is 0.0111 for F4. The full list of p-values are available in s = Supplementary Table 3.

Figure 2. ESP deficits across channels in PLF and PWE. Average 40 Hz ASSR frequency spectrograms over representative channels for HC (N = 58; top) and ESP (N = 72; bottom) for PLF (A) and PWE (B). Channels from left-right are AFz, Fz, F3, F4, Cz, C3, C4, T7, T8, Pz, P3, P4, Oz, O1, O2. For each channel in PLF (C) and PWE (D), a one-sided student’s t-test with the alternative hypothesis of ‘greater’ was run for HC versus ESP. Scale is inverse log10 p-value, with a maximum p-value of 0 and a minimum p-value of 0.00138.

Machine learning models classify case/control status from channels

We evaluated multiple machine learning techniques – random forest (RF), elastic net linear model with L2 penalisation (ridge), Gaussian process with a radial basis function (Gaussian radial), SVM with a radial basis kernel (SVM radial), and naive Bayes – to classify whether a participant was case or control using the PLF or PWE channels.

In the PLF paradigm, RF had the best F1 (0.69), accuracy (0.64), and balanced accuracy (0.63), and lowest root mean square error (RMSE) (0.60) in the test data. Naive Bayes had the second highest F1 (0.67) and performed similarly to RF. The Gaussian radial had the lowest F1 (0.57), accuracy (0.52), and balanced accuracy (0.51), with the highest RMSE (0.69) (Fig. 3A, Supplementary Table 4). The chance level of accuracy is 0.55 given 72 cases and 58 healthy controls.

Figure 3. Machine learning metrics for PLF and PWE. (A) For PLF and PWE test, machine learning metrics F1, overall accuracy (AccOverall), balanced accuracy (AccBal), and root mean square error (RMSE) are displayed for random forest (RF), Ridge (L2 elasticnet), Gaussian process with a radial kernel (Gaussian Radial), support vector machines with a radial kernel (SVM Radial), and naive Bayes. (B). For PLF and PWE, original test F1, and scrambled labels F1 over 20 permutations for each machine learning algorithm are demonstrated via boxplot. (C) For PLF and PWE, ensemble ranking metrics for t-test (increasing p-value), RF (mean Gini index), Ridge (beta coefficients), Gaussian Radial (AUC), SVM radial (AUC), Naive Bayes (AUC), and average rank across all six metrics (avg_rank) are displayed.

In contrast, the PWE paradigm had markedly less classification power and the performance across techniques varied markedly from the PLF paradigm. RF performed the worst with less than chance performance in F1 (0.38), accuracy (0.36), and balanced accuracy (0.36), and had the highest RMSE (0.80). Of the five algorithms, Gaussian radial performed the best, in F1 (0.65), accuracy (0.56), balanced accuracy (0.54), and had the lowest RMSE (0.66) (Fig. 3A Supplementary Table 4).

To better understand the importance of feature specificity for these techniques, we scrambled the feature labels 20 times and assessed F1, accuracy, and AUC. Scrambling the feature labels caused RF’s performance to drop off in PLF, suggesting its specificity in channel selection, whereas RF’s performance improved in PWE, which was previously below chance—indicating RF was overfitting in PWE. Performance for Gaussian radial markedly decreased in PWE indicating there existence of meaningful pattern across the channels that is lost when scrambling (Fig. 3B, Supplementary Table 4).

For each machine learning algorithm, the importance of the features to the model was extracted. RF utilises mean Gini index, ridge is directly interpretable in terms of the elastic net beta coefficients, while Gaussian radial, SVM radial, and naive Bayes all use the area under the curve (AUC) of the receiver operator characteristic (ROC) for each independent channel. In particular, for the PLF RF algorithm, the five highest-important channels were F8, C6, CP5, CP1, and P10 (Supplementary Table 5).

We combined these rankings with the t-test rankings, and derived an ensemble (average) rank for each channel in both PLF and PWE. For PLF, channels T8, CP5, FT8, T7, and P7 were the five highest-ranked channels in their ability to discriminate between ESP and HC (Fig. 3C, Supplementary Table 5). Conversely, the five PWE channels best able to discriminate between ESP and HC were F4, Fz, F6, T8, and C4 (Fig. 3C, Supplementary Table 5).

Combinations of clinical variables correlate with channels’ responses

For a subset of ESP with at least 80% completed clinical data (N = 46), we assessed the degree to which clinical variables are correlated, to find strong (absolute(R)>0.5) correlations between the PANSS scores, GAF and MCAS; MCCB subscores; and cannabis use (Fig. 4A, Supplementary Table 6). We performed PCA to decompose the clinical variables into components of highly-correlated features, thresholded by having>10% contribution to the principal component (PC). For the 46 ESP with complete data, we correlated each channel in PLF and PWE with the first nine PCs, which collectively explain 81% of the variance. The first PC has major contributions from GAF, PANSS Positive, PANSS Negative, and YMRS (Fig. 4B, Supplementary Table 6).

Figure 4. Clinical variables decomposed into principal components, and correlations to PLF and PWE. (A) Clinical variables with<20% missingness are correlated across 43 ESP with complete data via Pearson correlation. The Viridis colour scale shows high Pearson correlation value (yellow) to low Pearson correlation value (purple). (B). For the first 9 principal components, the contribution of each clinical variable is shown via colour bar, scaled per principal component. PLF demonstrates 52 channels had a significant Pearson correlation to PC2 (significance level of FDR ≤ 0.01) (highlighted in red).

Across the components, PC2 and PLF measures on 42 channels had a Pearson correlation coefficient (R) greater than 0.3, with 52 channels that were significantly correlated (FDR ≤ 0.01). PC2 is represented by MCCB cognition measures (processing, memory, visual solving). (Fig. 4B, Supplementary Table 6, Supplementary Table 7). In contrast, we did not find strong correlation within clinical components for PWE measures at FDR<0.01 (Supplementary Table 6, Supplementary Table 8).

We further investigated the relationships between the ASSR and the individual clinical measures at baseline for the subset of patients (N = 23) with longitudinal data: baseline PLF correlated to baseline GAF in 22 channels, MCAS in 3, YMRS in 24, MADRS in 17, PANSS General in 15, PANSS Negative in 10, and PANSS Total in 22 channels. Baseline PWE did not correlate with baseline GAF in any channels at an alpha ≤ 0.05, but correlated to baseline MADRS in 24 channels, and YMRS in Cz (Fig. 5, Supplementary Table 9, Supplementary Table 10).

Figure 5. Correlations of baseline FCz and Fz to longitudinal GAF. For PLF and PWE (A,C), one-sided Pearson correlation (R) of baseline FCz to baseline and one-year GAF score are displayed, along with correlation test p-value. FCz is the x-axis, GAF score is the y-axis. Blue line is the linear model trend line, grey is the standard error. For PLF and PWE (B,D), one-sided Pearson correlation (R) of baseline Fz to baseline and one-year GAF score are displayed, along with correlation test p-value. Fz is the x-axis, GAF score is the y-axis. Blue line is the linear model trend line, grey is the standard error.

Baseline channels predict functional outcome at follow-up

One-year follow-up clinical measures for a subset of patients (N = 23) were available. First, we measured the correlation of each clinical measure at baseline to its value at one year. Significant correlations were found for GAF (R = 0.75, p = 1.35e-3), MCAS (R = 0.804, p = 1.02e-4), and MADRS (R = 0.617, p = 8.35e-3) (Supplementary Table 11). We then examined correlations between baseline central channels FCz and Fz with clinical variables at the one-year follow-up. GAF was found to be significant at an alpha ≤ 0.05 in PLF FCz (R = 0.43, p = 0.037), PWE FCz (R = 0.49, p = 0.020), and PWE Fz (R = 0.44, p = 0.034) ] via a Pearson one-sided (alternative=’greater’) correlation test, as shown in Fig. 5, Supplementary Table 9, and Supplementary Table 10. We extended this analysis to measure the correlation of baseline 40 Hz to one-year GAF on all channels. In total, PLF had 11 channels significant (p ≤ 0.05) for GAF, while PWE had 21 (Supplementary Table 9, Supplementary Table 10).

Additionally, one-year PANSS Positive significantly correlated with baseline PWE at FCz (R = -0.39, p = 0.043) and one-year PANSS Total with PWE at Fpz (R = -0.40, p = 0.038) (Supplementary Table 10), while one-year YMRS was significantly correlated with PLF at F7 (R = -0.39, p = 0.044) (Supplementary Table 9).

Discussion

Our study employs a comprehensive, whole-scalp approach to examine 40 Hz ASSR in ESP and HC and explores various machine learning techniques to identify the channels most indicative of deficits and relevant to clinical variables. Our findings reveal that i) ASSR deficits are already present in ESP; ii) PLF outperforms PWE in predictive and discriminative power through machine learning, and focuses on a subset of channels; iii) there is a strong correlation between cognitive measures and PLF; and iv) baseline 40 Hz ASSR is correlated with longitudinal functioning at a short-term follow-up. PLF has a more predictive, parsimonious signature than PWE, with a subset of frontal and temporal channels particularly relevant in PLF, while front-central channels are disrupted in PWE.

Gamma band deficits are present in ESP

We found reduced gamma band ASSRs in ESP patients across multiple channels. This result is consistent with previous most reports in first episode psychosis patient (Spencer et al., Reference Spencer, Salisbury, Shenton and Mccarley2008b; Tada et al., Reference Tada, Nagai, Kirihara, Koike, Suga, Araki, Kobayashi and Kasai2016; Alegre et al., Reference Alegre, Molero, Valencia, Mayner, Ortuno and Artieda2017; Koshiyama et al., Reference Koshiyama, Kirihara, Tada, Nagai, Fujioka, Ichikawa, Ohta, Tani, Tsuchiya, Kanehara, Morita, Sawada, Matsuoka, Satomura, Koike, Suga, Araki and Kasai2018), including a study using MEG approach (Grent-’t-Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021). This reiterates the utility of using the 40 Hz ASSR in assessing patient deficits in ESP. Across the whole scalp, the biggest differences between patients and controls in PLF are electrodes in the temporal sites (T7 and T8), consistent with evidence indicating the primary auditory cortex and superior temporal cortex being one key generators of 40 Hz ASSR (Draganova et al., Reference Draganova, Ross, Wollbrink and Pantev2007).

Predicting case/control status from channels

We applied multiple machine learning techniques using channels across the whole scalp to examine which channels are the most important in discriminating ESP/control status at baseline. The best performing classifier is a random forest (RF) on PLF, with measurements modestly predictive with an F1 score of 0.69. The channels with the highest feature importance were F8, C6, CP5, CP1, and P10. In relation to the temporal sites (T7 and T8), F8, C6, and P10 form a triangle around T8, and CP5 and CP1 extend from T7. This indicates that the RF is using the patterns across these channels to robustly distinguish on an individual level ESP and HC. The best performing classifier for PWE is a Gaussian process (GP) with a radial basis function that achieves an F1 score of 0.65. Notably, switching the type of classifier from the other dataset yields much lower performance (yielding F1 scores of 0.57 for GP on PLF and 0.35 for RF on PWE). While both algorithms can create nonlinear decision boundaries, the specificity of the two types of classifiers to data is explainable. Random forests choose a set of decision trees based on channel specificity and is precise in channel selection in PLF, while Gaussian processes with a radial kernel use patterns across all channels and therefore do better with PWE. To test the validity of our results, we found that scrambling the channel labelling impeded the random forest’s ability to make accurate predictions in PLF. The Gaussian process’ predictive accuracy dropped when scrambling feature labels for PWE, hinting at specific patterning across the channels. Overall, the moderate F1 score for the best model may be due to small sample sizes and inherited heterogeneity of the patient population. Future studies with a larger cohort size (N) may be better poised to build a multi-diagnosis classifier.

Utility of clinical variables in predicting channels’ responses

Few studies use clinical variables to relate to the ASSR channels’ responses (Mulert et al., Reference Mulert, Kirsch, Pascual-Marqui, Mccarley and Spencer2011; Zhou et al., Reference Zhou, Mueller, Spencer, Mallya, Lewandowski, Norris, Levy, Cohen, Ongur and Hall2018). In this study, we implemented a PCA strategy to group correlated clinical variables into components and examined the correlation patterns of these components with each electrode channel across the whole scalp in PLF and PWE. Not surprisingly, we found that symptom severity scales (PANSS General, Negative, Positive, YMRS, and MADRS) all positively correlated with one another and negatively correlated with global functioning (GAF) and community functioning (MCAS) (i.e., high symptom score, low functioning score and low symptom score, high functioning) (Fig. 4A). Cognitive variables (processing speed, attention, memory, visual, and problem solving) from MCCB measures were modestly correlated with one another, and comprised principal component 2 (PC2) (Fig. 4A–B). Applying the PCA embedding to PLF, we found PLF was correlated with PC2 (88% within PC/8% across PC of total channels FDR ≤ 0.01, R range between 0.30 and 0.40). This result provides supporting evidence that 40 Hz ASSR impairment relates to cognitive function and attention mechanism (Parciauskaite et al., Reference Parciauskaite, Bjekic and Griskova-Bulanova2021; Coffman et al., Reference Coffman, Ren, Longenecker, Torrence, Fishel, Seebold, Wang, Curtis and Salisbury2022). In contrast, we did not find strong evidence of PWE in relation to the clinical components, but a few channels correlated with the component (PC7) associated with manic symptoms and cannabis use.

Baseline channels predict longitudinal functional outcome

One prior study by Koshiyama et al., found a correlation between 40 Hz ASSR at FCz and future global functional outcome (GAF) in ESP (Koshiyama et al., Reference Koshiyama, Kirihara, Tada, Nagai, Fujioka, Ichikawa, Ohta, Tani, Tsuchiya, Kanehara, Morita, Sawada, Matsuoka, Satomura, Koike, Suga, Araki and Kasai2018). We show that, in addition to FCz as reported in Koshiyama’s study, 10 PLF and 20 PWE channels at baseline all correlated with GAF at one year, that is, greater ASSR impairments at baseline predict lower functioning one year later (Supplementary Table 9, Supplementary Table 10). Baseline GAF is also highly correlated to one-year GAF, as is MCAS and MADRS (Supplementary Table 11). Most clinical measures had baseline correlations to PLF (except PANSS Total), but the correlations drop out at one-year follow-up. Interestingly, both PLF and PWE demonstrated baseline correlation to MADRS (PLF 17 channels, PWE 24 channels), which is consistent with findings in Parker et al (Parker et al., Reference Parker, Hamm, Mcdowell, Keedy, Gershon, Ivleva, Pearlson, Keshavan, Tamminga, Sweeney and Clementz2019). Although both baseline MCAS and GAF are highly correlated with follow-up MCAS and GAF, baseline ASSR only predicts one-year GAF. No significant longitudinal correlations were found for the MCAS measure. GAF is rated as an overall impression of symptomatic and functioning performance whereas the modified MCAS scores several different axes of day-to-day functionality independently (e.g., independent living, meaningful activity, social activity and relationships, management of money) (Hendryx et al., Reference Hendryx, Dyck, Mcbride and Whitbeck2001; Chan et al., Reference Chan, Brady, Lewandowski, Higgins, Öngür and Hall2021). It is plausible that 40 Hz ASSR may be more sensitive in capturing an individual’s overall symptom and occupational functional state and that a larger sample is needed to have sufficient power for partitioning the precise functionality trajectories.

Overall PLF and PWE signature

Ding and Simon have demonstrated that phase synchronisation, as measured by PLF, is much more sensitive to stimulus-synchronized neural activity than power (Ding and Simon, Reference Ding and Simon2013). Specifically, gamma-band phase locking is not sensitive to modulations in signal amplitude, or changes in amplitude across subjects caused by differences in electrode impedance, and, thus, is a more robust measure of the underlying neural synchrony than gamma-band power. In this study, our findings corroborate this as PLF produced more extreme p-values on a subset of channels, PLF outperformed PWE in classification accuracy/F1 score, and showed stronger correlations with clinical variables.

Right hemispheric laterality of deficits

Interestingly, we observed that there appears to be hemispheric asymmetries, specifically in PLF, with more prominent contribution of 40 Hz ASSR in the right hemisphere. For example, the random forest trained to classify ESP relied on channels F8 and CP6. Ensemble rankings of PLF features also implicate F4, F6, T8, and C4 as the most important features driving classification of ESP versus HC. This is consistent with the literature, in which ASSR showing right hemispheric dominance (Grent-’t-Jong et al., Reference Grent-’t-Jong, Gajwani, Gross, Gumley, Krishnadas, Lawrie, Schwannauer, Schultze-Lutter and Uhlhaas2021)

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/neu.2024.60.

Acknowledgements

The authors would like to thank all of the participants who took part in this study.

Author contributions

K.M.H contributed to the conceptualisation, methodology, software, formal analysis, data curation, writing, review, and visualisation. A.H. contributed to the formal analysis and investigation. A.J.B. contributed to the conceptualisation, methodology, writing, review, visualisation, and supervision. M.H.H. contributed to the conceptualisation, formal analysis, data curation, investigation, resources, writing, review, supervision, project administration, and funding acquisition.

Funding statement

This work was funded by the National Institute of Health/National Institute of Mental Health R01MH109687 to MHH.

Competing interests

The authors declare no competing interests.

Ethical standard

This study was approved by the McLean Hospital Institutional Review Board under MassGeneral Brigham IRB 2015P002741.

Footnotes

Present Address: Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA.

References

Addington, J and Addington, D (2000) Neurocognitive and social functioning in schizophrenia: a 2.5 year follow-up study. Schizophrenia Research 44, 47–56.Google Scholar

Alegre, M, Molero, P, Valencia, M, Mayner, G, Ortuno, F and Artieda, J (2017) Atypical antipsychotics normalize low-gamma evoked oscillations in patients with schizophrenia. Psychiatry Research 247, 214–221.Google Scholar

August, SM, Kiwanuka, JN, Mcmahon, RP and Gold, JM (2012) The MATRICS consensus cognitive battery (MCCB): clinical and cognitive correlates. Schizophrenia Research 134, 76–82.Google Scholar

Barros, C, Silva, CA and Pinheiro, AP (2021) Advanced EEG-based learning approaches to predict schizophrenia: promises and pitfalls. Artificial Intelligence in Medicine 114, 102039.Google Scholar

Buzsaki, G and Wang, XJ (2012) Mechanisms of gamma oscillations. Annual Review of Neuroscience 35, 203–225.Google Scholar

Chan, SY, Brady, R, Hwang, M, Higgins, A, Nielsen, K, Ongur, D and Hall, MH (2020) Heterogeneity of outcomes and network connectivity in early-stage psychosis: A longitudinal study. Schizophrenia Bulletin 47, 138–148.Google Scholar

Chan, SY, Brady, RO, Lewandowski, KE, Higgins, A, Öngür, D and Hall, MH (2021) Dynamic and progressive changes in thalamic functional connectivity over the first five years of psychosis. Molecular Psychiatry 2, 1177–1183.Google Scholar

Coffman, BA, Ren, X, Longenecker, J, Torrence, N, Fishel, V, Seebold, D, Wang, Y, Curtis, M and Salisbury, DF (2022) Aberrant attentional modulation of the auditory steady state response (ASSR) is related to auditory hallucination severity in the first-episode schizophrenia-spectrum. Journal of Psychiatric Research 151, 188–196.Google Scholar

Ding, N and Simon, JZ (2013) Adaptive temporal encoding leads to a background-insensitive cortical representation of speech. Journal of Neuroscience 33, 5728–5735.Google Scholar

Donde, C, Kantrowitz, JT, Medalia, A, Saperstein, AM, Balla, A, Sehatpour, P, Martinez, A, O.’connell, MN and Javitt, DC (2023) Early auditory processing dysfunction in schizophrenia: mechanisms and implications. Neuroscience & Biobehavioral Reviews 148, 105098.Google Scholar

Draganova, R, Ross, B, Wollbrink, A and Pantev, C (2007) Cortical steady-state responses to central and peripheral auditory beats. Cerebral Cortex 18, 1193–1200.Google Scholar

Endicott, J, Spitzer, RL, Fleiss, JL and Cohen, J (1976) The global assessment scale. A procedure for measuring overall severity of psychiatric disturbance. Archives of General Psychiatry 33, 766–771.Google Scholar

Friedman, J, Tibshirani, R and Hastie, T (2010) Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software 33, 1–22.Google Scholar

Gardner, DM, Murphy, AL, O’donnell, H, Centorrino, F and Baldessarini, RJ (2010) International consensus study of antipsychotic dosing. American Journal of Psychiatry 167, 686–693.Google Scholar

Gratton, G, Coles, MG and Donchin, E (1983) A new method for off-line removal of ocular artifact. Electroencephalography and Clinical Neurophysiology 55, 468–484.Google Scholar

Grent-’t-Jong, T, Gajwani, R, Gross, J, Gumley, AI, Krishnadas, R, Lawrie, SM, Schwannauer, M, Schultze-Lutter, F and Uhlhaas, PJ (2021) 40-hz auditory steady-state responses characterize circuit dysfunctions and predict clinical outcomes in clinical high-risk for psychosis participants: a magnetoencephalography study. Biological Psychiatry 90, 419–429.Google Scholar

Hall, MH, Smoller, JW, Cook, NR, Schulze, K, Hyoun Lee, P, Taylor, G, Bramon, E, Coleman, MJ, Murray, RM, Salisbury, DF and Levy, DL (2012) Patterns of deficits in brain function in bipolar disorder and schizophrenia: a cluster analytic study. Psychiatry Research 200, 272–280.Google Scholar

Hendryx, M, Dyck, DG, Mcbride, D and Whitbeck, J (2001) A test of the reliability and validity of the multnomah community ability scale. Community Mental Health Journal 37, 157–168.Google Scholar

Hirano, Y and Uhlhaas, PJ (2021) Current findings and perspectives on aberrant neural oscillations in schizophrenia. Psychiatry and Clinical Neurosciences 75, 358–368.Google Scholar

Hunt, T (2020) ModelMetrics: rapid calculation of model metrics. R package version 1.2.2.2. https://CRAN.R-project.org/package=ModelMetrics.Google Scholar

Javitt, DC, Spencer, KM, Thaker, GK, Winterer, G and Hajos, M (2008) Neurophysiological biomarkers for drug development in schizophrenia. Nature Reviews Drug Discovery 7, 68–83.Google Scholar

Johannesen, JK, O’donnell, BF, Shekhar, A, Mcgrew, JH and Hetrick, WP (2013) Diagnostic specificity of neurophysiological endophenotypes in schizophrenia and bipolar disorder. Schizophrenia Bulletin 39, 1219–1229.Google Scholar

Kahn, RS and Keefe, RS (2013) Schizophrenia is a cognitive illness: time for a change in focus. JAMA Psychiatry 70, 1107–1112.Google Scholar

Kalin, M, Kaplan, S, Gould, F, Pinkham, AE, Penn, DL and Harvey, PD (2015) Social cognition, social competence, negative symptoms and social outcomes: inter-relationships in people with schizophrenia. Journal of Psychiatric Research 68, 254–260.Google Scholar

Karatzoglou, A, Smola, A and Hornik, K (2023) kernlab: Kernel-Based Machine Learning Lab. R package version 0.9–31. https://CRAN.R-project.org/package=kernlab.Google Scholar

Karatzoglou, A, Smola, A, Hornik, K and Zeileis, A (2004) Kernlab – an S4 package for kernel methods in R. Journal of Statistical Software 11, 1–20.Google Scholar

Kay, SR, Fiszbein, A and Opler, LA (1987) The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophrenia Bulletin 13, 261–276.Google Scholar

Koshiyama, D, Kirihara, K, Tada, M, Nagai, T, Fujioka, M, Ichikawa, E, Ohta, K, Tani, M, Tsuchiya, M, Kanehara, A, Morita, K, Sawada, K, Matsuoka, J, Satomura, Y, Koike, S, Suga, M, Araki, T and Kasai, K (2018) Auditory gamma oscillations predict global symptomatic outcome in the early stages of psychosis: A longitudinal investigation. Clinical Neurophysiology 129, 2268–2275.Google Scholar

Koshiyama, D, Miyakoshi, M, Joshi, YB, Molina, JL, Tanaka-Koshiyama, K, Joyce, S, Braff, DL, Swerdlow, NR and Light, GA (2021a) Neural network dynamics underlying gamma synchronization deficits in schizophrenia. Progress in Neuro-Psychopharmacology and Biological Psychiatry 107, 110224.Google Scholar

Koshiyama, D, Miyakoshi, M, Joshi, YB, Molina, JL, Tanaka-Koshiyama, K, Sprock, J, Braff, DL, Swerdlow, NR and Light, GA (2020) A distributed frontotemporal network underlies gamma-band synchronization impairments in schizophrenia patients. Neuropsychopharmacology 45, 2198–2206.Google Scholar

Koshiyama, D, Miyakoshi, M, Thomas, ML, Joshi, YB, Molina, JL, Tanaka-Koshiyama, K, Sprock, J, Braff, DL, Swerdlow, NR and Light, GA (2021b) Unique contributions of sensory discrimination and gamma synchronization deficits to cognitive, clinical, and psychosocial functional impairments in schizophrenia. Schizophrenia Research 228, 280–287.Google Scholar

Kozhemiako, N, Wang, J, Jiang, C, Wang, LA, Gai, G, Zou, K, Wang, Z, Yu, X, Zhou, L, Li, S, Guo, Z, Law, R, Coleman, J, Mylonas, D, Shen, L, Wang, G, Tan, S, Qin, S, Huang, H, Murphy, M, Stickgold, R, Manoach, D, Zhou, Z, Zhu, W, Hal, MH, Purcell, SM and Pan, JQ (2022) Non-rapid eye movement sleep and wake neurophysiology in schizophrenia. Elife 11, e76211.Google Scholar

Kuhn and Max. (2008) Building predictive models in R using the caret package. Journal of Statistical Software 28, 1–26.Google Scholar

Liaw, A and Wiener, M (2002) Classification and regression by randomForest. R News 2, 18–22.Google Scholar

Majka, M (2024) naivebayes: high performance implementation of the naive bayes algorithm in R. R package version 0.9.7. https://CRAN.R-project.org/package=naivebayes.Google Scholar

Martínez-Cagigal, V (2020) Topographic EEG/MEG plot. MATLAB file exchange version 1.5. https://www.mathworks.com/matlabcentral/fileexchange/72729-topographic-eeg-meg-plot.Google Scholar

Mcgorry, PD (2002) The recognition and optimal management of early psychosis: an evidence-based reform. World Psychiatry 1, 76–83.Google Scholar

Montgomery, SA and Asberg, M (1979) A new depression scale designed to be sensitive to change. British Journal of Psychiatry 134, 382–389.Google Scholar

Mulert, C, Kirsch, V, Pascual-Marqui, R, Mccarley, RW and Spencer, KM (2011) Long-range synchrony of gamma oscillations and auditory hallucination symptoms in schizophrenia. International Journal of Psychophysiology 79, 55–63.Google Scholar

Nunez, PL and Srinivasan, R (2006) Electric fields of the brain: the neurophysics of EEG. New York: Oxford University Press, pp. 313–352.Google Scholar

O’donnell, BF, Vohs, JL, Krishnan, GP, Rass, O, Hetrick, WP and Morzorati, SL (2013) The auditory steady-state response (ASSR): a translational biomarker for schizophrenia. Suppl Clin Neurophysiol 62, 101–112.Google Scholar

Onitsuka, T, Tsuchimoto, R, Oribe, N, Spencer, KM and Hirano, Y (2022) Neuronal imbalance of excitation and inhibition in schizophrenia: a scoping review of gamma-band ASSR findings. Psychiatry and Clinical Neurosciences 76, 610–619.Google Scholar

Pantelis, C, Yucel, M, Wood, SJ, Velakoulis, D, Sun, D, Berger, G, Stuart, GW, Yung, A, Phillips, L and Mcgorry, PD (2005) Structural brain imaging evidence for multiple pathological processes at different stages of brain development in schizophrenia. Schizophrenia Bulletin 31, 672–696.Google Scholar

Parciauskaite, V, Bjekic, J and Griskova-Bulanova, I (2021) Gamma-range auditory steady-state responses and cognitive performance: a systematic review. Brain Sciences 11, 217.Google Scholar

Parker, DA, Hamm, JP, Mcdowell, JE, Keedy, SK, Gershon, ES, Ivleva, EI, Pearlson, GD, Keshavan, MS, Tamminga, CA, Sweeney, JA and Clementz, BA (2019) Auditory steady-state EEG response across the schizo-bipolar spectrum. Schizophrenia Research 209, 218–226.Google Scholar

Saeidi, M, Karwowski, W, Farahani, FV, Fiok, K, Taiar, R, Hancock, PA and Al-Juaid, A (2021) Neural decoding of EEG signals with machine learning: a systematic review. Brain Sciences 11, 1525.Google Scholar

Spencer, KM, Niznikiewicz, MA, Shenton, ME and Mccarley, RW (2008a) Sensory-evoked gamma oscillations in chronic schizophrenia. Biological Psychiatry 63, 744–747.Google Scholar

Spencer, KM, Salisbury, DF, Shenton, ME and Mccarley, RW (2008b) Gamma-band auditory steady-state responses are impaired in first episode psychosis. Biological Psychiatry 64, 369–375.Google Scholar

Steinmann, I and Gutschalk, A (2011) Potential fMRI correlates of 40-Hz phase locking in primary auditory cortex, thalamus and midbrain. NeuroImage 54, 495–504.Google Scholar

Tada, M, Kirihara, K, Ishishita, Y, Takasago, M, Kunii, N, Uka, T, Shimada, S, Ibayashi, K, Kawai, K, Saito, N, Koshiyama, D, Fujioka, M, Araki, T and Kasai, K (2021) Global and parallel cortical processing based on auditory gamma oscillatory responses in humans. Cereb Cortex, 31, 4518–4532.Google Scholar

Tada, M, Nagai, T, Kirihara, K, Koike, S, Suga, M, Araki, T, Kobayashi, T and Kasai, K (2016) Differential alterations of auditory gamma oscillatory responses between pre-onset high-risk individuals and first-episode schizophrenia. Cerebral Cortex 26, 1027–1035.Google Scholar

Tay, JK, Narasimhan, B and Hastie, T (2023) Elastic net regularization paths for all generalized linear models. Journal of Statistical Software 106, 1–31.Google Scholar

Thune, H, Recasens, M and Uhlhaas, PJ (2016) The 40-Hz auditory steady-state response in patients with schizophrenia: a meta-analysis. JAMA Psychiatry 73, 1145–1153.Google Scholar

Uhlhaas, PJ and Singer, W (2010) Abnormal neural oscillations and synchrony in schizophrenia. Nature Reviews Neuroscience 11, 100–113.Google Scholar

Young, RC, Biggs, JT, Ziegler, VE and Meyer, DA (1978) A rating scale for mania: reliability, validity and sensitivity. British Journal of Psychiatry 133, 429–435.Google Scholar

Zhou, TH, Mueller, NE, Spencer, KM, Mallya, SG, Lewandowski, KE, Norris, LA, Levy, DL, Cohen, BM, Ongur, D and Hall, MH (2018) Auditory steady state response deficits are associated with symptom severity and poor functioning in patients with psychotic disorder. Schizophrenia Research 201, 278–286.Google Scholar

Figure 1. Study schematics.

Figure 2. ESP deficits across channels in PLF and PWE. Average 40 Hz ASSR frequency spectrograms over representative channels for HC (N = 58; top) and ESP (N = 72; bottom) for PLF (A) and PWE (B). Channels from left-right are AFz, Fz, F3, F4, Cz, C3, C4, T7, T8, Pz, P3, P4, Oz, O1, O2. For each channel in PLF (C) and PWE (D), a one-sided student’s t-test with the alternative hypothesis of ‘greater’ was run for HC versus ESP. Scale is inverse log10 p-value, with a maximum p-value of 0 and a minimum p-value of 0.00138.

Figure 3. Machine learning metrics for PLF and PWE. (A) For PLF and PWE test, machine learning metrics F1, overall accuracy (AccOverall), balanced accuracy (AccBal), and root mean square error (RMSE) are displayed for random forest (RF), Ridge (L2 elasticnet), Gaussian process with a radial kernel (Gaussian Radial), support vector machines with a radial kernel (SVM Radial), and naive Bayes. (B). For PLF and PWE, original test F1, and scrambled labels F1 over 20 permutations for each machine learning algorithm are demonstrated via boxplot. (C) For PLF and PWE, ensemble ranking metrics for t-test (increasing p-value), RF (mean Gini index), Ridge (beta coefficients), Gaussian Radial (AUC), SVM radial (AUC), Naive Bayes (AUC), and average rank across all six metrics (avg_rank) are displayed.

Figure 4. Clinical variables decomposed into principal components, and correlations to PLF and PWE. (A) Clinical variables with<20% missingness are correlated across 43 ESP with complete data via Pearson correlation. The Viridis colour scale shows high Pearson correlation value (yellow) to low Pearson correlation value (purple). (B). For the first 9 principal components, the contribution of each clinical variable is shown via colour bar, scaled per principal component. PLF demonstrates 52 channels had a significant Pearson correlation to PC2 (significance level of FDR ≤ 0.01) (highlighted in red).

Figure 5. Correlations of baseline FCz and Fz to longitudinal GAF. For PLF and PWE (A,C), one-sided Pearson correlation (R) of baseline FCz to baseline and one-year GAF score are displayed, along with correlation test p-value. FCz is the x-axis, GAF score is the y-axis. Blue line is the linear model trend line, grey is the standard error. For PLF and PWE (B,D), one-sided Pearson correlation (R) of baseline Fz to baseline and one-year GAF score are displayed, along with correlation test p-value. Fz is the x-axis, GAF score is the y-axis. Blue line is the linear model trend line, grey is the standard error.