Introduction
Hallucinations are aberrant perceptual experiences that are reported in several major psychiatric and neurological conditions (Eversfield & Orton, Reference Eversfield and Orton2019; Insel, Reference Insel2010; Toh, Thomas, & Rossell, Reference Toh, Thomas and Rossell2015; Waters & Fernyhough, Reference Waters and Fernyhough2017). Hallucinations are of major negative impact. Associated with delusions and psychosis, they often require repeated hospitalization. In neurodegenerative diseases, they increase the likelihood of earlier home placement (e.g.. Aarsland et al., Reference Aarsland, Larsen, Tandberg and Laake2000) and have been linked to higher mortality (Gonzalez et al., Reference Gonzalez, Dalen, Maple-Grødem, Tysnes and Alves2022). Hallucinations are very common and even occur in 5–10% of the general population, without any medical diagnosis (Larøi et al., Reference Larøi, Bless, Laloyaux, Kråkvik, Vedul-Kjelsås, Kalhovde and Hugdahl2019). Despite this clinical relevance, hallucination research, and the understanding of the underling brain mechanisms, have been hampered by methodological shortcomings to investigate hallucinations in real-time under controlled experimental settings. A major shortcoming is the absence of controlled procedures allowing to induce hallucinations in laboratory and/or clinical settings (Bernasconi et al., Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022; Pearson et al., Reference Pearson, Chiou, Rogers, Wicken, Heitmann and Ermentrout2016). Moreover, hallucination research has predominantly been conducted in clinical populations, and is therefore confounded by co-morbidities existing in the tested populations. Recent research has induced hallucinations in controlled laboratory in healthy individuals, using different pharmacological approaches (McClure-Begley & Roth, Reference McClure-Begley and Roth2022; Vollenweider & Preller, Reference Vollenweider and Preller2020) and several other procedures [i.e. Flicker-induced phosphenes (Allefeld, Pütz, Kastner, & Wackermann, Reference Allefeld, Pütz, Kastner and Wackermann2011; Pearson et al., Reference Pearson, Chiou, Rogers, Wicken, Heitmann and Ermentrout2016), Ganzfeld effect (Wackermann, Pütz, & Allefeld, Reference Wackermann, Pütz and Allefeld2008), sensory deprivation (Mason & Brady, Reference Mason and Brady2009; Merabet et al., Reference Merabet, Maguire, Warde, Alterescu, Stickgold and Pascual-Leone2004), Pavlovian conditioning (Powers, Mathys, & Corlett, Reference Powers, Mathys and Corlett2017b)].
Hallucinations induced by these methods, however, often have poor ecological validity, and, despite being induced under laboratory conditions and in healthy subjects, are characterized by low experimental control over their content (often not specific), timing and duration (may be long-lasting), and associated with impairments of consciousness (i.e. pharmacological approaches). We recently described a robotic procedure able to repeatedly induce a specific, highly-controllable and clinically-relevant sensorimotor hallucination – presence hallucination – the sensation that someone is nearby when no-one is actually present, and cannot be seen or heard (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021, Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014). Merging techniques from engineering and neuroscience, this procedure has been used to induce and understand clinically-relevant hallucinations (Bernasconi et al., Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022) under controlled experimental conditions in healthy individuals (Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Dhanis et al., Reference Dhanis, Blondiaux, Bolton, Faivre, Rognini, van de Ville and Blanke2022; Orepic, Park, Rognini, Faivre, & Blanke, Reference Orepic, Park, Rognini, Faivre and Blanke2022; Orepic, Rognini, Kannape, Faivre, & Blanke, Reference Orepic, Rognini, Kannape, Faivre and Blanke2021; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021) as well as in patients with Parkinson's disease (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021) and early psychosis patients (Salomon et al., Reference Salomon, Progin, Griffa, Rognini, Do, Conus and Blanke2020). This method, when combined with auditory-verbal tasks that are carried out during robotic stimulation, is able to induce deficits in auditory-verbal self-monitoring in psychosis patients (Salomon et al., Reference Salomon, Progin, Griffa, Rognini, Do, Conus and Blanke2020) and induce changes in voice perception in healthy participants (Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021). Here we asked whether the robotic procedure can be adapted to induce a hallucinatory state in healthy participants that is of major clinical relevance in psychiatry and comparable to auditory-verbal hallucinations (AVH).
AVH, the sensation of hearing voices without any speaker present (commonly known as ‘hearing voices’) are one of the most common (Bauer et al., Reference Bauer, Schanda, Karakula, Olajossy-Hilkesberger, Rudaleviciene, Okribelashvili and Stompe2011) and most distressing (Harkavy-Friedman et al., Reference Harkavy-Friedman, Kimhy, Nelson, Venarde, Malaspina and Mann2003) symptoms in schizophrenia spectrum disorder. AVH are characterized by a very heterogeneous phenomenology (e.g. varying with respect to voice numerosity, gender, frequency, emotional affect, etc.) (McCarthy-Jones et al., Reference McCarthy-Jones, Trauer, MacKinnon, Sims, Thomas and Copolov2014; Woods, Jones, Alderson-Day, Callard, & Fernyhough, Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015), and have also been observed in non-help-seeking individuals (Powers, Kelley, & Corlett, Reference Powers, Kelley and Corlett2017a; Sommer et al., Reference Sommer, Daalman, Rietkerk, Diederen, Bakker, Wijkstra and Boks2010), rendering their diagnosis and treatment challenging. With contemporary treatments being effective only to a certain degree (Lehman et al., Reference Lehman, Lieberman, Dixon, McGlashan, Miller, Perkins and Cook2004), there is a strong need for a better understanding of the underlying mechanisms. Despite the increasing frequency and the large number of studies revolving around AVH, their etiology remains debated, with two prominent and seemingly opposing theoretical accounts, suggesting that AVH result either from (1) deficits in self-monitoring, or (2) overly strong perceptual priors. Although both of these carry some empirical support, only theoretical speculations (Leptourgos & Corlett, Reference Leptourgos and Corlett2020; Northoff & Qin, Reference Northoff and Qin2011; Swiney & Sousa, Reference Swiney and Sousa2014; Synofzik, Vosgerau, & Newen, Reference Synofzik, Vosgerau and Newen2008; Wilkinson, Reference Wilkinson2014; Yttri, Urfer-Parnas, & Parnas, Reference Yttri, Urfer-Parnas and Parnas2022) have been made on how they might coexist in the brain and relate phenomenologically.
The self-monitoring account suggests that AVH arise from a deficit in self-monitoring, more specifically the inability to distinguish self- from other-related events. According to this framework (Miall & Wolpert, Reference Miall and Wolpert1996; Shadmehr, Smith, & Krakauer, Reference Shadmehr, Smith and Krakauer2010; Wolpert, Ghahramani, & Jordan, Reference Wolpert, Ghahramani and Jordan1995), self-other distinction is achieved by creating sensory predictions related to one's actions and by comparing them with the actual sensory feedback following those actions. When congruent with the sensory prediction, ascending sensory events are attenuated, and the action is attributed to the self, whereas if the prediction and the ascending sensory events are incongruent, attenuation is reduced and the action is attributed to an external agent (Heinks-Maldonado, Mathalon, Gray, & Ford, Reference Heinks-Maldonado, Mathalon, Gray and Ford2005; Shergill, Bays, Frith, & Wolpert, Reference Shergill, Bays, Frith and Wolpert2003). Impairments in self-monitoring have been observed in schizophrenia (Blakemore, Smith, Steel, Johnstone, & Frith, Reference Blakemore, Smith, Steel, Johnstone and Frith2000; Shergill, Samson, Bays, Frith, & Wolpert, Reference Shergill, Samson, Bays, Frith and Wolpert2005; Shergill et al., Reference Shergill, White, Joyce, Bays, Wolpert and Frith2014), and have been related to psychotic symptoms characterized by a misattribution of self-generated actions (Blakemore et al., Reference Blakemore, Smith, Steel, Johnstone and Frith2000; Frith, Reference Frith1987; Graham-Schmidt, Martin-Iverson, & Waters, Reference Graham-Schmidt, Martin-Iverson and Waters2016), including AVH, explaining them as a misattribution of self-generated speech toward external agents (Feinberg, Reference Feinberg1978; Ford, Roach, Faustman, & Mathalon, Reference Ford, Roach, Faustman and Mathalon2007; Frith, Reference Frith1992; Gould, Reference Gould1948; Green & Kinsbourne, Reference Green and Kinsbourne1990; Green & Preston, Reference Green and Preston1981; McGuigan, Reference McGuigan1966; Moseley, Fernyhough, & Ellison, Reference Moseley, Fernyhough and Ellison2013). However, the evidence supporting the self-monitoring account for AVH remains largely implicit, as AVH are rarely related to direct experimental manipulations of motor signals and their corresponding sensory feedback. Its empirical support mainly consists of studies [reviewed by (Whitford, Reference Whitford2019)] in which patients with schizophrenia either exhibited a reduced attenuation of auditory neural evoked response while speaking compared to passively hearing their voice, or there was a dysfunctional interregional communication within the speech network (Ford, Mathalon, Whitfield, Faustman, & Roth, Reference Ford, Mathalon, Whitfield, Faustman and Roth2002; Ford et al., Reference Ford, Dierks, Fisher, Herrmann, Hubl, Kindler and Van Lutterveld2012; Friston & Frith, Reference Friston and Frith1995; Hoffman, Fernandez, Pittman, & Hampson, Reference Hoffman, Fernandez, Pittman and Hampson2011), both of which were hypothesized to facilitate erroneous feedforward signaling. More direct evidence for the self-monitoring account would consist of a study demonstrating that experimentally-induced self-monitoring perturbations of different degrees (i.e. stronger, weaker) can explicitly lead to different degrees of AVH (i.e. more, less).
The second major account suggests that AVH might be engendered by overly strong beliefs (i.e. priors) about the environment (Cassidy et al., Reference Cassidy, Balsam, Weinstein, Rosengard, Slifstein, Daw and Horga2018; Corlett et al., Reference Corlett, Horga, Fletcher, Alderson-day, Schmack and Iii2019; Powers et al., Reference Powers, Mathys and Corlett2017b; Zarkali et al., Reference Zarkali, Adams, Psarras, Leyland, Rees and Weil2019). It relies on the predictive coding framework that sees the brain as a hierarchical Bayesian system in which priors (at higher levels) and incoming sensory information (lower levels) are combined for perception (Friston, Reference Friston2008, Reference Friston2009). Crucially, precision-weighting of bottom-up (sensory) and top-down (priors) components modulates perception, whereby the component with higher precision dominates perception. Accordingly, hallucinations have been hypothesized to arise when priors carry undue precision, overruling the actual sensory evidence (Adams, Brown, & Friston, Reference Adams, Brown and Friston2015; Corlett et al., Reference Corlett, Horga, Fletcher, Alderson-day, Schmack and Iii2019; Fletcher & Frith, Reference Fletcher and Frith2009; Sterzer et al., Reference Sterzer, Adams, Fletcher, Frith, Lawrie, Muckli and Corlett2018). This view is supported by empirical data demonstrating that both clinical (Kot & Serper, Reference Kot and Serper2002) and non-clinical (Alderson-Day et al., Reference Alderson-Day, Lima, Evans, Krishnan, Shanmugalingam, Fernyhough and Scott2017) voice-hearers, as well as psychosis-prone individuals (Teufel et al., Reference Teufel, Subramaniam, Dobler, Perez, Finnemann, Mehta and Fletcher2015) favor prior knowledge over sensory information during perceptual inference. These data suggest that perceptual inference in hallucinating individuals is driven by prior beliefs. However, this work has not addressed the relationship of prior beliefs to the perception of voices specifically – i.e., it remains unclear which kinds of priors need to be over-weighted in order to experience AVH, a voice of an external origin. For instance, more direct evidence for the strong-priors account in AVH would consist of a study that experimentally induces a specific perceptual feature (e.g. sensations of otherness in the form of a presence hallucination) and observed a corresponding perceptual bias in AVH (e.g. increased AVH attributed to others).
Here, we developed a new method of inducing AVH in a controlled laboratory environment by integrating methods from voice perception with sensorimotor stimulation, allowing us to investigate the contribution of both major AVH accounts. We used a robotic procedure that can create impairments in self-monitoring as well as elicit the sensations that there is another (alien) person close by (i.e. presence hallucination, PH) (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Salomon et al., Reference Salomon, Progin, Griffa, Rognini, Do, Conus and Blanke2020; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021). Specifically, we designed a robotic setup that exposes participants to sensorimotor conflicts of various degree between repeated upper-limb poking movements and the corresponding tactile sensations on the back (Hara et al., Reference Hara, Rognini, Evans, Blanke, Yamamoto, Bleuler and Higuchi2011), linked to the misperception of the source and identity of sensorimotor signals of one's own body (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021, Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Salomon et al., Reference Salomon, Progin, Griffa, Rognini, Do, Conus and Blanke2020; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021). Thus, our robotic setup applies sensorimotor conflicts of various degree (i.e. self-monitoring perturbations) and induces a perceptual prior about the presence of an external non-existing agent. We combined this procedure with a voice detection paradigm and measured experimentally-induced AVH-like sensations as an increase in vocal false alarms. Namely, a false alarm in a voice detection task indicates that participants have heard a non-existing voice, rendering vocal false alarms a suitable proxy for lab-induced AVH, as was similarly done by others (Barkus et al., Reference Barkus, Smallman, Royle, Barkus, Lewis and Rushe2011; Chhabra et al., Reference Chhabra, Selvaraj, Sreeraj, Damodharan, Shivakumar, Kumar and Venkatasubramanian2022; Moseley et al., Reference Moseley, Alderson-Day, Common, Dodgson, Lee, Mitrenga and Fernyhough2022; Moseley, Fernyhough, & Ellison, Reference Moseley, Fernyhough and Ellison2014; Powers et al., Reference Powers, Mathys and Corlett2017b; Schmack, Bosc, Ott, Sturgill, & Kepecs, Reference Schmack, Bosc, Ott, Sturgill and Kepecs2021). In our two studies with two independent cohorts, our participants detected voices (either their own or someone else's) presented at individual hearing thresholds in pink noise, while simultaneously experiencing robotic sensorimotor stimulation. We hypothesized that conflicting sensorimotor stimulation leading to a PH (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021, Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014) would lead to an increase in vocal false alarms (i.e. reporting hearing voices in trials with no physical voice present in noise), compared to the stimulation with a weaker sensorimotor conflict, thereby relating our findings with the self-monitoring account. Moreover, we hypothesized that this increase would be modulated by the voice task they are involved in (other-voice detection vs. self-voice detection), being especially prominent when performing other-voice detection, thereby relating our findings with the strong-priors account. These findings would demonstrate empirical support for both seemingly opposing theoretical accounts.
Methods
Participants
We conducted two studies with the same general procedure and experimental design. Study 2 was set to replicate the effects observed in Study 1. Both studies involved 24 right-handed participants chosen from the general population, fluent in French and naïve to the purpose of the study. In Study 1, 17 participants were female (mean age ± s.d.: 25.0 ± 4.2 years old), whereas in Study 2, 13 were female (26.6 ± 5.3 years old). Sample size in both studies was similar to our previous work (Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021) and determined to match the number of all possible permutations of experimental conditions. No participants reported any history of psychiatric or neurological disorders as well as any hearing deficits. Participants gave informed consent in accordance with the institutional guidelines (protocol 2015-00092, approved by the Comité Cantonal d'Ethique de la Recherche of Geneva), and received monetary compensation (CHF 20/h).
Stimuli
Participants' voices were recorded (Zoom H6 Handy recorder) while saying nine one-syllable words in French (translated to English: nail, whip, ax, blade, fight, bone, rat, blood, saw, worm). The words were chosen from the list of 100 negatively-valenced words, as rated by 20 schizophrenic patients and 97 healthy participants (Jalenques, Enjolras, & Izaute, Reference Jalenques, Enjolras and Izaute2013). Negative words were purposefully chosen in our previous study (Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021), in order to better approximate the phenomenology of AVH, that are mostly negative in content (Woods et al., Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015). After the background noise was removed from the recordings, they were standardized for sound intensity (−12 dBFS) and duration (500 milliseconds) (Audacity software). The preprocessed recordings were used as self-voice stimuli in a voice detection task, which also contained other-voice stimuli – i.e. equivalent voice recordings of a gender-matched person unknown to the participant. Auditory stimuli were presented to participants through noise-cancelling headphones (Bose QC20). The experimental paradigm was created in MATLAB 2017b with Psychtoolbox library (Brainard, Reference Brainard1997; Kleiner et al., Reference Kleiner, Brainard, Pelli, Ingling, Murray, Broussard and Murray2007; Pelli, Reference Pelli1997).
Experimental procedure
Upon arrival, participants were screened for eligibility criteria, after which their voices were recorded. This was followed by two Sensorimotor blocks (synchronous and asynchronous), designed to assess illusory effects of sensorimotor stimulation. Sensorimotor blocks were followed by Staircase blocks (bottom-up and top-down), used to estimate individual hearing thresholds with a voice detection task. Finally, in four Task blocks (synchronous-self, synchronous-other, asynchronous-self, asynchronous-other) we assessed vocal false alarms by combining sensorimotor stimulation and voice detection task. At the end of the experiment, participants filled out the PDI questionnaire (Peters, Joseph, Day, & Qarety, Reference Peters, Joseph, Day and Qarety2004), that assesses delusional ideation present in the general population and has been related both to errors in self-monitoring (Teufel, Kingdon, Ingram, Wolpert, & Fletcher, Reference Teufel, Kingdon, Ingram, Wolpert and Fletcher2010) as well as excessive prior-weighting.
Sensorimotor blocks: assessment of illusory effects
Identical to our previous studies (Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Faivre et al., Reference Faivre, Vuillaume, Bernasconi, Salomon, Blanke and Cleeremans2020; Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021; Salomon et al., Reference Salomon, Progin, Griffa, Rognini, Do, Conus and Blanke2020; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021), during sensorimotor blocks participants manipulated a robotic system that consists of two integrated units: the front part – a commercial haptic interface (Phantom Omni, SensAble Technologies) – and the back part – a three degree-of-freedom robot (Hara et al., Reference Hara, Rognini, Evans, Blanke, Yamamoto, Bleuler and Higuchi2011) (Fig. 1). Blindfolded participants were seated between the front and back parts of the robot and were asked to perform repeated poking movements with their right index finger using the front part. Participants' pokes were replicated by the back part, thus applying corresponding touches on participants' backs. The touches were mediated by the robot either in synchronous (without delay) or in asynchronous (with 500 milliseconds delay) fashion, creating different degrees of sensorimotor conflict between the upper limb movement and somatosensory feedback on the back.
Following a two-minute-long sensorimotor stimulation (both synchronous and asynchronous), participants filled out a short questionnaire. Specifically, on a Likert scale from 0 (not at all) to 6 (very strong), after each block, participants rated the strength of illusory self-touch (‘I felt as if I was touching my back by myself’), somatic passivity (‘I felt as if someone else was touching my back’) and presence hallucination (‘I felt as if someone was standing close to me’). Questionnaire contained an additional control item (‘I felt as if I had three bodies.’). The order of the two blocks (synchronous and asynchronous) was counterbalanced across participants.
Staircase blocks: defining hearing thresholds
Participants' individual hearing thresholds were estimated with a voice detection task combined with a one-up-one-down staircase procedure (Cornsweet, Reference Cornsweet1962). During the task, participants were continuously hearing short bursts of pink noise and were instructed to report whether they heard a voice in the noise by pressing on a button after the noise offset. Each burst of noise lasted for 3.5 s and voice onset randomly occurred in a period between 0.5 and 2.5 s after the noise onset, ensuring a minimum of 0.5 s of noise before and after the presentation of a voice recording. Following participants' response in each trial (i.e. a button click after the noise offset), an inter-trial interval jittered between 1 and 1.5 s.
The staircase procedure employed only other-voice stimuli and consisted of two blocks, one starting from a high (top-down block) and another from a low (bottom-up block) sound intensity level, counterbalanced across participants. In both staircase blocks, each word was presented four times in a randomized order, resulting in 36 trials. Threshold in each block was computed as a mean value from the last 15 trials and the average of the two thresholds was considered as participants' hearing threshold. No differences in detectability between self-voice and other-voice stimuli, as well as between different words were assured in a pilot study.
Task blocks: combining a voice detection task with sensorimotor stimulation
During Task blocks, participants were performing the voice detection task while being exposed to sensorimotor stimulation (i.e. while they manipulated the robotic device). Task blocks differed based on the type of sensorimotor stimulation (synchronous, asynchronous), as well as of vocal stimuli (self, other). Thus, each participant completed four Task blocks (synchronous-self, synchronous-other, asynchronous-self, asynchronous-other) and had a unique order of blocks (i.e. we tested 24 participants to match 24 possible permutations of Task blocks). Task blocks started with 30 s of sensorimotor stimulation, followed by a concomitant voice detection task (Fig. 1). Throughout the auditory task, participants continued manipulating the robot and auditory stimuli were not time-locked to participants' movements. The voice detection task was identical to the task in Staircase blocks, with the addition of 18 trials that contained only noise (i.e. no-voice trials). No-voice trials were randomized together with 45 trials containing a voice (i.e. each word was presented five times within a block), resulting in 63 trials per block. An adaptive one-up-one-down staircase procedure was maintained throughout the block to ensure that the voices were presented at hearing threshold.
Statistical analysis
Statistical analysis and plotting were performed in R (R Core Team, 2020), using notably the lme4 (Bates, Mächler, Bolker, & Walker, Reference Bates, Mächler, Bolker and Walker2015), lmerTest (Kuznetsova, Brockhoff, & Christensen, Reference Kuznetsova, Brockhoff and Christensen2018), and afex (Singmann, Bolker, Westfall, & Aust, Reference Singmann, Bolker, Westfall and Aust2019) packages. The results were illustrated using sjplot (Lüdecke, Reference Lüdecke2018) and ggplot2 (Wickham, Reference Wickham2016) packages.
Vocal false alarms
Serving as a measure of experimentally-induced AVH, our primary research interest was to identify the effects of sensorimotor stimulation on vocal false alarm rate. Thus, on no-voice trials, we conducted a mixed-effects binomial regression with Response as dependent variable and Stimulation (synchronous, asynchronous), Voice (self, other) and Gender (male, female) as fixed effects, and participants as random effect. Voice and Stimulation were constant throughout the block. The Response-variable indicates whether participants detected or not a voice in the noise, thus in the no-voice trials it represents a false alarm (whereas for the trials containing a voice in the noise, it stands for a hit). An interaction term was added between the effects of Stimulation and Voice. The Gender effect was added to the regression because of previous reports of gender differences with respect to felt presences as well as AVH (Alderson-Day et al., Reference Alderson-Day, Moseley, Mitrenga, Moffatt, Lee, Foxwell and Fernyhough2022, Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021). Random effects included a by-participant random intercept. By-participant random slopes for the main effects were added following model selection based on maximum likelihood.
Questionnaire ratings
Ratings in questionnaire items were assessed by a mixed-effects linear regression containing a fixed effect of Stimulation (synchronous, asynchronous) and by-subject random intercepts. For the questionnaire items that significantly differed between the two sensorimotor stimulations (synchronous, asynchronous), we additionally explored whether the illusion assessed by the corresponding questionnaire item affected false alarm rate in the voice detection task. Specifically, to the mixed-effect binomial regression described above (with Response as a dependent variable) we added an additional fixed effect Questionnaire Item, with values represented as Likert-scale ratings (0–6) given for the corresponding questionnaire item and sensorimotor stimulation. The effect of Questionnaire Item was related to an interaction term with the effect of Condition.
Delusional ideation
Similar to questionnaire items, we explored the effects of delusional ideation on false alarm rate, by adding PDI score (Peters et al., Reference Peters, Joseph, Day and Qarety2004) as a covariate to the equivalent mixed-effect binomial regression, and forming a two-way interaction together with the effect of Condition.
Control analyses
Our primary outcome variable was false-alarm rate. However, in order to ensure that our experimental manipulation only affected no-voice trials (i.e. false alarms), we also conducted equivalent mixed-effects binomial regression analyses for the trials with voices present in noise (i.e. with hit rate as dependent variable). A lack of equivalent effects on hit rate would indicate that our experimental manipulation did not affect the detectability of the voices when they are actually present in noise, but that the effects are specific to reporting hearing non-existing voices in noise. Besides hit rate, our control variables were d’, i.e., task sensitivity (the distance between the midpoints of distributions of signal and signal with added noise; calculated as the standardized false-alarm rate subtracted from the standardized hit rate) and criterion, i.e., response bias (the number of standard deviations from the midpoint between these two distributions; calculated as the mean of the standardized hit rate and standardized false alarm rate). D’ and criterion were assessed with a two-way ANOVA containing Stimulation (synchronous, asynchronous) and Voice (self, other) as fixed effects with an interaction term.
Results
Vocal false alarms
In both studies, we investigated the effects of sensorimotor stimulation (synchronous, asynchronous) and type of voice stimuli (self, other) on the rate of induced vocal false alarms in the voice detection task.
Study 1
In Study 1, a mixed-effects binomial regression revealed a main effect of Stimulation (estimate = −0.58, Z = −2.12, p = 0.034), indicating a higher false alarm rate during asynchronous [mean = 0.15, 95% CI (0.13–0.17)], compared to synchronous stimulation [mean = 0.13, 95% CI (0.11, 0.16)]. Critically, there was a significant interaction between Voice and Stimulation (estimate = 1.09, Z = 2.72, p = 0.007) (Fig. 2a). Post-hoc analyses of the interaction indicated that during the blocks containing other-voice stimuli, false alarm rate was higher with asynchronous stimulation [estimate = −0.56, Z = −2.1, p = 0.036; mean = 0.16, 95% CI (0.13–0.19), synchronous: mean = 0.12, 95% CI (0.09–0.15)]. During self-voice blocks, there was a tendency for the opposite effect: an increase in false alarm rate with synchronous stimulation (estimate = 0.52, Z = 1.77, p = 0.077; mean = 0.16, 95% CI (0.13–0.19), asynchronous: mean = 0.14, 95% CI (0.11–0.17)] (Fig. 2a). There was no significant main effect of Voice (estimate = −0.87, Z = −1.83, p = 0.068). This shows that (1) there were more vocal false alarms during the condition with higher sensorimotor conflict, and (2) this was more prominent in the blocks containing other-voice stimuli.
Study 2
In Study 2, we replicated this interaction effect between Stimulation and Voice (estimate = 1.14, Z = 2.97, p = 0.003), again revealing that in other-voice blocks false alarms increased with asynchronous [estimate = −0.64, Z = −2.15, p = 0.031, mean = 0.22, 95% CI (0.19–0.25), synchronous: mean = 0.19, 95% CI (0.16–0.22)], whereas in self-voice blocks the opposite effect occurred – more false alarms in synchronous stimulation [estimate = 0.53, Z = 1.98, p = 0.048; mean = 0.26, 95% CI (0.22–0.3), asynchronous: mean = 0.23, 95% CI (0.19–0.27)] (Fig. 2b). Also, there were, again, more false alarms during asynchronous stimulation [main effect of Stimulation; estimate = −0.63, Z = −2.21, p = 0.027; mean = 0.23, 95% CI (0.2–0.23), synchronous: mean = 0.23, 95% CI (0.2–0.23)]. There were no differences in false alarms between the two voices (estimate = 0.03, Z = 0.14, p = 0.887). Neither study had a significant effect of Gender (all p > 0.05, online Supplementary material).
In order to assess possible differences in false alarm rate across studies, we ran equivalent mixed-effects regression by pooling all participants from both studies (N = 48) and added an additional factor Study (with levels 1 and 2). The effects observed in separate studies were replicated (online Supplementary Material) and there were no significant effects of Study (estimate = 0.52, Z = 0.56, p = 0.576). This suggests that false alarm rate was not significantly different across studies.
Together, the results of Study 2 show that the same effects of sensorimotor stimulation on vocal false alarms were replicated in an independent cohort of participants.
Delusional ideation
We also investigated the potential relationship between the observed increase in vocal false alarms and delusional ideation (measured by the PDI questionnaire (Peters et al., Reference Peters, Joseph, Day and Qarety2004)). In both independent subject samples that we tested (Study 1 and 2), binomial mixed-effect regression of responses in non-voice trials revealed a significant main effect of PDI score (Study 1: estimate = 0.2, Z = 2.17, p = 0.03; Study 2: estimate = 0.33, Z = 2.15, p = 0.032), indicating that the higher participants scored on delusional ideation inventory, the more false alarms they made during the voice detection task. In both studies, there was a tendency for a significant interaction between the effects of Stimulation and PDI (Study 1: estimate = −0.2, Z = −1.9, p = 0.057; Study 2: estimate = −0.1, Z = −1.9, p = 0.058), indicating that this relationship was stronger during asynchronous stimulation (Fig. 3). We also divided the PDI scores into 3 subcategories – distress, preoccupation, and conviction (Peters et al., Reference Peters, Joseph, Day and Qarety2004; Schmack et al., Reference Schmack, de Castro, Rothkirch, Sekutowicz, Rössler, Haynes and Sterzer2013) – and ran equivalent mixed effect analyses. The only consistent result across the two studies was an interaction between Stimulation and Conviction score (Study 1: estimate = −0.08, Z = −2.53, p = 0.011; Study 2: estimate = −0.04, Z = −2.17, p = 0.029), indicating a stronger relationship between Conviction and false alarms during Asynchronous stimulation. The results for other subcategories are reported in the online Supplementary material.
Together, these data show that increased false alarms are related to delusional ideation, suggesting a presence of perceptual priors underlying the effects on robotically-induced false alarms.
Questionnaire ratings
In Study 1, presence hallucination was, as expected, experienced more during asynchronous stimulation [estimate = −0.5, t(24) = −2.68, p = 0.013]. In Study 2, we observed a tendency for the same effect [estimate = −0.67, t(24) = −1.92, p = 0.067]. Somatic passivity was also rated higher during asynchronous stimulation in Study 1 [estimate = −1.08, t(24) = −3.68, p = 0.001], with a tendency in Study 2: [estimate = −0.58, t(24) = −1.81, p = 0.083]. Self-touch ratings were higher during synchronous compared to asynchronous stimulation in both studies [Study 1: estimate = 0.79, t(24) = 2.21, p = 0.037; Study 2: estimate = 0.83, t(24) = 3.46, p = 0.002]. The control questionnaire item was unaffected by sensorimotor stimulation [Study 1: estimate = −0.04, t(24) = −0.58, p = 0.566; Study 2: estimate = 0.13, t(24) = 1.39, p = 0.176]. There were no significant effects of gender (all p > 0.05). Means and standard deviations of all questions for both studies are reported in online Supplementary material. Correlation between questionnaire ratings and false alarm rates indicated results that were inconsistent between the two studies and are thus reported in online Supplementary material.
Together, this shows that in both studies, our robotic stimulation induced the expected experiential consequences (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014) – i.e., higher PH and somatic passivity with asynchronous stimulation.
Control analyses
As control analyses, we investigated the effects on Stimulation and Voice on hit rate (i.e. responses in trials where there was voice present in noise), as well as on d’ and criterion. In Study 1, binomial mixed-effects regressions of responses in voice-trials revealed a tendency for a main effect of Stimulation (estimate = −0.23, Z = −1.94, p = 0.052), underlying a higher hit rate during asynchronous [mean = 0.55, 95% CI (0.53–0.57)] than synchronous stimulation [mean = 0.53, 95% CI (0.51–0.55)]. Hit rates were unaffected by Voice (estimate = 0, Z = 0.03, p = 0.979) and Gender (estimate = −0.03, Z = −0.11, p = 0.91). The interaction between Voice and Stimulation indicated a tendency toward significance (estimate = 0.23, Z = 1.81, p = 0.071). In Study 2, none of the tendencies from Study 1 proved significant. There was no main effect of Stimulation (estimate = −0.08, Z = −0.76, p = 0.446), nor it interacted with Voice (estimate = 0.12, Z = 0.9, p = 0.367). There were no significant effects of Gender (estimate = −0.7, Z = −1.73, p = 0.084) nor Voice (estimate = −0.09, Z = −0.81, p = 0.416).
In neither study did we observe significant effects on d’ and criterion (online Supplementary material).
The control analyses show that the detectability of the voices, when they are indeed present in noise (i.e. hit rate), is affected neither by Stimulation nor type of detected voice, and that the aforementioned effects are specific to trials in which there is no voice present in noise (i.e. on false alarms).
Time evolution of false alarms
To investigate whether the occurrence of false alarms changed as a function of time, we ran equivalent binomial mixed-effects regressions with an additional factor Trial (with values 1–63, indicating a trial within a block) together with a three-way interaction between Trial, Stimulation, and Voice. As models for individual studies did not converge, we merged the data from both studies together, and added an additional factor Study (with levels 1 and 2) to the model. The model revealed a main effect of Trial (estimate = 0.02, Z = 2.09, p = 0.037), indicating the increase of false alarms with time. The model also revealed an interaction between Trial and Stimulation (estimate = −0.02, Z = −2.13, p = 0.033). Running the model separately for the two levels of Stimulation (synchronous, asynchronous) indicated that false alarm rate increased in time during asynchronous stimulation (estimate = 0.02, Z = 2.34, p = 0.019, online Supplementary Fig. S7), with no significant effects of Trial during synchronous stimulation (estimate = −0.01, Z = −0.82, p = 0.412). Interaction between Trial and Voice was not significant (estimate = 0, Z = −0.47, p = 0.638), nor was the three-way interaction between Trial, Stimulation, and Voice (estimate = 0.02, Z = 1.25, p = 0.21). The main effect of Study was not significant (estimate = 0.5, Z = 0.57, p = 0.566). Equivalent model with hit rate as dependent variable did not reveal any significant effects (online Supplementary material).
These results show that false alarms, but not hits, were more likely to occur toward the end of the experimental blocks, especially during asynchronous stimulation.
Discussion
In two independent cohorts of healthy participants, we employed specific sensorimotor robotic stimulation to induce AVH-like phenomena, as indicated by an increase in the false alarm rate while participants were engaged in a voice detection task. Critically, the presence of AVH-like phenomena was additionally modulated by the type of sensorimotor stimulation. Thus, the asynchronous sensorimotor condition (related to other-agent sensations: PH and somatic passivity) induced overall more AVH-like phenomena and this effect was especially observed in experimental blocks containing other-voice stimuli. Finally, the rate of AVH-like phenomena was positively related to delusional ideation and this correlation was stronger for vocal false alarms during the asynchronous PH-inducing condition.
Hallucination engineering
Extending our sensorimotor procedure that has been shown to induce PH in healthy subjects (Bernasconi et al., Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Dhanis et al., Reference Dhanis, Blondiaux, Bolton, Faivre, Rognini, van de Ville and Blanke2022; Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021) and patients with Parkinson's disease (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021), we here demonstrate a new experimental paradigm able to induce controlled AVH-like phenomena (manifested as specific false alarms) in healthy, non-hallucinating individuals. Previous methods of inducing hallucinations in healthy individuals – such as through psychedelic medications (Preller & Vollenweider, Reference Preller and Vollenweider2018) or by automatized visual stimulations (e.g. Flicker-induced (Allefeld et al., Reference Allefeld, Pütz, Kastner and Wackermann2011; Pearson et al., Reference Pearson, Chiou, Rogers, Wicken, Heitmann and Ermentrout2016)) – have identified many important challenges present in hallucination engineering. These are, for instance, difficulties to repeatedly induce a hallucination within a given and short period of time, investigating the hallucination of a given participant in real time, and quantifying hallucinations with objective measures (as opposed to measures such as verbal self-reports) – rendering them prone to participant and experimenter biases (Adler, Reference Adler1973; Rosenthal & Fode, Reference Rosenthal and Fode1963). Our AVH-inducing paradigm partly address these challenges – e.g., we were able to repeatedly elicit a perception of non-existing voices in short experimental blocks, which were quantified objectively (through a false alarm rate) and in real time. Moreover, we elicited hallucinations in healthy participants, thereby controlling for confounds related to disease, in which hallucinations typically occur. Finally, our approach on AVH-like experiences allowed us to compare our main findings with the two most prominent views about AVH – the self-monitoring account and the strong-priors account, providing evidence for the clinical relevance of both theoretical accounts.
Self-monitoring
According to the self-monitoring account, AVH result from aberrant predictive mechanisms related to motor actions and the corresponding sensory feedback (Miall & Wolpert, Reference Miall and Wolpert1996; Shadmehr et al., Reference Shadmehr, Smith and Krakauer2010; Wolpert et al., Reference Wolpert, Ghahramani and Jordan1995). Evidence from previous studies arguing for the self-monitoring account is largely implicit, and mainly links AVH to either (1) differences in the amplitude of auditory evoked responses between self- and externally-generated actions or (2) differences in interregional communication within the speech network (Ford et al., Reference Ford, Dierks, Fisher, Herrmann, Hubl, Kindler and Van Lutterveld2012, Reference Ford, Mathalon, Whitfield, Faustman and Roth2002; Friston & Frith, Reference Friston and Frith1995; Hoffman et al., Reference Hoffman, Fernandez, Pittman and Hampson2011; Whitford, Reference Whitford2019). Here, however, we explicitly manipulate self-monitoring with our robotic procedure, and, as a consequence, observe differences in the magnitude of experimentally-evoked AVH-like sensations (i.e. vocal false alarms). Specifically, with our robotic procedure, we induced two levels of self-monitoring perturbation – with stronger (asynchronous) and weaker (synchronous) sensorimotor conflicts – and observed a differential effect on AVH-like sensations (i.e. more false alarms with stronger sensorimotor conflicts). This shows that healthy individuals are more likely to hear non-existing voices in auditory noise when simultaneously experiencing self-monitoring perturbations, which suggests that similar mechanisms might occur in the brain of voice hearers.
It is important to note, however, that AVH as quantified in the present experiments arose not as an impairment in speech-related sensory predictions in voice perception, but as an alteration of tactile, proprioceptive, and motor processes involved in self-monitoring (related to the repetitive arm movements and their corresponding tactile feedback on the back), which was sufficient to alter voice perception, as previously shown for the clinically related phenomenon of thought insertion (Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021). This extends the current view of the self-monitoring account for AVH to a more global deficit in self-monitoring (Blanke, Reference Blanke2012; Blanke, Slater, & Serino, Reference Blanke, Slater and Serino2015; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021), that has until now been almost exclusively focused on predictions related to speech-related signals (Ford & Mathalon, Reference Ford and Mathalon2019; Whitford, Reference Whitford2019). Moreover, by emphasizing the multimodal implications of sensorimotor conflicts, these data support the original comparator models of schizophrenia (Frith, Reference Frith1992; Frith & Done, Reference Frith and Done1988) that were not speech-specific and incorporated phenomena such as passivity experiences (Frith, Reference Frith1987; Frith & Done, Reference Frith and Done1989). By demonstrating that errors in the global self-monitoring system affect auditory-verbal perception, these findings bridge the empirical gap between these globally-oriented early theories and the auditory-oriented empirical perspective that has been prevailing in the recent years (Ford & Mathalon, Reference Ford and Mathalon2019; Ford et al., Reference Ford, Mathalon, Whitfield, Faustman and Roth2002; Reference Ford, Roach, Faustman and Mathalon2007; Heinks-Maldonado et al., Reference Heinks-Maldonado, Mathalon, Gray and Ford2005; Whitford, Reference Whitford2019).
Collectively, these findings suggest that errors in self-monitoring, induced by conflicting sensorimotor stimulations involving arm and trunk signals, and even if not involving an experimental manipulation of auditory-verbal signals, are sufficient to exert a specific effect on auditory-verbal perception, reflected as a proneness to hear non-existing voices in noise.
Strong perceptual priors
Concerning the strong-priors account, AVH are proposed to arise when strong beliefs (i.e. priors) exert a top-down effect on perception (Cassidy et al., Reference Cassidy, Balsam, Weinstein, Rosengard, Slifstein, Daw and Horga2018; Corlett et al., Reference Corlett, Horga, Fletcher, Alderson-day, Schmack and Iii2019; Powers et al., Reference Powers, Mathys and Corlett2017b; Zarkali et al., Reference Zarkali, Adams, Psarras, Leyland, Rees and Weil2019). Evidence from previous studies arguing for the strong-priors account mainly links AVH to favoring prior knowledge over sensory information during perceptual inference (Alderson-Day et al., Reference Alderson-Day, Lima, Evans, Krishnan, Shanmugalingam, Fernyhough and Scott2017; Kot & Serper, Reference Kot and Serper2002; Teufel et al., Reference Teufel, Subramaniam, Dobler, Perez, Finnemann, Mehta and Fletcher2015). In a classical experimental scenario, researchers experimentally manipulate trial-by-trial multisensory (e.g. visuo-auditory) contingencies (Davies, Davies, & Bennett, Reference Davies, Davies and Bennett1982; Ellson, Reference Ellson1941; Powers et al., Reference Powers, Mathys and Corlett2017b) to engender stronger priors about the occurrence of the hallucinated auditory stimulus (e.g. a false alarm in a tone-detection task). However, these studies did not investigate which type of a perceptual prior can lead to which type of a hallucinated stimulus, especially with respect to vocal stimuli.
Here, we extend this work by showing that different auditory-perceptual priors induced through our robotic stimulation can affect the type of AVH-like sensations (i.e. self-voice false alarm vs. other-voice false alarm). Specifically, during asynchronous stimulation, that is related to otherness-related sensations (i.e. PH and somatic passivity) (Bernasconi et al., Reference Bernasconi, Blondiaux, Potheegadoo, Stripeikyte, Pagonabarraga, Bejr-kasem and Blanke2021; Blanke et al., Reference Blanke, Pozeg, Hara, Heydrich, Serino, Yamamoto and Rognini2014; Serino et al., Reference Serino, Pozeg, Bernasconi, Solcà, Hara, Progin and Blanke2021), there were more false alarms in other-voice blocks, compared to self-voice blocks. Thus, while exposed to asynchronous PH-inducing stimulation, participants reported hearing a non-existing voice in noise more often in blocks where noise stimuli (without any voice) were mixed with other-voice stimuli, compared to self-voice stimuli. Accordingly, we propose that our robotic procedure induces a perceptual prior (otherness sensations) that, in turn, exerts a specific top-down effect on auditory-verbal perception leading to false alarms of another voice. As a control, during synchronous stimulation, that is associated with self-touch sensations, we observed more false alarms in self-voice blocks.
It is important to note that the present effect (increase in false alarms) were related to sensorimotor stimulation per se (i.e. robotic stimulation), and not to the subjective experience that is associated with the stimulation (i.e. strength of PH measured with questionnaires). More specifically, the same sensorimotor stimulation that has been associated with PH was here associated with other-voice false alarms. The strength of PH itself, however, was not related to false alarms. The reason for this might be twofold. For one, we measured the strength of PH only at the beginning of the experiment, in blocks in which participants did not perform the auditory task, but were only exposed to sensorimotor stimulation. False alarms were assessed later in the experiment, in blocks in which participants were both experiencing the stimulation and performed the task. As in those blocks, we did not administer questionnaires, we do not know what participants were subjectively experiencing when false alarms occurred. It is possible that the strength of the subjective experience of PH was changing during the experiment. Our results can only show that the same type of sensorimotor stimulation is both associated with PH and with an increase of other-voice false alarms. For another, it is possible that sensorimotor stimulation exerts some effects on auditory perception which are not necessarily captured by questionnaire ratings, but are captured by psychophysical tasks (for further discussion of this point see (Orepic et al., Reference Orepic, Rognini, Kannape, Faivre and Blanke2021).
Proposed integration of the two accounts
AVH-like sensations induced through our paradigm might relate the self-monitoring and strong-priors accounts. Namely, in our paradigm, there might be two perceptual priors at play – an auditory-verbal prior and a sensorimotor prior (coming from the robotic stimulation). The auditory-verbal prior consists of the fact that repeatedly hearing a voice with a specific identity (self or other) throughout our experimental blocks creates an expectation about the identity of the voices to follow – i.e., if one continuously hears consecutive other-voice stimuli, one might expect to hear other-voice again in the near future. Concerning the sensorimotor prior, it has been proposed that there might be different intersecting hierarchies in the brain – related to self-monitoring, self-related priors and other-related priors – and that errors in self-monitoring are explained away by changing precision either of self-related or other-related priors (Corlett et al., Reference Corlett, Horga, Fletcher, Alderson-day, Schmack and Iii2019; Leptourgos & Corlett, Reference Leptourgos and Corlett2020). Applied to our data, self-monitoring perturbations coming from asynchronous stimulation might thus be explained away by increasing the precision of other-related priors (e.g. perceived as PH). Crucially, the directionality of the imposed ‘voice prior’ (self or other) might hence interfere with the prior stemming from the concomitant sensorimotor stimulation, moreover in a complementary fashion. Thus, other-voice prior combined with other-related sensorimotor prior might lead to an increase in other-voice false alarms. This proposed mechanism is illustrated in Fig. 4.
This proposal is additionally corroborated by our finding that false alarms occurred more often toward the end of experimental blocks. In Hemsley's original formulation (Hemsley, Reference Hemsley1993), the concept of priors was defined as ‘stored memories of regularities of previous input on current perception’. Increased occurrence of false alarms over time could therefore suggest that they result as a consequence of priors (i.e. stored regularities) that develop over time in the way we described above: auditory-verbal priors derived from trial history combined with a continuous disruption in self-monitoring delivered through robotic stimulation. However, based on these findings, we are not able to make strong claims about the time evolution of priors, because it was not explicitly manipulated in our paradigm. Typically, studies investigating the role of priors on perception explicitly manipulate the development of priors over time, e.g. by changing the likelihood of stimulus occurrence within experimental blocks (e.g. [Powers et al., Reference Powers, Mathys and Corlett2017b; Schmack et al., Reference Schmack, de Castro, Rothkirch, Sekutowicz, Rössler, Haynes and Sterzer2013)]. In our paradigm, we employed an adaptive staircase procedure (one-up one-down), maintaining participants' detectability threshold at a 50% rate. Future work could address the compatibility with the strong-priors account more directly by adapting the experimental paradigm in a Bayesian fashion.
Delusional ideation
Relationship between robot-induced effects and beliefs is further corroborated in our data by relating the increase of false alarms with the PDI score that reflects delusion proneness. Namely, delusional ideation has been related both to the endorsement of hallucinations (Varghese et al., Reference Varghese, Scott and Mcgrath2008) and to excessive prior-weighting (McLean et al., Reference Mclean, Balzan and Mattiske2020; Schmack et al., Reference Schmack, Schnack, Priller and Sterzer2015; Teufel et al., Reference Teufel, Subramaniam, Dobler, Perez, Finnemann, Mehta and Fletcher2015) in the general population. This suggests that PDI might measure a trait of overly strong perceptual priors that impose top-down effects on perception (Adams et al., Reference Adams, Brown and Friston2015; Fletcher & Frith, Reference Fletcher and Frith2009; Sterzer et al., Reference Sterzer, Adams, Fletcher, Frith, Lawrie, Muckli and Corlett2018), in addition to the top-down effect (state) induced by sensorimotor stimulation. Moreover, from the three subscales of delusional ideation – distress, conviction, and preoccupation – a consistent relationship to false alarms similar to the general PDI score was observed only for the conviction scale. Conviction subscale was previously related to reduced perceptual stability and a stronger belief-induced bias on perception, as well as stronger functional connectivity between frontal areas encoding beliefs and sensory areas encoding perception (Schmack et al., Reference Schmack, de Castro, Rothkirch, Sekutowicz, Rössler, Haynes and Sterzer2013), suggesting that similar effects might be at play here.
We administered the PDI questionnaire to our participants mainly because our primary aim was to investigate whether our lab-induced hallucinations relates to delusional ideation – a phenomenon other than hallucinations. However, future work should also investigate the relationship between robotically-induced hallucinations and measures of hallucination proneness, such as CAPS (Bell, Halligan, & Ellis, Reference Bell, Halligan and Ellis2006), LSHS (Waters, Badcock, & Maybery, Reference Waters, Badcock and Maybery2003), and MUSEQ (Mitchell et al., Reference Mitchell, Maybery, Russell-Smith, Collerton, Gignac and Waters2017). This might provide additional insights into the nature of robotically-induced AVH (e.g. see (Teufel et al., Reference Teufel, Subramaniam, Dobler, Perez, Finnemann, Mehta and Fletcher2015)).
False alarms as AVH-like sensations
Our findings further support previous studies that argue for the use of auditory false alarms as a proxy for hallucinations. In a large-scale multisite study (Moseley et al., Reference Moseley, Aleman, Allen, Bell, Bless, Bortolon and Fernyhough2021), hallucinatory experiences in more than a thousand participants were associated with false alarms in a signal detection task, as opposed to other commonly used hallucination-related tasks (e.g. source memory or dichotic listening). Similarly, in a recent review, signal and voice detection tasks were found to be the most robust among the five most common voice-hearing induction paradigms (Anderson, Hartley, & Bucci, Reference Anderson, Hartley and Bucci2021). Our paradigm replicates and extends previous signal-detection paradigms by showing effects specific to other-voice stimuli (false alarms), as opposed to tones and self-voice stimuli.
Importantly, in our studies, auditory detectability between experimental stimuli (self and other voice) and sensorimotor stimulations (synchronous and asynchronous) cannot account for present effects, as sensorimotor stimulation did not affect hit rate nor d’ in the voice detection task. The lack of differences in signal-detection measures other than false alarm rate narrows our effect down to participants perceiving non-existing stimuli. The lack of any d’ differences suggests that the present effects are not due to a difference in discriminability across conditions. Specifically, when there were voices present in the noise, participants were able to detect them equally well, regardless of the concomitant sensorimotor stimulation. Differences in sensorimotor stimulation only affected the performance in trials with no voices present in the noise.
Even though we and others have related false alarms to hallucinations, it is important to note that experiencing false alarms is not similar to experiencing actual AVH. AVH are complex and heterogeneous experiences, characterized by many phenomenological differences across voice hearers, such as the perceived number of voices and even their personality traits (McCarthy-Jones et al., Reference McCarthy-Jones, Trauer, MacKinnon, Sims, Thomas and Copolov2014; Woods et al., Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015). In our view, the present AVH-like phenomena are still an elegant proxy for AVH, because a false alarm in a voice detection task represents a scenario in which a person reports hearing a voice that does not exist. Future work should extend these paradigms toward voice hearers in the general population and address other aspects of AVH phenomenology, e.g. by introducing biases through personalized avatars (Craig et al., Reference Craig, Rus-Calafell, Ward, Leff, Huckvale, Howarth and Garety2018).
Conclusion
In conclusion, here we demonstrate a sensorimotor-robotic procedure and method (Bernasconi et al., Reference Bernasconi, Blondiaux, Rognini, Dhanis, Jenni, Potheegadoo and Blanke2022) that is able to induce AVH-like sensations in healthy individuals and in a fully controlled laboratory environment. Specifically, we show that different types of sensorimotor stimulation can selectively induce vocal false percepts and that stimulations that induce sensations related to otherness and an alien agent led to a higher number of other-voice false alarms, an effect that we further related with delusion proneness. Besides the novelty and the important methodological impact, these results shed new light on AVH phenomenology, providing experimental support for both prominent albeit seemingly opposing accounts – portraying AVH as a hybrid between deficits in self-monitoring and hyper-precise priors.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S0033291723002222.
Author's contributions
Study concept and design: PO, NF, OB. Acquisition of data: PO, MF. Analysis and interpretation of data: PO, FB, OB. Drafting of the manuscript: PO, FB, OB. Critical revision of the manuscript for important intellectual content: All authors. Statistical analysis: PO, FB. Obtained funding: OB. Administrative, technical, or material support: All authors. Study supervision: OB.
Competing interest
O.B. is an inventor on patent US 10 286 555 B2 (Title: Robot-controlled induction of the feeling of a presence) held by the École Polytechnique Fédérale de Lausanne (EPFL) that covers the robot-controlled induction of the presence hallucination (PH). O.B. is an inventor on patent US 10 349 899 B2 (Title: System and method for predicting hallucinations) held by the Ecole Polytechnique Fédérale de Lausanne (EPFL) that covers a robotic system for the prediction of hallucinations for diagnostic and therapeutic purposes. O.B. is a cofounder and a shareholder of Metaphysiks Engineering SA, a company that develops immersive technologies, including applications of the robotic induction of PHs that are not related to the diagnosis, prognosis, or treatment in medicine. O.B. is a member of the board and a shareholder of Mindmaze SA.