The role of characterisation in everyday voice engagement and AVATAR therapy dialogue

Thomas Ward; Rachel Lister; Miriam Fornells-Ambrojo; Mar Rus-Calafell; Clementine J. Edwards; Conan O'Brien; Tom KJ Craig; Philippa Garety

doi:10.1017/S0033291721000659

The role of characterisation in everyday voice engagement and AVATAR therapy dialogue

Published online by Cambridge University Press: 08 April 2021

Thomas Ward

Rachel Lister ,

Miriam Fornells-Ambrojo ,

Mar Rus-Calafell ,

Clementine J. Edwards ,

Conan O'Brien ,

Tom KJ Craig and

Philippa Garety

Show author details

Thomas Ward*: Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
Rachel Lister: Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
Miriam Fornells-Ambrojo: Affiliation:
Research Department of Clinical, Educational and Health Psychology, University College London, London, UK
Mar Rus-Calafell: Affiliation:
Mental Health Research and Treatment Center, Faculty of Psychology, Ruhr-Universität Bochum, Bochum, Germany Department of Health Service and Population Research, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
Clementine J. Edwards: Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
Conan O'Brien: Affiliation:
Research Department of Clinical, Educational and Health Psychology, University College London, London, UK
Tom KJ Craig: Affiliation:
Department of Health Service and Population Research, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
Philippa Garety: Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
*: Author for correspondence: Thomas Ward, E-mail: [email protected]

Article contents

Abstract
Background
Methods
Results
Discussion
Introduction
Analysis
Results
Discussion
Conclusion
Declaration of interests
Data sharing
Footnotes
References

Rights & Permissions

Abstract

Background

Voices are commonly experienced as communication with a personified ‘other’ with ascribed attitudes, intentionality and personality (their own ‘character’). Phenomenological work exploring voice characterisation informs a new wave of relational therapies. To date, no study has investigated the role of characterisation in behavioural engagement with voices or within psychological therapy for distressing voices.

Methods

Baseline characterisation (the degree to which the voice is an identifiable and characterful entity) of the dominant voice was rated (high, medium or low) using a newly developed coding framework, for n = 60 people prior to starting AVATAR therapy. Associations between degree of characterisation and (i) everyday behavioural engagement with voices (The Beliefs about Voices Questionnaire-Revised; n = 60); and (ii) interaction within avatar dialogue [Session 4 Time in Conversation (participant–avatar); n = 45 therapy completers] were explored.

Results

Thirty-three per cent reported high voice characterisation, 42% medium and 25% low. There was a significant association between characterisation and behavioural engagement [H(2) = 7.65, p = 0.022, ɛ2 = 0.130] and duration of participant–avatar conversation [F(2,42) = 6.483, p = 0.004, η2 = 0.236]. High characterisation was associated with increased behavioural engagement compared with medium (p = 0.004, r = 0.34; moderate effect) and low (p = 0.027, r = 0.25; small−moderate effect) with a similar pattern observed for the avatar dialogue [high v. medium: p = 0.008, Hedges’ g = 1.02 (large effect); high v. low: p = 0.023, Hedges' g = 1.03 (large effect)]. No differences were observed between medium and low characterisation.

Discussion

Complex voice characterisation is associated with how individuals interact with their voice(s) in and out of therapy. Clinical implications and future directions for AVATAR therapy and other relational therapies are discussed.

Keywords

Auditory hallucinations digital health personification relational therapy voices

Type: Original Article
Information: Psychological Medicine , Volume 52 , Issue 16 , December 2022 , pp. 3846 - 3853

DOI: https://doi.org/10.1017/S0033291721000659 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re- use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2021. Published by Cambridge University Press

Introduction

The way I always see it is, he reminds me of a tradesman that's done some kind of office work for 30 years… And he's got a lot of…stories to tell you about it. …. he's lived the lifestyle that…that he'd be happy with. He knows […] I need a proper conversation with him to find direction in life and he ignores that. [John (35)]

Influential models of auditory verbal hallucinations (henceforth voices) outline how internally generated stimuli (often related to inner speech and/or memory) come to be ‘misattributed’ to an external source as a result of biopsychosocial factors including difficulties in self- and source-monitoring (David, Reference David2004; Waters et al., Reference Waters, Allen, Aleman, Fernyhough, Woodward, Badcock and Menon2012). Some authors have proposed moving away from an individualistic focus on ‘aberrant’ auditory perception to consider voices as experiences of perceived communication with an ‘other’ (Deamer & Wilkinson, Reference Deamer and Wilkinson2015; Wilkinson & Bell, Reference Wilkinson and Bell2016). This includes growing interest in the role of broader social-cognitive processes to explain why voices, like John's, are not simply ‘alien’ to the self, but experienced as specific, characterised social agents (Alderson-Day & Fernyhough, Reference Alderson-Day and Fernyhough2016; Bell, Reference Bell2013; Wilkinson & Bell, Reference Wilkinson and Bell2016).

According to phenomenological studies, around 70% of voices are reported as ‘characterful’ (Woods, Jones, Alderson-Day, Callard, & Fernyhough, Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015), i.e. associated with specific character traits, including experiences, attitudes, animacy and beliefs. A recent study (Alderson-Day et al., Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021) found evidence of ‘complex personification’ of voices in a significant minority (40%) of a sample of people attending early intervention for psychosis services (n = 60). While voice severity scores were very similar in groups with and without complex personification, there was an association between complex personification and experiencing voices as conversational and companionable (Alderson-Day et al., Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021). Beavan (Reference Beavan2011) identifies the experience of forming a relationship with a ‘characterised’ identity as an essential aspect of the experience, and proposes that voices can be important signifiers of a person's life history and relationship to their (social) world (see also Birchwood et al., Reference Birchwood, Gilbert, Gilbert, Trower, Meaden, Hay and Miles2004; Corstens, Longden, McCarthy-Jones, Waddingham, & Thomas, Reference Corstens, Longden, McCarthy-Jones, Waddingham and Thomas2014; Hayward, Berry, & Ashton, Reference Hayward, Berry and Ashton2011; Longden, Corstens, Escher, & Romme, Reference Longden, Corstens, Escher and Romme2012; Romme, Escher, Dillon, Corstens, & Morris, Reference Romme, Escher, Dillon, Corstens and Morris2009).

This reframing of what it means to ‘hear a voice’ brings potentially significant implications for psychological approaches to distressing voices. Birchwood and colleagues (Birchwood et al., Reference Birchwood, Gilbert, Gilbert, Trower, Meaden, Hay and Miles2004; Birchwood, Meaden, Trower, Gilbert, & Plaistow, Reference Birchwood, Meaden, Trower, Gilbert and Plaistow2000) have integrated their seminal cognitive model (Birchwood & Chadwick, Reference Birchwood and Chadwick1997; Chadwick & Birchwood, Reference Chadwick and Birchwood1994) with a ‘social mentalities’ approach which posits that humans have evolved mechanisms for recognising dominant–subordinate interactions, i.e. their social rank (Gilbert et al., Reference Gilbert, Birchwood, Gilbert, Trower, Hay, Murray and Miles2001; Gilbert & Allan, Reference Gilbert and Allan1994). Early experiences of powerlessness and inferiority within social relationships may establish social schemata that drive the subsequent appraisals of voices, and ultimately lead to significant levels of distress and depression (Birchwood et al., Reference Birchwood, Gilbert, Gilbert, Trower, Meaden, Hay and Miles2004). Systematic reviews support the view that social schemata may mediate the cognitive appraisal–distress relationship with the implication that therapies could benefit from targeting social and interpersonal variables (Mawson, Cohen, & Berry, Reference Mawson, Cohen and Berry2010; Paulik, Reference Paulik2012). These theoretical developments have informed a specific cognitive therapy for command hallucinations, which in a randomised controlled trial (Birchwood et al., Reference Birchwood, Michail, Meaden, Tarrier, Lewis, Wykes and Peters2014) reported significant changes in specific treatment targets (namely compliance behaviours and the power difference between voice-hearer and voice).

More recently, a new wave of relational approaches has emerged which focuses on the interpersonal relationship between the hearer and voice [see e.g. Talking with Voices and Making sense of voices (Corstens, Longden, & May, Reference Corstens, Longden and May2012; Steel et al., Reference Steel, Schnackenberg, Perry, Longden, Greenfield and Corstens2019)]; Relating Therapy (Hayward, Jones, Bogen-Johnston, Thomas, & Strauss, Reference Hayward, Jones, Bogen-Johnston, Thomas and Strauss2017; Hayward, Overton, Dorey, & Denney, Reference Hayward, Overton, Dorey and Denney2009); AVATAR therapy (Craig et al., Reference Craig, Rus-Calafell, Ward, Leff, Huckvale, Howarth and Garety2018; Leff, Williams, Huckvale, Arbuthnot, & Leff, Reference Leff, Williams, Huckvale, Arbuthnot and Leff2013; Leff, Williams, Huckvale, Arbuthnot, & Leff, Reference Leff, Williams, Huckvale, Arbuthnot and Leff2014; Ward et al., Reference Ward, Rus-Calafell, Ramadhan, Soumelidou, Fornells-Ambrojo, Garety and Craig2020). A key aspect of these relational approaches is the representation of the social agent to which the voice is ascribed (Deamer & Wilkinson, Reference Deamer and Wilkinson2015) with voice personification and characterisation brought to the fore. AVATAR therapy involves real-time ‘face-to-face’ dialogue between the person and a computerised representation of their main (dominant) persecutory voice (enacted by the therapist using voice transformation software). The typical AVATAR therapy structure involves an initial focus on the person asserting themselves over verbatim voice content, delivered by the avatar to mirror the daily voice experience (Session 1–3). Time spent in direct dialogue in early sessions tends to be short (~5 min) and is influenced by individual differences in anxiety and acclimatisation to the set-up. Over time the avatar transitions to a more conciliatory position cueing a focus on self-esteem, developing a shared understanding of the voice and working towards the person reclaiming power and control in their life (Sessions 4–6). From Session 4 onwards the dialogue tends to become more extended as the therapy targets a realistic enactment of the ascribed character and background of the voice reflecting the person's beliefs (e.g. identity, power, intention and the consequences of resistance (Chadwick & Birchwood, Reference Chadwick and Birchwood1994) and how they experience voice characterisation when relating to their voice. The ultimate aim of AVATAR therapy is for the person to experience increased power, control and confidence within the avatar dialogue which is generalised to the daily voices and other social relationships (see also Hayward et al., Reference Hayward, Jones, Bogen-Johnston, Thomas and Strauss2017; Steel et al., Reference Steel, Schnackenberg, Perry, Longden, Greenfield and Corstens2019). While voice characterisation has been considered likely to impact on AVATAR therapy delivery (Craig, Rus-Calafell, & Ward, Reference Craig, Rus-Calafell and Ward2016; Ward et al., Reference Ward, Rus-Calafell, Ramadhan, Soumelidou, Fornells-Ambrojo, Garety and Craig2020) its specific influence on therapy remains an open question. Indeed while ‘characterful’ voices appear common among voice-hearers (Beavan, Reference Beavan2011; Woods et al., Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015), attempts to understand associations between voice characterisation and other variables are only just emerging and, to date, no empirical study has investigated its role in psychological therapy for voices.

Aims

This study aims to explore whether the extent of everyday voice engagement and participant–avatar interaction within AVATAR therapy are associated with the degree to which the person's dominant voice is experienced as a highly characterised social agent.

Hypotheses

1) Higher voice characterisation will be associated with increased behavioural engagement with voices.
2) Higher voice characterisation will be associated with increased interaction (longer time in direct dialogue) with a digital representation of the voice (‘the avatar’) within AVATAR therapy.

Methods

This study tests associations between voice characterisation (coded high, medium or low from pre-therapy assessments) and (i) a standardised measure of behavioural engagement [The Beliefs about Voices Questionnaire-Revised (BAVQ-R); assessed pre-therapy] and (ii) time spent in direct dialogue with ‘avatar’ (digital representation of voice) during Session 4 of AVATAR therapy intervention.

Participants

Sixty participants allocated to receive AVATAR therapy were randomly selected from a larger clinical trial of AVATAR therapy [approved by the London-Hampstead Research Ethics Committee (Reference 13/Lo/0482) (Craig et al., Reference Craig, Rus-Calafell, Ward, Leff, Huckvale, Howarth and Garety2018)]. The inclusion criteria: aged over 18 years, ability to speak and read English, and have experienced troubling auditory hallucinations for at least 12 months. Exclusion criteria: aged under 18 years, unable to give informed consent, currently in receipt of cognitive behaviour therapy for psychosis, refusing all medication, a diagnosis of organic brain disease or a primary substance dependency or auditory hallucinations in a language not spoken by the therapists. A total of 45 participants engaged with therapy such that a recorded Session 4 was available and therefore this subset was used to test Hypothesis 2.

Measures

Characterisation: A novel phenomenological coding framework was developed to derive a measure of the degree of characterisation of the person's main voice. Characterisation reflected the degree to which the voice was experienced as an identifiable and characterful entity. Informed by previous phenomenological work (see e.g. Woods et al., Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015), the coding framework took into account identified physical characteristics, identity and psychosocial characteristics. Presence of more of these features and aspects carrying greater complexity/detail indicated higher degree of characterisation. Three levels (high, medium or low; see Table 3) were defined with the rationale that the degree of characterisation might be particularly relevant to the AVATAR therapy context. A clear identity (whether the person was explicitly ‘known’ in the sense of a ‘real-life’ referent or an identifiable stranger) was a requirement for a coding of high characterisation, and an exclusion for coding of low characterisation. The framework was used to code transcripts of the baseline assessment on the AVATAR trial as well as the clinical interview which precedes the commencement of AVATAR therapy [this clinical assessment included the question ‘Does it feel as though the voice(s) that you hear have their own character or personality?’]. The focus of these assessments was on the main/dominant voice as this is represented by the created ‘avatar’ and targeted within AVATAR therapy. Following the initial development of the coding frame, five participants were randomly selected for pilot coding by authors (RL and TW). Through this process the framework was refined to ensure reliability and accurate reflection of the data. RL then coded the remaining sample with 10 participants randomly selected and independently coded by TW; interrater reliability for the characterisation variable was 0.88 (quadratic weighting). There was 100% agreement for ratings involving high characterisation; three disagreements in ratings of low and medium categories were resolved by further discussion.

BAVQ-R (Chadwick, Lees, & Birchwood, Reference Chadwick, Lees and Birchwood2000): a validated self-report measure of a person's appraisals of their voices, including perceived malevolence, benevolence and omnipotence (six items each), and styles of relating to the voice; behavioural engagement, emotional engagement, emotional resistance (four items each), and behavioural resistance (five items). Items are scored from 0 (disagree) to 3 (strongly agree). The scale has good validity and good reliability (e.g. Cronbach's alpha for engagement of 0.87 and for resistance of 0.85) (Chadwick et al., Reference Chadwick, Lees and Birchwood2000). Although the BAVQ-R response scales have been validated using a total including both emotional and behavioural items, subscales have been reported separately (Birchwood et al., Reference Birchwood, Michail, Meaden, Tarrier, Lewis, Wykes and Peters2014). Behavioural engagement subscale score was used in the current study as a measure of active engagement with the voice (e.g. items: When I hear my voice usually: ‘I seek the advice of my voice’; ‘I willingly follow what my voice tells me to do’; ‘I listen to it because I want to’; ‘I have done things to start to get in contact with my voice’). Additional response subscales were included in exploratory analysis: emotional engagement (‘My voice reassures me/makes me happy/makes me feel calm/makes me feel confident’); behavioural resistance (e.g. ‘When I hear my voice I usually tell it to leave me alone/try to stop it’); emotional resistance (e.g. “My voice frightens me/makes me feel down).

Interaction with ‘avatar’ within AVATAR therapy dialogue: Session 4 recordings (n = 45) were transcribed and used to measure engagement in dialogue with the avatar. The primary measure of interaction was total conversation time (TCT), between the person and the avatar, in seconds. Secondary variables were: total number of words (TNW) spoken by person and avatar during the conversation, number of exchanges (NoE) between avatar and person and number of words spoken by the person (NWP). TCT was calculated using the timer displayed on Windows Media Player and ‘number of words’ data were calculated using the ‘word count’ function in Microsoft Word. Time or words captured by these measures excluded any time spent talking with the therapist. Session 4 was selected as most reflective of elaborated dialogue given the typical AVATAR therapy structure. Example Session 4 dialogue: Avatar: ‘You seem different, I can't hold you down like before, what is changing?’; person: ‘I'm not that little girl anymore, I'm a grown woman, you will not have this power over me anymore’.

Analysis

Analysis was conducted using Stata Version 16. Behavioural engagement (BAVQ-R) showed significant positive skew which could not be corrected using log transformation, therefore the non-parametric Kruskal–Wallis test was used to analyse associations between characterisation and behavioural engagement, with pairwise post-hoc Dunn tests to compare levels (high, medium and low). Univariate analysis of variance, with post-hoc Tukey testing, was used to analyse differences in TCT during AVATAR therapy dialogue for the different levels of voice characterisation (high, medium and low). To increase the power given the relatively small group sizes, secondary analyses were conducted with groups dichotomised as high (complex) characterisation v. medium/low (non-complex) characterisation (see Alderson-Day et al. (Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021) for a comparable approach) – Mann–Whitney U tests were used for Hypothesis 1 and independent t tests for Hypothesis 2.

Results

Data from 60 participants were included in this study; ages ranged between 19 and 67 years old (to the nearest year), with a mean age of 42 years. Table 1 shows demographics, clinical characteristics and engagement variables.

Table 1. Demographics for the sample (n = 60)

Descriptive data on voices

Although most participants reported multiple voices (82%, n = 49), assessments reported in the current paper were anchored to the dominant voice, as this was used to create the avatar. The majority of dominant voices were male with ‘stranger’ the most common identity category (see Table 2).

Table 2. Ascribed gender and identity of the main voice

All participants heard intelligible voice content, but almost half (45%, n = 27) also heard vague or unintelligible content. For almost all participants (95%, n = 57), voices addressed the participant in the second person at least some of the time, however, 42% (n = 25) of people had voices using a mixture of forms of address. All participants reported at least some of their voice content as communicative – this included 100% reporting at least one verbatim voice comment that had a literal meaning (typically relating to criticism, abuse and threat), as well as over half reporting a non-literal ‘communicative intent’ (57%, n = 34), e.g. the voice drawing the attention to a chocolate bar wrapper on the floor taken to mean ‘it's there to say eat lots of chocolate and get fat’; a voice command ‘go sit outside’ taken to mean the voice coercing the person to become homeless. The degree of characterisation of the voice (three categories: high, medium and low), with frequencies, defining features and examples, are shown in Table 3.

Hypothesis 1: higher voice characterisation will be associated with increased behavioural engagement with voices

Table 3. Degree of characterisation of voices (high, medium and low) with associated typical features and examples

There was a significant association between degree of voice characterisation and behavioural engagement (BAVQ-R) as shown in Table 4 (moderate effect size). As hypothesised, those with highly characterised voices showed higher behavioural engagement when compared with those with medium characterisation (moderate effect size) and low (small-moderate effect size). There were no significant differences in behavioural engagement between low and medium characterisation. Exploratory analysis was conducted for the other response scales of BAVQ-R. There was a significant association between characterisation and emotional engagement (moderate effect size). Those with highly characterised voices showed significantly higher emotional engagement when compared with medium (moderate effect size) and low (small-moderate effect size) characterisation. There were no associations between voice characterisation and resistance (behavioural or emotional).

Hypothesis 2: higher voice characterisation will be associated with increased interaction with a digital representation of the voice ('the avatar’) within AVATAR therapy

Table 4. Engagement and resistance (behavioural and emotional) across different levels of voice characterisation

Note: For non-parametric tests effect size r = z/√N, where z = standardised test statistic. N = number of observations (Rosenthal, Reference Rosenthal1991); 0.1 = small, 0.3 = medium, 0.5 = large.

The mean TCT across the sample (n = 45) was 742.42 s (s.d. = 293.94). There was a significant effect of the degree of characterisation on TCT (Table 5). Post-hoc tests indicated that TCT was significantly greater for those with high characterisation, than those with low or medium characterisation (large effect sizes); no significant difference was found between low and medium characterisation (Table 5).

Table 5. Total duration of active dialogue between person and avatar (during Session 4)

Note: TCT = total conversation time (seconds).

As a sensitivity check, three other objective measurements of engagement with the avatar (TNW spoken, NWP, NoE) were tested for associations with voice characterisation. While the ‘TNW’ and ‘NWP’ variables showed the same pattern reported above, the ‘NoE’ variable did not show a significant association with characterisation (see Online Supplementary Table S1).

Secondary analyses

Secondary analyses comparing high (complex) with low/medium (non-complex) characterisation showed significant differences in behavioural and emotional engagement (moderate effect sizes) but not resistance scales (see Online Supplementary Table S2). Avatar dialogue interaction was significantly greater in those with high (complex) characterisation than low/medium (non-complex) for all measures – effect sizes were large except for NoE which showed moderate effect sizes (see Online Supplementary Table S3).

Discussion

This study builds on the conceptualisation of voices as social experiences of communication with an ascribed ‘other’. Voice characterisation has been investigated with respect to everyday voice interaction and, for the first time, engagement within AVATAR therapy for distressing voices. The findings suggest that more highly characterised dominant voices (a third of the sample) were associated with increased behavioural engagement with voices which was mirrored in greater interaction during AVATAR therapy dialogue.

The findings that just over 70% of the sample reported some degree of voice characterisation (i.e. high or medium) is broadly consistent with other phenomenological research in this area (Beavan, Reference Beavan2011; Woods et al., Reference Woods, Jones, Alderson-Day, Callard and Fernyhough2015), suggesting that voices are ‘characterful’ to some extent for the majority of voice-hearers. While the most common voice identity was ‘unknown/strangers’, all reported communicative intent from their voice (literal and in some cases also non-literal). Similarly, Alderson-Day et al. (Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021) found (within a sample attending an early intervention service) that 75% of voices recurred over time, had a distinct character, but were not related to a known person [i.e. experienced as an ‘internally individuated agency’ (Wilkinson & Bell, Reference Wilkinson and Bell2016)]. The key differences observed in the current study related specifically to highly characterised voices compared with medium (one-dimensional characterisation) or low (amorphous voices); no consistent differences were observed between low and medium characterisation.

There is increasing interest in how, when and why voices become personified and whether characterisation evolves over time. Personification may emerge early for some [where social cognitive and neurocognitive processes are primed for the ascription of agency (Bell, Reference Bell2013)], while for others voices may remain essentially non-personified experiences. Previous findings of complex personification at the point of first clinical contact and absence of association with time since onset suggest that personification cannot be easily reduced to a secondary consequence of beliefs about voices (Alderson-Day et al., Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021). The study of Alderson-Day et al. (Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021) also reported that complex personification (significantly overlapping in definition with ‘high characterisation’ in the current study) was found in 40% of participants and associated with experiencing voices as conversational and companionable. This is consistent with the finding in the current study of an association between high characterisation (33% of the current sample) and behavioural engagement (see also John's ‘need for a conversation’ with his highly characterised voice in the opening quote). It should be noted that voices inhabit a complex phenomenological and developmental landscape in which establishing primacy and reciprocal influence between beliefs, experiences and personification is challenging. Interpersonal experiences (including current discrimination or conversely positive new relationships) evolve over time, continuing to shape experiences in ways that may be missed in cross-sectional studies (see also Longden et al., Reference Longden, Corstens, Escher and Romme2012).

Despite the findings in support of the main study hypotheses, important questions remain. Overall ratings of behavioural engagement (BAVQ-R) were notably low. This is unsurprising in a sample recruited for recurrent distressing voices, given the association of BAVQ-R engagement with benevolent voices (Chadwick et al., Reference Chadwick, Lees and Birchwood2000). There are also important clinical questions as to how ‘voice engagement’ might vary across individuals and contexts. Higher levels of engagement may reflect active dialogue with positive aspects of voices, consistent with the similar pattern of findings found for positive emotional engagement (albeit again with low overall levels). The interpersonal dimension of ‘proximity’, with opposing poles of ‘withdrawal/self-isolation’ and ‘over-involvement/intrusiveness’ (Hayward et al., Reference Hayward, Berry and Ashton2011) has been identified as important in the context of distressing voices (Birtchnell, Reference Birtchnell1996; Hayward et al., Reference Hayward, Berry and Ashton2011). This underscores the diversity of voices and the role of context (current and autobiographical) when considering relating to voices (Hayward et al., Reference Hayward, Jones, Bogen-Johnston, Thomas and Strauss2017). Qualitative research has started to explore relational change in voices over time (Bogen-Johnston, deVisser, Strauss, & Hayward, Reference Bogen-Johnston, deVisser, Strauss and Hayward2019; Hartigan, McCarthy-Jones, & Hayward, Reference Hartigan, McCarthy-Jones and Hayward2014). Future longitudinal studies exploring the potential evolution of voice characterisation over time would be of significant theoretical and clinical importance. Experience-sampling methodology (Myin-Germeys et al., Reference Myin-Germeys, Kasanova, Vaessen, Vachon, Kirtley, Viechtbauer and Reininghaus2018) may also illuminate further the ways in which characterisation intersects with voice engagement, distress and daily activity and how this might evolve (e.g. before and after therapy).

This study provides the first empirical evidence that baseline voice characterisation predicts person–avatar interaction during active therapy dialogue. TCT was selected as the best proxy for active engagement, with the rationale that extended dialogue increases the opportunity to address treatment targets including work on past relational conflicts which can be mirrored in the voice (Ward et al., Reference Ward, Rus-Calafell, Ramadhan, Soumelidou, Fornells-Ambrojo, Garety and Craig2020). However, linguistic analysis of dialogic exchanges might identify more nuanced aspects of the therapy process especially relating to enacted characterisation and the ‘pragmatic’ opportunities within dialogue (Deamer & Wilkinson, Reference Deamer and Wilkinson2015).

Clearly a crucial question is whether the influence of voice characterisation on engagement in AVATAR dialogue leads to improvement in the actual distressing voices. Rus-Calafell et al. (Reference Rus-Calafell, Ward, Zhang, Edwards, Garety and Craig2020) have demonstrated that the interaction between sense of voice presence (i.e. virtual embodiment of the ‘voice as avatar’) and reduction of anxiety predicted improvement in two of the significant AVATAR therapy outcomes: PSYRATS (Haddock, McCarron, Tarrier, & Faragher, Reference Haddock, McCarron, Tarrier and Faragher1999) total and frequency of voices. Future research could investigate whether voice characterisation influences sense of presence and has an impact on treatment outcome and we plan to investigate this (we discuss in more detail below, in clinical implications, our planned exploration of characterisation as a potential moderator of treatment outcome in the new AVATAR2 clinical trial).

Limitations

This study reports associations and there is a possibility of unmeasured causes in addition to uncertainty concerning the direction of causation. Future longitudinal and interventional studies may elucidate these relationships. The recruitment of the sample with distressing voices means that findings may not generalise to people who experience voices without associated distress. Therefore, replication is required in studies ideally including non-clinical voice-hearers. The coding framework defined three levels of characterisation with the rationale that the degree of characterisation might be important in the AVATAR therapy context. However, no significant differences were observed between low and medium categories, while effect sizes for differences in behavioural engagement (BAVQ-R) were, unexpectedly, slightly larger for comparisons of high v. medium than high v. low. Rater disagreements were also limited to the low/medium distinction. Taken together these findings suggest that it may be preferable to dichotomise characterisation into complex (high) v. non-complex (low/medium) characterisation consistent with other emerging work (Alderson-Day et al., Reference Alderson-Day, Woods, Moseley, Common, Deamer, Dodgson and Fernyhough2021). Indeed, the secondary analyses adopting this approach showed consistent differences with study hypotheses. The coding framework did not include pre-determined scoring rules and would benefit from refinement to increase standardisation and operationalisation of essential scoring criteria. This work is currently underway and will allow us to report on validity in future studies. Furthermore, while the assessments (and AVATAR therapy) were anchored to the dominant persecutory voice, multiple voices were the norm in this sample and it is possible that certain responses might incorporate aspects of the non-dominant voices which may have affected findings. The association between characterisation and avatar dialogue engagement was less strong for NoE compared to the other measures (although it was significant with a moderate effect size when characterisation was dichotomised). A fine-grained approach, for example the linguistic analysis noted above, would be required to establish the optimal method of measuring interaction in dialogue in future studies.

Clinical implications

The process of creating an avatar representing the voice (an act of externalisation, personification and embodiment) has been identified as helpful for many, including those with less personified voices (Ward et al., Reference Ward, Rus-Calafell, Ramadhan, Soumelidou, Fornells-Ambrojo, Garety and Craig2020). The AVATAR therapy approach could be framed as changing voice characterisation through dialogue, with the aim of mirrored change in the everyday voice. It remains an open question as to whether AVATAR therapy may be most indicated where complex personification is ‘in play’ pre-therapy. It might also be argued that more highly characterised voices (i.e. those showing complex personification) may be particularly amenable to other approaches which share AVATAR therapy's emphasis on the mirroring of voice relating with other relationships (Hayward et al., Reference Hayward, Jones, Bogen-Johnston, Thomas and Strauss2017) and the understanding of voices within autobiographical context (Corstens et al., Reference Corstens, Longden and May2012; Steel et al., Reference Steel, Schnackenberg, Perry, Longden, Greenfield and Corstens2019). This might suggest personalising the therapy approach according to the degree and nature of the characterisation. Conversely relational approaches, including AVATAR therapy, may offer equally effective (albeit perhaps differing) therapeutic opportunities for personified and non-personified voices. A new trial is about to start (AVATAR2) which, alongside testing efficacy compared with usual care, will examine optimising and personalising AVATAR therapy (Reference Garety, Edwards, Ward, Emsley, Huckvale, McCrone and CraigGarety et al. submitted). The efficacy of a brief (six sessions with standardised focus on exposure and assertiveness) and extended (12 sessions incorporating individual formulation-driven treatment targets) form of AVATAR therapy will be tested compared to usual care, with voice characterisation included as a potential moderator of treatment outcome.

Conclusion

The current study represents an important initial step in connecting voice characterisation with voice engagement (in and out of therapy). We have found that dominant voices showing complex characterisation are associated with increased voice engagement in daily life and that this is mirrored in more extended AVATAR therapy dialogue. Relational approaches such as AVATAR therapy are changing the landscape of psychological therapy for distressing voices. The experience of voice-hearing is diverse and grounded within personal biography and individual meanings. It is hoped that the integration of phenomenological and clinical research may lead to the development of more effective, personalised therapies for people who experience distressing voices.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291721000659.

Acknowledgements

The research was funded by the Wellcome Trust (FWBC-AVATAR 098272/z/12/z). PAG and TKJC were part-funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Declaration of interests

None.

Data sharing

According to Wellcome Trust's Policy on data, software and materials management and sharing, all data supporting this study will be available in anonymised form from the corresponding author (TW) on reasonable request, subject to review, following the publication of results.

Footnotes

Joint First Authors

†

Joint Last Authors

References

Alderson-Day, B., & Fernyhough, C. (2016). Auditory verbal hallucinations: Social, but how? Journal of Consciousness Studies, 23(7–8), 163–194.Google Scholar

Alderson-Day, B., Woods, A., Moseley, P., Common, S., Deamer, F., Dodgson, G., … Fernyhough, C. (2021). Voice-Hearing and personification: Characterizing social qualities of auditory verbal hallucinations in early psychosis. Schizophrenia Bulletin, 47(1), 228–236.CrossRef Google Scholar PubMed

Beavan, V. (2011). Towards a definition of “hearing voices”: A phenomenological approach. Psychosis, 3(1), 63–73.CrossRef Google Scholar

Bell, V. (2013). A community of one: Social cognition and auditory verbal hallucinations. PLoS Biology, 11(12), e1001723.CrossRef Google Scholar PubMed

Birchwood, M., & Chadwick, P. (1997). The omnipotence of voices: Testing the validity of a cognitive model. Psychological Medicine, 27(6), 1345–1353. doi: 10.1017/s0033291797005552.CrossRef Google Scholar PubMed

Birchwood, M., Gilbert, P., Gilbert, J., Trower, P., Meaden, A., Hay, J., … Miles, J. N. (2004). Interpersonal and role-related schema influence the relationship with the dominant ‘voice’ in schizophrenia: A comparison of three models. Psychological Medicine, 34(8), 1571–1580. doi: 10.1017/s0033291704002636.CrossRef Google Scholar PubMed

Birchwood, M., Meaden, A., Trower, P., Gilbert, P., & Plaistow, J. (2000). The power and omnipotence of voices: Subordination and entrapment by voices and significant others. Psychological Medicine, 30(2), 337–344. doi: 10.1017/s0033291799001828.CrossRef Google Scholar PubMed

Birchwood, M., Michail, M., Meaden, A., Tarrier, N., Lewis, S., Wykes, T., … Peters, E. (2014). Cognitive behaviour therapy to prevent harmful compliance with command hallucinations (COMMAND): A randomised controlled trial. The Lancet Psychiatry, 1(1), 23–33. doi: 10.1016/s2215-0366(14)70247-0.CrossRef Google Scholar PubMed

Birtchnell, J. (1996). How humans relate: A new interpersonal theory. London, UK: Psychology Press.Google Scholar

Bogen-Johnston, L., deVisser, R., Strauss, C., & Hayward, M. (2019). “It's just a bit like a rollercoaster”: A longitudinal qualitative study exploring a model of the phases of voice hearing. Psychosis, 11(4), 308–318.CrossRef Google Scholar

Chadwick, P., & Birchwood, M. (1994). The omnipotence of voices. A cognitive approach to auditory hallucinations. British Journal of Psychiatry, 164(2), 190–201. doi: 10.1192/bjp.164.2.190.CrossRef Google Scholar PubMed

Chadwick, P., Lees, S., & Birchwood, M. (2000). The revised beliefs about voices questionnaire (BAVQ-R). British Journal of Psychiatry, 177, 229–232. doi: 10.1192/bjp.177.3.229.CrossRef Google Scholar

Corstens, D., Longden, E., & May, R. (2012). Talking with voices: Exploring what is expressed by the voices people hear. Psychosis, 4(2), 95–104.CrossRef Google Scholar

Corstens, D., Longden, E., McCarthy-Jones, S., Waddingham, R., & Thomas, N. (2014). Emerging perspectives from the hearing voices movement: Implications for research and practice. Schizophrenia Bulletin 40, S285-S294.CrossRef Google Scholar PubMed

Craig, T. K., Rus-Calafell, M., & Ward, T. (2016). AVATAR therapy for refractory auditory hallucinations. Brief interventions for psychosis (Chapter. 4, pp. 41–54). New York: Springer Nature.Google Scholar

Craig, T. K., Rus-Calafell, M., Ward, T., Leff, J. P., Huckvale, M., Howarth, E., … Garety, P. A. (2018). AVATAR Therapy for auditory verbal hallucinations in people with psychosis: A single-blind, randomised controlled trial. The Lancet Psychiatry, 5(1), 31–40. Retrieved from https://www.thelancet.com/pdfs/journals/lanpsy/PIIS2215-0366(17)30427-3.pdf.CrossRef Google Scholar PubMed

David, A. (2004). The cognitive neuropsychiatry of auditory verbal hallucinations: An overview. Cognitive Neuropsychiatry, 9(1–2), 107–123.Google Scholar PubMed

Deamer, F., & Wilkinson, S. (2015). The speaker behind the voice: Therapeutic lessons from pragmatic theory. Frontiers in psychology, 6, 1–5.CrossRef Google Scholar

Garety, P., Edwards, C. J., Ward, T., Emsley, R., Huckvale, M., McCrone, P., … Craig, T. Optimising AVATAR therapy for distressing voices: Study protocol for the AVATAR2 multi-centre randomised controlled trial (submitted to trials).Google Scholar

Gilbert, P., & Allan, S. (1994). Assertiveness, submissive behaviour and social comparison. British Journal of Clinical Psychology, 33(3), 295–306.CrossRef Google Scholar PubMed

Gilbert, P., Birchwood, M., Gilbert, J., Trower, P., Hay, J., Murray, B., … Miles, J. N. (2001). An exploration of evolved mental mechanisms for dominant and subordinate behaviour in relation to auditory hallucinations in schizophrenia and critical thoughts in depression. Psychological Medicine, 31(6), 1117–1127. doi: 10.1017/s0033291701004093.CrossRef Google Scholar PubMed

Haddock, G., McCarron, J., Tarrier, N., & Faragher, E. B. (1999). Scales to measure dimensions of hallucinations and delusions: The psychotic symptom rating scales (PSYRATS). Psychological Medicine, 29(4), 879–889. doi: 10.1017/S0033291799008661.CrossRef Google Scholar PubMed

Hartigan, N., McCarthy-Jones, S., & Hayward, M. (2014). Hear today, not gone tomorrow? An exploratory longitudinal study of auditory verbal hallucinations (hearing voices). Behavioural and Cognitive Psychotherapy, 42(1), 117.CrossRef Google Scholar

Hayward, M. (2003). Interpersonal relating and voice hearing: To what extent does relating to the voice reflect social relating? Psychology and Psychotherapy: Theory. Research and Practice, 76(4), 369–383.Google Scholar

Hayward, M., Berry, K., & Ashton, A. (2011). Applying interpersonal theories to the understanding of and therapy for auditory hallucinations: A review of the literature and directions for further research. Clinical Psychology Review, 31(8), 1313–1323. doi: 10.1016/j.cpr.2011.09.001CrossRef Google Scholar

Hayward, M., Jones, A.-M., Bogen-Johnston, L., Thomas, N., & Strauss, C. (2017). Relating therapy for distressing auditory hallucinations: A pilot randomized controlled trial. Schizophrenia Research, 183, 137–142.CrossRef Google Scholar PubMed

Hayward, M., Overton, J., Dorey, T., & Denney, J. (2009). Relating therapy for people who hear voices: A case series. Clinical Psychology & Psychotherapy, 16(3), 216–227. doi: 10.1002/cpp.615CrossRef Google Scholar PubMed

Leff, J., Williams, G., Huckvale, M., Arbuthnot, M., & Leff, A. P. (2014). Avatar therapy for persecutory auditory hallucinations: What is it and how does it work? Psychosis, 6(2), 166–176.CrossRef Google Scholar PubMed

Leff, J., Williams, G., Huckvale, M. A., Arbuthnot, M., & Leff, A. P. (2013). Computer-assisted therapy for medication-resistant auditory hallucinations: Proof-of-concept study. British Journal of Psychiatry, 202, 428–433. doi: 10.1192/bjp.bp.112.124883.CrossRef Google Scholar PubMed

Longden, E., Corstens, D., Escher, S., & Romme, M. (2012). Voice hearing in a biographical context: A model for formulating the relationship between voices and life history. Psychosis, 4(3), 224–234.CrossRef Google Scholar

Mawson, A., Cohen, K., & Berry, K. (2010). Reviewing evidence for the cognitive model of auditory hallucinations: The relationship between cognitive voice appraisals and distress during psychosis. Clinical Psychology Review, 30(2), 248–258. doi: 10.1016/j.cpr.2009.11.006.CrossRef Google Scholar PubMed

Myin-Germeys, I., Kasanova, Z., Vaessen, T., Vachon, H., Kirtley, O., Viechtbauer, W., & Reininghaus, U. (2018). Experience sampling methodology in mental health research: New insights and technical developments. World Psychiatry, 17(2), 123–132.CrossRef Google Scholar PubMed

Paulik, G. (2012). The role of social schema in the experience of auditory hallucinations: A systematic review and a proposal for the inclusion of social schema in a cognitive behavioural model of voice hearing. Clinical Psychology & Psychotherapy, 19(6), 459–472. doi: 10.1002/cpp.768.CrossRef Google Scholar

Romme, M., Escher, S., Dillon, J., Corstens, D., & Morris, M. (2009). Living with voices: 50 stories of recovery: PCCS books, Monmouth, UK.Google Scholar

Rosenthal, R. (1991). Meta-Analytic procedures for social research (rev. ed.). applied social research methods series, Vol. 6. Thousand Oaks, CA: Sage Publications, Inc; US.CrossRef Google Scholar

Rus-Calafell, M., Ward, T., Zhang, X. C., Edwards, C. J., Garety, P., & Craig, T. (2020). The role of sense of voice presence and anxiety reduction in AVATAR therapy. Journal of Clinical Medicine, 9(9), 2748.CrossRef Google Scholar PubMed

Steel, C., Schnackenberg, J., Perry, H., Longden, E., Greenfield, E., & Corstens, D. (2019). Making sense of voices: A case series. Psychosis, 11(1), 3–15.CrossRef Google Scholar

Ward, T., Rus-Calafell, M., Ramadhan, Z., Soumelidou, O., Fornells-Ambrojo, M., Garety, P., … Craig, T. K. (2020). AVATAR Therapy for distressing voices: A comprehensive account of therapeutic targets. Schizophrenia Bulletin, 46(5), 1038-1044.CrossRef Google Scholar PubMed

Waters, F., Allen, P., Aleman, A., Fernyhough, C., Woodward, T. S., Badcock, J. C., … Menon, M. (2012). Auditory hallucinations in schizophrenia and nonschizophrenia populations: A review and integrated model of cognitive mechanisms. Schizophrenia Bulletin, 38(4), 683–693.CrossRef Google Scholar PubMed

Wilkinson, S., & Bell, V. (2016). The representation of agents in auditory verbal hallucinations. Mind & Language, 31(1), 104–126.CrossRef Google Scholar PubMed

Woods, A., Jones, N., Alderson-Day, B., Callard, F., & Fernyhough, C. (2015). Experiences of hearing voices: Analysis of a novel phenomenological survey. The Lancet Psychiatry, 2(4), 323–331. Retrieved from https://www.thelancet.com/pdfs/journals/lanpsy/PIIS2215-0366(15)00006-1.pdf.CrossRef Google Scholar PubMed

Table 1. Demographics for the sample (n = 60)

Table 2. Ascribed gender and identity of the main voice

Table 3. Degree of characterisation of voices (high, medium and low) with associated typical features and examples

Table 4. Engagement and resistance (behavioural and emotional) across different levels of voice characterisation

Table 5. Total duration of active dialogue between person and avatar (during Session 4)

Ward et al. supplementary material

File 19.5 KB

Article contents

The role of characterisation in everyday voice engagement and AVATAR therapy dialogue

Abstract

Keywords

Introduction

Aims

Hypotheses

Methods

Participants

Measures

Analysis

Results

Descriptive data on voices

Secondary analyses

Discussion

Limitations

Clinical implications

Conclusion

Supplementary material

Acknowledgements

Declaration of interests

Data sharing

Footnotes

References

Ward et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests