Factor structure, reliability and criterion-related validity of the English version of the Problematic Series Watching Scale

Emanuele Fino; Mollie Humphries; Jake Robertson; Gábor Orosz; Mark D. Griffiths

doi:10.1192/bjo.2022.561

Factor structure, reliability and criterion-related validity of the English version of the Problematic Series Watching Scale

Published online by Cambridge University Press: 24 August 2022

Gábor Orosz and

Emanuele Fino*: Affiliation:
NTU Psychology, Nottingham Trent University, UK
Mollie Humphries: Affiliation:
NTU Psychology, Nottingham Trent University, UK
Jake Robertson: Affiliation:
NTU Psychology, Nottingham Trent University, UK
Gábor Orosz: Affiliation:
Sherpas Laboratory, Université d'Artois, France
Mark D. Griffiths: Affiliation:
NTU Psychology, Nottingham Trent University, UK
*: Correspondence: Emanuele Fino. Email: [email protected]

Article contents

Abstract
Background
Aims
Method
Results
Conclusions
Method
Results
Discussion
Data availability
Author contributions
Funding
Declaration of interest
References

Rights & Permissions

Abstract

Background

Psychological research in the past decade has investigated the psychosocial implications of problematic use of on-demand online video streaming services, particularly series watching. Yet, a psychometric measure of problematic series watching in English is not available.

Aims

The present study aimed to test the factor structure, reliability and criterion-related validity of the English version of the Problematic Series Watching Scale, a six-item self-report assessing problematic series watching, based on the biopsychosocial components model of addiction.

Method

Participants were recruited from two UK university student samples. Study 1 (n = 333) comprised confirmatory factor analysis, reliability tests and item response theory analyses to test the original unidimensional model and investigate each item's levels of discrimination and information. Study 2 (n = 209) comprised correlation analyses to test the criterion-related validity of the scale.

Results

There was a good fit of the theoretical model of the scale to the data (Comparative Fit Index = 0.998, Root Mean Square Error of Approximation = 0.024 [90% CI 0.000–0.093], Standardised Root Mean square Residual = 0.048), satisfactory reliability (ω = 0.79) and item levels of discrimination and information. The scale positively correlated with time spent watching series (rs = 0.26, P < 0.001) and negative affect (rs = 0.43, P < 0.001), and correlated negatively with positive affect (rs = −0.12, P > 0.05), mental well-being (rs = −0.25, P < 0.001) and sleep quality (rs = −0.14, P < 0.05).

Conclusions

Results are discussed in relation to the ongoing debate on binge watching and series watching in the context of positive reinforcement versus problematic behaviour.

Keywords

Addiction components model binge watching factor structure problematic series watching online video streaming

Type: Papers
Information: BJPsych Open , Volume 8 , Issue 5 , September 2022 , e160

DOI: https://doi.org/10.1192/bjo.2022.561 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists

The recent proliferation of online video streaming platforms and services has resulted in global dramatic changes in consumer screen watching behaviour. Research has indicated distinctive situational and structural features facilitating their success and spread in several cultural contexts, including the unlimited access to shows, the affordable costs of subscriptions, being advertising-free, being available on multi-devices, and their wide offer of serialised shows.^{Reference Granow, Reinecke and Ziegele1} Other studies have highlighted the increasing personal autonomy underlying the offer as a major driver of their success.^{Reference Ort, Wirz and Fahr2}

Research in the past decade has investigated the psychosocial implications of problematic use of online video streaming services, particularly ‘binge watching’ of television series.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3,Reference Flayelle, Maurage, Vögele, Karila and Billieux4} Although there is no consensus on its definition, the term 'binge watching' is broadly used to refer to a sustained usage of audio-visual content^{Reference Ort, Wirz and Fahr2} through digital media and in a serialised fashion.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3,Reference Balakrishnan and Griffiths5} Some authors have questioned the extent to which binge watching represents a problematic behaviour, arguing that although excessive behaviours sometimes manifest as a repeated and prolonged engagement into a particular activity, they do not necessarily carry adverse biopsychosocial effects.^{Reference Flayelle, Maurage, Vögele, Karila and Billieux4} Nevertheless, if comparing such behaviour to the more extensively studied phenomenon of problematic internet use, concerns arise as to whether some individuals might develop uncontrolled and dysfunctional patterns of binge watching, leading to mental and physical health consequences.^{Reference Fernandes, Maia and Pontes6}

The biopsychosocial components model of addiction^{Reference Griffiths7,Reference Griffiths, Kuss, Pontes, Billieux, Wolff, White and Karch8} posits that any behaviour can become addictive, and in that respect, six psychological components manifest in the affected individuals: salience, mood modification, tolerance, withdrawal, conflict and relapse. Salience indicates the prominence of the behaviour in an individual's mind. Mood modification represents the use of the behaviour to cope with stress, anxiety, depressive symptoms and with negative emotions generally. Tolerance refers to the progressively greater amount of time spent engaging in that behaviour to achieve the desired feeling (e.g., arousal or escape). Withdrawal represents the experience of cognitive distortion and negative feelings associated with a discontinuity in the behaviour, involving a series of psychological symptoms (from stress to insomnia, and even physical side-effects). Conflict indicates an individual's experience of struggling with the increasing concerns associated with the behaviour, to the point of interfering with their mental functioning, everyday activities and social relationships. Finally, relapse represents the tendency to revert to previous patterns of the behaviour as a consequence of a failed attempt to control or interrupt the dysfunctional cycle.

The biopsychosocial components model of addiction has been successfully adapted to the investigation of several online problematic behaviours, such as YouTube use,^{Reference Balakrishnan and Griffiths5} sexual behaviour and online pornography use,^{Reference Griffiths9} and most importantly, internet addiction.^{Reference Kuss, Shorter, van Rooij, Griffiths and Schoenmakers10} Recently, the addiction components model has also been used to operationalise the assessment of problematic series watching. A psychometric measure (i.e. Problematic Series Watching Scale; PSWS) was developed, tested and validated among Hungarian samples. However, to date, the English version provided by the authors has not been tested among an English-speaking sample, limiting its usability in other linguistic and cultural contexts.^{Reference Orosz, Bőthe and Tóth-Király11}

Addictive versus problematic behaviours

Recent literature on problematic internet use has outlined an important theoretical distinction between problematic behaviours and addictive syndromes, with the former considered as being outside the range of pathological mental conditions rather representing ‘a distinct pattern of cognitions and behaviours’, often resulting in negative outcomes for daily life.^{Reference Fernandes, Maia and Pontes6} In this sense, problematic behaviours have been located at the mid-point on a continuum of severity of implications for the individual versus addictive behaviours, the latter being intended to be at its extreme end.^{Reference Fernandes, Maia and Pontes6} Nevertheless, problematic behaviours can negatively affect several aspects of an individual's life, such as their mental and physical health, educational and occupational performances, and social attachments,^{Reference Fernandes, Maia and Pontes6} ultimately representing a risk factor for individual impairment.^{Reference Fernandes, Maia and Pontes6,Reference Caplan12,Reference Shapira, Lessig, Goldsmith, Szabo, Lazoritz and Gold13} However, in light of the contemporary scientific debate on the risk for overpathologising everyday life, caution is required with interpreting excessive behaviours under the lens of problematic outcomes rather than a healthy and possibly positive reinforcement operated by the same behaviour.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3,Reference Flayelle, Maurage, Vögele, Karila and Billieux4,Reference Flayelle, Castro-Calvo, Vögele, Astur, Ballester-Arnal and Challet-Bouju14}

Binge and problematic watching during the COVID-19 pandemic

Although binge watching and problematic video streaming use represent relatively understudied phenomena, some anecdotal observations and clinical case studies have recently been reported,^{Reference Sharma, Sharma, Anand, Thamilselvan, Suma and Nisha15} concomitant with the global affirmation of models of subscription-based, on-demand streaming services. Some have highlighted that the phenomenon may be of even greater interest in the context of the COVID-19 pandemic and its associated mid- and long-term consequences, with social isolation, financial preoccupations and the relatively accessible and affordable services provided by on-demand platforms potentially increasing the risk for such activities to turn into a dysfunctional pattern of problematic behaviour.^{Reference Dixit, Marthoenis, Arafat, Sharma and Kar16} In particular, a study of 715 adults from the Italian population, conducted in the early stages of the COVID-19 pandemic and lockdown, showed that individuals spent, on average, more time watching series during that period compared with their pre-pandemic habits, and that was associated with anxiety and stress, especially among women. Interestingly, the authors found that both non-problematic and problematic TV series watching were associated with symptoms of anxiety and escapism, possibly acting as a psychological strategy to cope with the difficulties arising with the pandemic.^{Reference Boursier, Musetti, Gioia, Flayelle, Billieux and Schimmenti17} Similarly, a study conducted among 1089 adults in the first half of 2021, showed that binge watching predicted stress, loneliness, insomnia, depression and anxiety, whereas time spent binge watching was found to increase the relevant symptoms.^{Reference Raza, Yousaf, Sohail, Munawar, Ogadimma and Siang18} Another longitudinal study investigated series watching over a 6-week period during the first pandemic lockdown in Belgium, France and Switzerland. The authors found that male gender and social motives for series were associated with lower negative affect, whereas a loss of control of binge watching predicted negative affect over time.^{Reference Sigre-Leirós, Billieux, Mohr, Maurage, King and Schimmenti19}

Problematic series watching, internet addiction and mental well-being

Previous research has found internet addiction to be associated with negative affect, poor sleep quality, and impaired mental health and quality of life.^{Reference Zhang, Tran, Huong, Hinh, Nguyen and Tho20–Reference Ho, Zhang, Tsang, Toh, Pan and Lu22} In particular, a study indicated that approximately 27% of young Vietnamese individuals diagnosed with internet addiction reported sleep-related difficulties,^{Reference Zhang, Tran, Huong, Hinh, Nguyen and Tho20} whereas another study^{Reference Tran, Huong, Hinh, Nguyen, Le and Nong21} found that young individuals with internet addiction were more likely to report negative self-care, problematic daily routines, pain, discomfort, anxiety and depression. Similarly, a meta-analysis found internet addiction to be positively associated with alcohol abuse, hyperactivity, depression and anxiety.^{Reference Ho, Zhang, Tsang, Toh, Pan and Lu22}

Interestingly, such results are consistent with recent literature on binge watching and problematic series watching. In particular, two self-report scales to assess series watching motives and binge watching engagement and symptoms have been recently developed,^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3} namely the Watching TV Series Motives Questionnaire (WTSMQ) and the Binge-Watching Engagement and Symptoms Questionnaire (BWESQ). The two scales resulted from exploratory and confirmatory analyses of motives for series watching and binge watching engagement and symptoms, respectively, in a sample comprised primarily of university students. The WTSMQ showed a four-factor model, including social motives, emotional enhancement, enrichment and coping/escapism, whereas the BWESQ showed a seven-factor model, including four ‘positive’ factors, namely engagement, positive emotions, desire savouring and pleasure preservation, as well as three factors representing dimensions of problematic behaviour: binge watching, dependency and loss of control.

They found significant correlations (P < 0.05) between the WTSMQ-Coping/escapism and positive affect (r_s = −0.13), and between the former and negative affect (r_s = 0.38), as assessed with the Positive and Negative Affect Schedule (PANAS).^{Reference Watson, Clark and Tellegen23} Notably, they also found a correlation of 0.39 between WTSMQ-Coping/escapism and scores on the Compulsive Internet Use Scale,^{Reference Meerkerk, Van Den Eijnden, Vermulst and Garretsen24} which previous research found to correlate with depression and poor sleep quality.^{Reference de Vries, Nakamae, Fukui, Denys and Narumoto25} Similar patterns were observed for BWESQ-Binge watching (positive affect: r_s = −0.07; negative affect: r_s = 0.28; compulsive internet use: r_s = 0.48), BWESQ-Dependency (positive affect: r_s = −0.12; negative affect: r_s = 0.31; compulsive internet use: r_s = 0.46) and BWESQ-Loss of control (positive affect: r_s = −0.14; negative affect: r_s = 0.26; compulsive internet use: r_s = 0.51). Other studies have reported positive correlations between problematic series watching and individuals’ self-development (r = 0.26), social interaction (r = 0.25), impulsive behaviour such as lack of perseverance (r = 0.24) and urgency (r = 0.25), harmonious series passion (r = 0.46) and obsessive series passion (r = 0.76),^{Reference Orosz, Vallerand, Bőthe, Tóth-Király and Paskuj26} and between binge watching frequency and poor sleep quality (r = 0.15), cognitive pre-sleep arousal (r = 0.15) and somatic pre-sleep arousal (r = 0.09).^{Reference Exelmans and Van den Bulck27} Furthermore, a recent study reported 32% of poor sleepers as being binge viewers, similar to what was previously found in relation to internet addiction.^{Reference Zhang, Tran, Huong, Hinh, Nguyen and Tho20,Reference Exelmans and Van den Bulck27}

Development of an English version of the PSWS

Although the WTSMQ and the BWESQ have shown satisfactory measurement properties, they focus on series watching motives and binge watching; a specific measure of problematic series watching in English, to the best of our knowledge, does not currently exist. Having a reliable and valid measure of problematic series watching in English will carry important implications and potentially open the way to improve our current understanding of problematic series watching in several cultural contexts, with significant impact on the future agenda of research concerning problematic online behaviours.

In particular, as discussed in previous literature on other problematic behaviours, translating, adapting and validating psychometric measures in other cultural contexts carries a number of advantages. For example, Weatherly and colleagues note:

‘First, it would provide a single measure that was potentially useful to practitioners and researchers in multiple cultures. Second, such a measure could be used to identify differences at a cultural level … Third, if similar relationships are found between the contingencies maintaining … the behaviour and other measures of the same problematic behaviour in other linguistic and cultural contexts, then it could be argued that one of the important factors underlying … [such behaviour] had been identified’.^{Reference Weatherly, Dymond, Samuels, Austin and Terrell28}

Moreover, this will potentially enable researchers from several cultural contexts to address a number of unresolved theoretical and measurement-related questions about problematic series watching, such as (a) better differentiation between a ‘positive’ engagement from dysfunctional patterns of problematic watching among those presenting with an excessive and/or prolonged exposure to series watching; (b) clarification of the nomological network of problematic series watching, and (c) determination of the specific circumstances and factors prompting the activation, maintenance and reinforcement of excessive and problematic watching, in accordance with the recently proposed need for a behavioural analysis of problematic and addictive behaviours.^{Reference James and Tunney29}

Research aims and hypotheses

For all of these reasons, in this research, we aimed to test the factor structure, reliability and criterion-related (concurrent) validity of the English version of the PSWS,^{Reference Orosz, Bőthe and Tóth-Király11} using two UK samples of university students. In particular, Study 1 tested the original measurement model observed in Hungarian samples,^{Reference Orosz, Bőthe and Tóth-Király11} the reliability of the model and the ability of the items to discriminate between individuals at different levels of the assumed latent dimension. Subsequently, Study 2 tested the criterion-related validity of the scale, hypothesising that PSWS scores would positively be associated with series watching time and negative affect, and negatively associated with positive affect, mental well-being and sleep quality.

Method

Study 1

Participants and procedure

Participants were UK undergraduate psychology students. The inclusion criteria were being aged at least 18 years, currently enrolled in an undergraduate psychology programme and having used any TV streaming service at least once in the past 12 months. A total of 405 students were contacted between January and June 2021, of which 358 completed the procedure and participated in Study 1, and 333 were retained for the analyses after preliminary data screening (see Results).

Participants were contacted and recruited online, mainly via the psychology department's institutional research participation scheme website, and via word of mouth and advertising on social media websites. They were told that this was a study on their use of online video streaming services and invited to complete an online survey through Qualtrics (Qualtrics, Provo, USA, https://www.qualtrics.com), an experience management platform. They were compensated for their time and effort with academic research credits, following the institution's recommendations. No financial incentives were offered. Electronic informed consent was obtained from all participants. Those who signed the consent form were initially screened for the inclusion criteria, and those who met the criteria were asked to complete the survey, at the end of which they were thanked and debriefed.

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. The School of Social Sciences Research Ethics Committee at Nottingham Trent University expressed favourable opinion on the ethics application (application number 2021/81 and amendment no. 2021/189).

Measures

The PSWS^{Reference Orosz, Bőthe and Tóth-Király11} is a measure of problematic series watching, based on the addiction components model.^{Reference Griffiths7} Each item is scored on a five-point Likert scale from 1 (‘Never’) to 5 (‘Always’), asking participants to indicate how often during the past year they had thought, felt or behaved similarly to how it was described in each individual item. The scale includes six items, representing the components of salience, mood modification, tolerance, withdrawal, conflict and relapse, respectively. Originally, its psychometric properties were tested in two independent Hungarian samples, with the scale being internally consistent in both samples (Cronbach's α = 0.69 and 0.76). Total scores can be obtained by summing up individual item's scores.

Regarding the translation and adaptation procedure, the PSWS was developed on the basis of a seven-item measure of work addiction.^{Reference Andreassen, Griffiths, Hetland and Pallesen30} A key difference between the two measures consisted in the replacement of the term ‘work’ with ‘series watching’. The PSWS items were originally translated from English to Hungarian, following the established and widely utilised protocol proposed by Beaton et al,^{Reference Beaton, Bombardier, Guillemin and Ferraz31} for the cross-cultural translation and adaptation of psychometric measures, consisting of six stages (initial dual translation, synthesis of the two translations, blind back-translation, cross-cultural evaluation by a panel of experts, test of the pre-final version, and submission). The English version of the PSWS was finally provided by the authors in their published paper, and that was used in the present study.

Statistical analyses

Data were preliminarily screened for missing observations, unengaged responses (s.d. < 0.3) and multivariate outliers, the latter determined by estimating Cook's generalised distances from a factor-analytic model accounting for the six PSWS items loading onto the problematic watching latent dimension. The 0.5 quantile of the F distribution with k + 1 and n − k − 1 degrees of freedom (α = 0.001) was used as the cut-off value for detection. Two sets of analyses were performed on two different randomly extracted data-sets (n₁ = 166 and n₂ = 167) of the original sample (N = 333), respectively: Confirmatory Factor Analysis (CFA), and Item Response Theory (IRT) analyses.

As for CFA, the mean- and variance-adjusted weighted least-square estimation method was used to account for the ordinal nature of the data. To evaluate the fit of the model to the data, the study used the comparative fit index (≥0.95), root mean square error of approximation (<0.07) and standardised root mean square residual (<0.08). The reliability (internal consistency) of the PSWS was evaluated by means of the omega coefficient, considering values ≥0.7 as satisfactory. All the analyses were conducted through statistical programming language R version 4.0.4 for macOS, free software distributed under a GNU-style copyleft by R Core Team (R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/), and the following packages: lavaan,^{Reference Rosseel32} lordif,^{Reference Choi, Gibbons and Crane33} mirt^{Reference Chalmers34} and semTools: Useful Tools for Structural Equation Modeling (The Comprehensive R Network, https://cran.r-project.org/web/packages/semTools/index.html).

Regarding IRT, a unidimensional graded response model was fitted to the data, assuming the probability to endorse an item's response to be a function of the participant's location on the latent continuum of problematic series watching. The study estimated and evaluated items’ residual correlations from the model and tested the assumption of local independence. The probability density function was used on the latent continuum to estimate marginal reliability. Item's slopes (α), specifically their ability to discriminate between participants on the latent continuum, response category thresholds (β) and the item information function (IIF) were also estimated; the latter is considered as the degree of statistical information accounted for by the item. Item characteristic curves were used to inform the analysis and the evaluation of the model. Then, each item's invariance was examined by using logistic regression/IRT and the chi-squared likelihood ratio test (α = 0.001), specifically aiming to detect any possible differential item functioning across female and male participants.

Study 2

Participants and procedure

Participants in Study 2 comprised 210 psychology students, recruited between June and October 2021, of which 209 were retained after data screening (one multivariate outlier was identified and removed). The same procedure, inclusion criteria and ethical considerations used in Study 1 were used in Study 2.

Additional measures

The 20-item PANAS^{Reference Watson, Clark and Tellegen23} comprises two ten-item subscales assessing positive affect (e.g. excitement, inspiration) and negative affect (e.g. feeling upset, afraid). In the original validation study, both scales were found to be reliable (Cronbach's α ranging from 0.86 to 0.90 for positive affect and 0.84 to 0.87 for negative affect; test–retest correlations ranging from 0.47 to 0.68 for positive affect and from 0.39 to 0.71 for negative affect). Total subscale scores were obtained by summing up individual item's scores.

The seven-item Short Warwick-Edinburgh Mental Wellbeing Scale (SWEMWBS)^{Reference Stewart-Brown, Tennant, Tennant, Platt, Parkinson and Weich35} derives from the original 14-item Warwick-Edinburgh Mental Wellbeing Scale,^{Reference Tennant, Hiller, Fishwick, Platt, Joseph and Weich36} with items representing mainly eudemonic well-being, rated on five response categories (1 = ‘None of the time’ to 5 = ‘All of the time’). Research showed that the scale is reliable (Cronbach's α = 0.85). Total SWEMWBS scores were obtained by first summing up individual item's scores, and then converting raw scores into metric scores.^{Reference Tennant, Hiller, Fishwick, Platt, Joseph and Weich36}

The Sleep Quality Scale (SQS)^{Reference Snyder, Cai, DeMuro, Morrison and Ball37} is a single-item measure of overall sleep quality. It requires respondents to think about the quality of their sleep overall, such as how many hours of sleep they usually get, how often they wake up during the night and earlier than they have to, and how refreshing they find their sleep. Respondents are then asked to rate their overall typical sleep quality on a Likert-type scale, with scores ranging from 0 to 10 (0 = ‘Terrible’, 10 = ‘Excellent’).

Finally, we asked participants to self-report how many hours they spend watching series, on average, per week.

Statistical analyses

The criterion-related (concurrent) validity of the PSWS was assessed with the second sample, after screening the data by using the same procedure used in Study 1. We used Spearman's correlations between the PSWS total scores and the other variables in the study.

Results

Study 1

Among the 358 students from the first sample who completed the study procedure, 285 (79.61%) self-reported as female, 67 (18.72%) self-reported as male, four (1.12%) self-reported as non-binary gendered and two (0.56%) preferred not to report their gender. Their minimum and maximum self-reported ages were 18 and 49 years (mean 21.82, s.d. 5.00), respectively. Five participants showed missing observations, 19 displayed patterns of unengaged responses (s.d. < 0.3) and one participant was identified as a multivariate outlier. These responses were removed from the analytical data-set, leading to a final sample of 333 useful observations, later randomly split into two independent and approximately equally sized data-sets (n₁ = 166 and n₂ = 167). Tables 1 and 2 report PSWS items’ descriptive statistics and the Spearman's rho items’ intercorrelation matrix, respectively.

Table 1 Descriptive statistics (Study 1; n = 333)

Table 2 Problematic Series Watching Scale, Spearman's rho correlation matrix (Study 1; n = 333)

***P < 0.001.

Results from CFA (n₁ = 166) showed a satisfactory model fit (comparative fit index 0.998, root mean square error of approximation 0.024 (90% CI 0.000–0.093), standardised root mean square residual 0.048), confirming the fit of the model to the data. As shown in Figure 1, the items presented adequate standardised regression weights (0.51–0.84), with the solution being internally consistent (ω = 0.79).

Fig. 1 Confirmatory factor analysis (Study 1; n ₁ = 166). PSWS, Problematic Series Watching Scale.

An IRT graded response model was then fitted to the data (n₂ = 167). The model showed residual correlations higher than the estimated critical value (0.21) for all of the items, indicating possible local dependence. Satisfactory levels of marginal reliability were found (r_xx = 0.76). Item characteristic curves showed patterns of ordered response categories for all the items, although items 2, 3 and 6 showed some overlaps of two categories (‘Rarely’, ‘Sometimes’), suggesting the utility of collapsing those categories (Fig. 2).

Fig. 2 Item characteristic curves (Study 1; n ₂ = 167). P(θ), probability endorsing a category option; P1–P5, item response curves for category options 1–5.

Item 4 showed the highest values of discrimination and information (α = 2.30, s.e. = 0.49, IIF = 1.60), followed by Item 5 (α = 1.90, s.e. = 0.37, IIF = 0.63), whereas Item 3 (α = 1.31, s.e. = 0.25, IIF = −0.41) and Item 2 (α = 1.03, s.e. = 0.22, IIF = −1.33) were the least informative and discriminative items. Table 3 reports detailed parameters, s.e. and information function for all the items.

Table 3 Graded response model, item discrimination parameters, category thresholds and information function (Study 1; n₂ = 167)

IIF, item information function, standardised values.

Items’ differential functioning was tested across female and male participants (n₂ = 165). The results from the likelihood ratio tests were not significant for any of the six items (α = 0.001), suggesting no differential item functioning.

Study 2

In the second sample, consisting of 210 original observations, one multivariate outlier and no unengaged responses were found across all the measures. Therefore, the remaining 209 observations were used in the analyses. Among the 210 students, 166 (79.81%) self-reported as female, 39 (18.75%) self-reported as male, three self-reported as non-binary (1.44%) and two preferred not to report (0.96%). Their minimum and maximum self-reported ages were 18 and 29 years (mean 20.16, s.d. 1.46), respectively.

Table 4 presents descriptive statistics on all the measures, and Table 5 shows the results from the validity analyses. The PSWS positively correlated with time spent watching series (r_s = 0.26, P < 0.001) and with PANAS negative affect (r_s = 0.43, P < 0.001), and correlated negatively with PANAS positive affect (r_s = −0.12, P > 0.05), SWEMWBS mental well-being (r_s = −0.25, P < 0.001) and SQS sleep quality (r_s = −0.14, P < 0.05), confirming the relevant hypotheses.

Table 4 Descriptive statistics (Study 2; n = 209)

PSWS, Problematic Series Watching Scale; PANAS, Positive and Negative Affect Schedule; SWEMWBS, Short Warwick-Edinburgh Mental Wellbeing Scale; SQS, Sleep Quality Scale.

Table 5 Spearman's rho correlation matrix (Study 2; n = 209)

PSWS, Problematic Series Watching Scale; PANAS, Positive and Negative Affect Schedule; SWEMWBS, Short Warwick-Edinburgh Mental Wellbeing Scale; SQS, Sleep Quality Scale.

*P < 0.05, ***P < 0.001.

Discussion

The present study tested the factor structure, reliability and criterion-related (concurrent) validity of the PSWS, originally validated in Hungarian samples,^{Reference Orosz, Bőthe and Tóth-Király11} among two UK samples of university students, using the English version provided by the authors in their published paper. We tested the relevant measurement model; its reliability; the ability of the items to discriminate between individuals positioned at different levels of the assumed problematic series watching latent continuum (Study 1); and the correlation between the PSWS and measures of time spent watching series, positive affect, negative affect, mental well-being and sleep quality (Study 2).

The results from Study 1 showed that the original measurement model was a good fit to the data, and that the solution was internally consistent, with all the items loading adequately onto the latent dimension. The IRT analyses indicated that three items required the collapsing of two categories to improve the substantive validity of the scale. Finally, regarding the criterion-related validity of the PSWS, the findings from Study 2 showed that PSWS scores were positively associated with time spent watching series and negative affect, and negatively associated with positive affect, mental well-being and sleep quality, confirming the relevant hypotheses.

Implications of the results

The findings of the present study are promising and of great interest, but are far from conclusive and require further investigation. In fact, although the results contributed to evidence on the one-dimensional structure and the reliability of the PSWS, there is a significant lack of evidence in contemporary literature with regards to the addictive and/or problematic nature of series watching. Consistently, it must be noted that the extent to which excessive and/or prolonged series watching behaviour might constitute a risk factor for an individual's cognitive, emotional and functional impairment has yet to be defined. In this regard, the arguments provided in recent literature^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3} on the risk for overpathologising everyday life must be considered of foremost interest and importance.

Nevertheless, based on findings from other preliminary studies highlighting the potential of binge watching of representing an addictive behaviour,^{Reference Riddle, Peebles, Davis, Xu and Schroeder38} recent clinical and anecdotal reports among individuals seeking help to address symptoms of problematic use of online streaming platforms,^{Reference Sharma, Sharma, Anand, Thamilselvan, Suma and Nisha15} and in the light of the societal and technological transformations happening globally, research is recommended to continue investigating the phenomenon, treating the results from the present study with caution and in the context of the wider evidence available to date. Moreover, findings of the present study will help researchers to address a number of unresolved theoretical and measurement-related questions on problematic watching, particularly (a) the extent to which an excessive, prolonged or serialised use of online video streaming services might represent a ‘positive addiction’^{Reference Shapira, Lessig, Goldsmith, Szabo, Lazoritz and Gold13} versus a continuum of problematic behaviours, determining a low to mild, or even potentially high degree of individual impairment; (b) the degree of distinctiveness of problematic watching in relation to the wider study of problematic internet use; (c) the circumstances and the factors determining the activation, maintenance and reinforcement of binge watching and excessive watching behaviours, following the lines proposed for a behavioural analysis of problematic and addictive behaviours;^{Reference James and Tunney29} (d) the prevalence of the phenomenon in the population, and the groups at higher exposure and possibly risk; and (e) how researchers and practitioners can validly assess an individual's change in their interaction with a potentially problematic or addictive behaviour. Regarding the latter point, it has been argued that it took years for a reliable psychometric instrument to be developed to assess pathological gambling, following the classification of it in the DSM-III, carrying dramatic consequences for those affected by the condition.

Critical evaluation of the validity of the PSWS

In the present study, a positive association was found between PSWS total scores and time spent watching series and the PANAS negative affect scale, and a negative association with the PANAS positive affect scale, mental well-being (assessed with the SWEMWBS) and sleep quality (assessed with the SQS), providing preliminary evidence regarding the ‘problematic’ nature of series watching as measured through the PSWS. This could be explained by looking at findings from research showing moderate correlations between coping/escapism as motives for binge watching and negative affect, and zero-to-low correlations between binge watching and positive affect.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3}

However, we recommend future research to attempt to replicate and further test such relationships, ideally by means of exploratory factor analyses, to determine the factor space of problematic series watching in relation to correlates of excessive series watching, and by testing the validity of the PSWS, particularly in relation to the available measures of binge watching motives and engagement.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3} In fact, in the study by Flayelle and collaborators, the authors explored four ‘positive’ factors, such as engagement, positive emotions, desire savouring and pleasure preservation, as well as three factors seemingly representing dimensions of problematic behaviour, such as binge watching, dependency and loss of control.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux39} Nevertheless, when looking at the correlations between the WTSMQ scores and the seven BWESQ factor scores, they were all low to moderate, whereas positive and moderate correlations were found between binge watching, dependency and loss of control assessed with the BWESQ, negative affect assessed with the PANAS and compulsive internet use assessed with the Compulsive Internet Use Scale.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3} In turn, compulsive internet use has also been associated with depressive symptoms and poor sleep quality,^{Reference de Vries, Nakamae, Fukui, Denys and Narumoto25} and negative correlations have been found between the former three ‘negative’ BWESQ subdimensions and positive affect, prompting further investigation on the dysfunctional and problematic nature of series watching.^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux39}

Regarding mental well-being, findings from a recent study indicated that time spent binge watching, measured in terms of number of episodes watched, correlated with individuals’ free time and played a key role in the effect of binge watching on mental well-being.^{Reference de Feijter, Khan and van Gisbergen40} As for sleep quality, a noteworthy result emerging from the present study is the significant negative correlation between PSWS scores and sleep quality, consistent with the results from a recent study^{Reference Exelmans and Van den Bulck27} that found binge viewing to be increasing in its prevalence and potentially representing ‘a threat to sleep’. The authors of that study argued that a high cognitive arousal may serve as the key mechanism determining such phenomenon, and they concluded that excessive viewing time and cognitive arousal before sleep should be considered by further research and as possible targets for effective intervention in binge viewers.

Nevertheless, to avoid any speculations beyond the scope of the present study, we recommend further research to investigate the validity of the PSWS compared with other existing measures of binge watching and problematic series watching and common symptoms of mental distress, to help better define the construct both theoretically and operationally, and help to determine the extent to which the PSWS might help to shed a light on this relatively new and understudied phenomenon.

General limitations

Beyond the discussion on the construct and criterion validity of the measure, the study also has other notable limitations. First, it was based on university student samples from one study discipline (i.e. psychology), requiring caution in the generalisation of the results to other populations. Second, the sample size was modest (although adequate for the testing of a six-item scale), requiring further testing in larger samples. Third, the test–retest reliability of the PSWS was not investigated. Fourth, participants were all recruited during the COVID-19 pandemic, which might have significantly skewed responses because of the social restrictions in place in the UK at the time when the data were collected, and the associated change in everyday habits and behaviours.

Limitations of the confirmatory factor analytic approach

As suggested in recent literature,^{Reference Flayelle, Canale, Vögele, Karila, Maurage and Billieux3} one of the main risk of a confirmatory approach to factor analysis might be represented by a possible overpathologisation of everyday behaviours such as television series watching, and the possibility that applying such models without a specific analysis of the validity of the model under investigation might generate problems in the interpretation of the model itself. In this regard, some have even argued that the confirmatory approach might even be conceptually untenable, overlooking alternative valid conceptualisations and proposing suboptimal treatment options,^{Reference Billieux, Schimmenti, Khazaal, Maurage and Heeren41–Reference Starcevic45} and for these reasons, further research exploring the factor space of problematic series watching in relation to the phenomenological underpinnings of binge watching and excessive series watching (e.g. motives, engagement and symptoms, e.g. as measured through the WTSMQ and BWESQ) is warranted.

In conclusion, our results show that the English version of the PSWS is internally consistent and adaptable to samples of UK university students, providing preliminary evidence on its criterion-related (concurrent) validity. Future research will benefit from comparing PSWS scores and compulsive internet use by means of validated and established measures like the Compulsive Internet Use Scale,^{Reference Meerkerk, Van Den Eijnden, Vermulst and Garretsen24} aiming to differentiate between these two constructs and provide a clearer understanding of problematic series watching by means of a targeted analysis of the discriminant validity of the scale, along with further investigation of problematic series watching as excessive versus problematic behaviour.

Data availability

The data that support the findings of this study are available from the corresponding author, E.F., upon reasonable request.

Acknowledgements

The authors would like to thank all the students who have helped with data collection.

Author contributions

E.F. conceived the study, was responsible for data curation, study methodology, software, data visualisation, study supervision, study investigation and data validation, wrote the original draft of the manuscript, and reviewed and edited the manuscript. M.H. and J.R. conceived the study, were responsible for data curation and reviewed and edited the manuscript. G.O. conceived the study and edited the manuscript. M.D.G. conceived the study and reviewed and edited the manuscript.

Funding

This research received no specific grant from any funding agency, commercial or not-for-profit sectors.

Declaration of interest

None.

References

Granow, VC, Reinecke, L, Ziegele, M. Binge-watching and psychological well-being: media use between lack of control and perceived autonomy. Commun Res Rep 2018; 35: 392–401.CrossRef Google Scholar

Ort, A, Wirz, DS, Fahr, A. Is binge-watching addictive? Effects of motives for TV series use on the relationship between excessive media consumption and problematic viewing habits. Addict Behav Rep 2021; 13: 100325.Google Scholar PubMed

Flayelle, M, Canale, N, Vögele, C, Karila, L, Maurage, P, Billieux, J. Assessing binge-watching behaviors: development and validation of the “watching TV series motives” and “binge-watching engagement and symptoms” questionnaires. Comput Hum Behav 2019; 90: 26–36.CrossRef Google Scholar

Flayelle, M, Maurage, P, Vögele, C, Karila, L, Billieux, J. Time for a plot twist: beyond confirmatory approaches to binge-watching research. Psychol Pop Media Cult 2019; 8: 308–18.CrossRef Google Scholar

Balakrishnan, J, Griffiths, MD. Social media addiction: what is the role of content in YouTube? J Behav Addict 2017; 6: 364–77.CrossRef Google Scholar PubMed

Fernandes, B, Maia, BR, Pontes, HM. Internet addiction or problematic internet use? Which term should be used? USP Psicol 2019; 30: 1–8.Google Scholar

Griffiths, MD. A ‘components’ model of addiction within a biopsychosocial framework. J Subst Use 2005; 10: 191–7.CrossRef Google Scholar

Griffiths, MD, Kuss, DJ, Pontes, HM, Billieux, J. Where do gambling and internet ‘addictions’ belong? The status of ‘other’ addictions. In The SAGE Handbook of Drug & Alcohol Studies Volume 2 (eds Wolff, K, White, J, Karch, S): 446: 470. SAGE Publications, 2016.Google Scholar

Griffiths, MD. Compulsive sexual behaviour as a behavioural addiction: the impact of the internet and other issues. Addiction 2016; 111: 2107–8.CrossRef Google Scholar PubMed

Kuss, DJ, Shorter, GW, van Rooij, AJ, Griffiths, MD, Schoenmakers, TM. Assessing internet addiction using the parsimonious internet addiction components model - a preliminary study. Int J Ment Health Addict 2014; 12: 351–66.Google Scholar

Orosz, G, Bőthe, B, Tóth-Király, I. The development of the Problematic Series Watching Scale (PSWS). J Behav Addict 2016; 5: 144–50.CrossRef Google Scholar

Caplan, SE. Problematic internet use and psychosocial well-being: development of a theory-based cognitive–behavioral measurement instrument. Comput Hum Behav 2002; 18: 553–75.CrossRef Google Scholar

Shapira, NA, Lessig, MC, Goldsmith, TD, Szabo, ST, Lazoritz, M, Gold, MS, et al. Problematic internet use: proposed classification and diagnostic criteria. Depress Anxiety 2003; 17: 207–16.CrossRef Google Scholar PubMed

Flayelle, M, Castro-Calvo, J, Vögele, C, Astur, R, Ballester-Arnal, R, Challet-Bouju, G, et al. Towards a cross-cultural assessment of binge-watching: psychometric evaluation of the “Watching TV Series Motives” and “Binge-Watching Engagement and Symptoms” questionnaires across nine languages. Comput Hum Behav 2020; 111: 106410.CrossRef Google Scholar

Sharma, M, Sharma, MK, Anand, N, Thamilselvan, P, Suma, N, Nisha, J, et al. Binge watching: an emerging manifestation of technology use. Asian J Psychiatr 2019; 45: 81–2.CrossRef Google Scholar PubMed

Dixit, A, Marthoenis, M, Arafat, SMY, Sharma, P, Kar, SK. Binge watching behavior during COVID 19 pandemic: a cross-sectional, cross-national online survey. Psychiatry Res 2020; 289: 113089.CrossRef Google Scholar PubMed

Boursier, V, Musetti, A, Gioia, F, Flayelle, M, Billieux, J, Schimmenti, A. Is watching TV series an adaptive coping strategy during the COVID-19 pandemic? Insights from an Italian community sample. Front Psychiatry 2021; 12: 599859.CrossRef Google Scholar PubMed

Raza, SH, Yousaf, M, Sohail, F, Munawar, R, Ogadimma, EC, Siang, JMLD. Investigating binge-watching adverse mental health outcomes during Covid-19 pandemic: moderating role of screen time for web series using online streaming. Psychol Res Behav Manag 2021; 14: 1615–29.CrossRef Google Scholar PubMed

Sigre-Leirós, V, Billieux, J, Mohr, C, Maurage, P, King, DL, Schimmenti, A, et al. Binge-watching in times of COVID-19: a longitudinal examination of changes in affect and TV series consumption patterns during lockdown. Psychol Pop Media Cult [Epub ahead of print] 2022. Available from: https://doi.org/10.1037/ppm0000390.CrossRef Google Scholar

Zhang, MWB, Tran, BX, Huong, LT, Hinh, ND, Nguyen, HLT, Tho, TD, et al. Internet addiction and sleep quality among Vietnamese youths. Asian J Psychiatr 2017; 28: 15–20.CrossRef Google Scholar PubMed

Tran, BX, Huong, LT, Hinh, ND, Nguyen, LH, Le, BN, Nong, VM, et al. A study on the influence of internet addiction and online interpersonal influences on health-related quality of life in young Vietnamese. BMC Public Health 2017; 17: 138.CrossRef Google Scholar

Ho, RC, Zhang, MWB, Tsang, TY, Toh, AH, Pan, F, Lu, Y, et al. The association between internet addiction and psychiatric co-morbidity: a meta-analysis. BMC Psychiatry 2014; 14: 183.CrossRef Google Scholar PubMed

Watson, D, Clark, LA, Tellegen, A. Development and validation of brief measures of positive and negative affect: the PANAS scales. J Pers Soc Psychol 1988; 54: 1063–70.CrossRef Google Scholar PubMed

Meerkerk, G-J, Van Den Eijnden, RJJM, Vermulst, AA, Garretsen, HFL. The Compulsive Internet Use Scale (CIUS): some psychometric properties. CyberPsychol Behav 2009; 12: 1–6.CrossRef Google Scholar PubMed

de Vries, HT, Nakamae, T, Fukui, K, Denys, D, Narumoto, J. Problematic internet use and psychiatric co-morbidity in a population of Japanese adult psychiatric patients. BMC Psychiatry 2018; 18: 9.CrossRef Google Scholar

Orosz, G, Vallerand, RJ, Bőthe, B, Tóth-Király, I, Paskuj, B. On the correlates of passion for screen-based behaviors: the case of impulsivity and the problematic and non-problematic Facebook use and TV series watching. Pers Individ Differ 2016; 101: 167–76.CrossRef Google Scholar

Exelmans, L, Van den Bulck, J. Binge viewing, sleep, and the role of pre-sleep arousal. J Clin Sleep Med 2017; 13: 1001–8.CrossRef Google Scholar PubMed

Weatherly, JN, Dymond, S, Samuels, L, Austin, JL, Terrell, HK. Validating the gambling functional assessment–revised in a United Kingdom sample. J Gambl Stud 2014; 30: 337.CrossRef Google Scholar

James, RJE, Tunney, RJ. The need for a behavioural analysis of behavioural addictions. Clin Psychol Rev 2017; 52: 69–76.CrossRef Google Scholar PubMed

Andreassen, CS, Griffiths, MD, Hetland, J, Pallesen, S. Development of a work addiction scale. Scand J Psychol 2012; 53: 265–72.CrossRef Google Scholar PubMed

Beaton, DE, Bombardier, C, Guillemin, F, Ferraz, MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine 2000; 25: 3186–91.CrossRef Google Scholar PubMed

Rosseel, Y. lavaan: An R package for structural equation modeling. J Stat Softw 2012; 48: 1–36.CrossRef Google Scholar

Choi, SW, Gibbons, LE, Crane, PK. lordif: An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. J Stat Softw 2011; 39: 1–30.CrossRef Google Scholar

Chalmers, RP. A multidimensional item response theory package for the R environment. J Stat Softw 2012; 48: 1–29.CrossRef Google Scholar

Stewart-Brown, S, Tennant, A, Tennant, R, Platt, S, Parkinson, J, Weich, S. Internal construct validity of the Warwick-Edinburgh Mental Well-Being Scale (WEMWBS): a Rasch analysis using data from the Scottish health education population survey. Health Qual Life Outcomes 2009; 7: 15.CrossRef Google Scholar PubMed

Tennant, R, Hiller, L, Fishwick, R, Platt, S, Joseph, S, Weich, S, et al. The Warwick-Edinburgh Mental Well-Being Scale (WEMWBS): development and UK validation. Health Qual Life Outcomes 2007; 5: 63.CrossRef Google Scholar PubMed

Snyder, E, Cai, B, DeMuro, C, Morrison, MF, Ball, W. A new single-item Sleep Quality Scale: results of psychometric evaluation in patients with chronic primary insomnia and depression. J Clin Sleep Med 2018; 14: 1849–57.CrossRef Google Scholar PubMed

Riddle, K, Peebles, A, Davis, C, Xu, F, Schroeder, E. The addictive potential of television binge watching: comparing intentional and unintentional binges. Psychol Pop Med Cult 2018; 7: 589–604.Google Scholar

Flayelle, M, Canale, N, Vögele, C, Karila, L, Maurage, P, Billieux, J. Assessing binge-watching behaviors: development and validation of the 'watching TV series motives' and 'bingewatching engagement and symptoms' questionnaires. Comput Hum Behav 2019; 90: 26–36.CrossRef Google Scholar

de Feijter, D, Khan, V-J, van Gisbergen, M. Confessions of a ‘guilty’ couch potato Understanding and using context to optimize binge-watching behavior. Proceedings of the ACM International Conference on Interactive Experiences for TV and Online Video (Chicago, IL, USA, 22–24 Jun 2016). Association for Computing Machinery. 2016.CrossRef Google Scholar

Billieux, J, Schimmenti, A, Khazaal, Y, Maurage, P, Heeren, A. Are we overpathologizing everyday life? A tenable blueprint for behavioral addiction research. J Behav Addict 2015; 4: 119–23.CrossRef Google Scholar PubMed

Castro-Calvo, J, King, DL, Stein, DJ, Brand, M, Carmi, L, Chamberlain, SR, et al. Expert appraisal of criteria for assessing gaming disorder: an international Delphi study. Addiction 2021; 116: 2463–75.CrossRef Google Scholar

Flayelle, M, Schimmenti, A, Starcevic, V, Billieux, J. The pitfalls of recycling substance-use disorder criteria to diagnose behavioral addictions. In Evaluating the Brain Disease Model of Addiction (eds Heather, N, Field, M, Moss, A, Satel, S): 339–49. Routledge, 2022.CrossRef Google Scholar

Kardefelt-Winther, D, Heeren, A, Schimmenti, A, van Rooij, AV, Maurage, P, Carras, M, et al. How can we conceptualize behavioural addiction without pathologizing common behaviours? Addiction 2017; 112: 1709–15.CrossRef Google Scholar PubMed

Starcevic, V. Tolerance and withdrawal symptoms may not be helpful to enhance understanding of behavioural addictions. Addiction 2016; 111: 1307–8.CrossRef Google Scholar

Table 1 Descriptive statistics (Study 1; n = 333)

Table 2 Problematic Series Watching Scale, Spearman's rho correlation matrix (Study 1; n = 333)

Fig. 1 Confirmatory factor analysis (Study 1; n1 = 166). PSWS, Problematic Series Watching Scale.

Fig. 2 Item characteristic curves (Study 1; n2 = 167). P(θ), probability endorsing a category option; P1–P5, item response curves for category options 1–5.

Table 3 Graded response model, item discrimination parameters, category thresholds and information function (Study 1; n2 = 167)

Table 4 Descriptive statistics (Study 2; n = 209)

Table 5 Spearman's rho correlation matrix (Study 2; n = 209)

Submit a response

eLetters

No eLetters have been published for this article.

Article contents

Factor structure, reliability and criterion-related validity of the English version of the Problematic Series Watching Scale

Abstract

Keywords

Addictive versus problematic behaviours

Binge and problematic watching during the COVID-19 pandemic

Problematic series watching, internet addiction and mental well-being

Development of an English version of the PSWS

Research aims and hypotheses

Method

Study 1

Participants and procedure

Measures

Statistical analyses

Study 2

Participants and procedure

Additional measures

Statistical analyses

Results

Study 1

Study 2

Discussion

Implications of the results

Critical evaluation of the validity of the PSWS

General limitations

Limitations of the confirmatory factor analytic approach

Data availability

Acknowledgements

Author contributions

Funding

Declaration of interest

References

eLetters

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests