
Estimating prevalence of child and youth mental disorder and mental health-related service contacts: a comparison of survey data and linked administrative health data

Published online by Cambridge University Press:  19 May 2022

L. Duncan*, K. Georgiades, L. Wang, J. Edwards and J. Comeau

Affiliation: Offord Centre for Child Studies, Psychiatry & Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada

*Author for correspondence: L. Duncan, E-mail: [email protected]

Abstract

Aims

Prevalence estimates of child and youth mental disorder and mental health-related service contacts are needed for policy formulation, research, advocacy and resource allocation. Our aim is to compare prevalence estimates of child and youth mental disorder and mental health-related service contacts derived from general population survey data v. linked administrative health data.

Methods

Provincially representative 2014 Ontario Child Health Study data were linked to administrative health records for 5563 children and youth aged 4–17 in Ontario. Emotional disorders (mood and anxiety) and attention-deficit/hyperactivity disorder were assessed using a standardised diagnostic interview in the survey and using diagnostic codes in administrative health data. Physician-based mental health-related service contacts were assessed using parent reports in the survey and mental health-related diagnostic codes in the administrative data. Prevalence estimates were calculated and compared based on one-sample z-tests and ratios of survey-based to administrative data-based prevalence. Sensitivity, specificity and agreement between classifications were compared using κ. Prevalence estimates were calculated by age, sex and geography sub-groups, and consistent group differences across data sources were counted.

Results

Disorder prevalence and service contact estimates were significantly higher in survey data in all cases, except for mood disorder. Ratios of survey data to administrative data-based prevalence varied, ranging from 0.80 (mood) to 11.01 (attention-deficit/hyperactivity disorder). Specificity was high (0.98–1.00), sensitivity was low (0.07–0.41) and agreement ranged from slight (κ = 0.13) to moderate (κ = 0.46). Out of 18 sub-group difference comparisons, half were non-significant in either data source. In the remaining nine comparisons, the only significant differences between groups that were consistent across data source were for sex-based differences (attention-deficit/hyperactivity disorder and service contacts). There were no consistent age- or geography-based differences in prevalence across data sources.

Conclusions

Our findings suggest that conclusions drawn about prevalence, service contacts and sub-group differences in these estimates are dependent on data source. Further research is needed to understand who and what is being captured by each source. Researchers should conduct data linkage where possible to access and compare multiple sources of information.

Type
Original Article
Creative Commons
Creative Commons Licence - CC BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press

Introduction

Mental disorders affect one in five children and youth worldwide (Polanczyk et al., Reference Polanczyk, Salum and Sugaya2015) and lead to individual and social burdens (Waddell et al., Reference Waddell, Schwartz, Andres, Barican and Yung2018), and adverse outcomes (Erskine et al., Reference Erskine, Norman, Ferrari, Chan, Copeland, Whiteford and Scott2016; Ploubidis et al., Reference Ploubidis, Batty, Patalay, Bann and Goodman2021). Documenting the prevalence of mental disorder among children (age 4–11) and youth (age 12–17) and the extent to which children/youth are in contact with mental health services is critical if we are to monitor the size of the problem and design effective policies to address need.

To inform equitable policy, accurate and timely estimates are needed of the number of children/youth: (1) living with a mental disorder (known prevalence); (2) in contact with mental health services (known service contacts); and (3) living with a disorder who are not in contact with mental health services (unmet need). Examining differences across population sub-groups and geographic areas reveals important patterns of disorder and service contacts (Kessler et al., Reference Kessler, Avenevoli and Costello2012; Costello et al., Reference Costello, He and Sampson2014; Cairney et al., Reference Cairney, Gandhi, Guttmann, Iron, Khan, Kurdyak, Lam and Julie Yang2015; Georgiades et al., Reference Georgiades, Duncan, Wang, Comeau and Boyle2019). This information provides critical evidence for examining patterns and correlates of mental health need, temporal trends, economic analysis of service costs and the individual/social/economic impact of mental disorders.

General population epidemiological surveys and administrative health records are used to estimate population-level disorder prevalence and mental health-related service contacts. Population health surveys (herein survey data) include standardised measurement of common mental disorders and random sampling approaches to produce representative estimates. They contain detailed demographic information and include both service users and non-users, enabling more precise estimation of population prevalence (Merikangas et al., Reference Merikangas, He and Burstein2010; Pitchforth et al., Reference Pitchforth, Fahy, Ford, Wolpert, Viner and Hargreaves2019). However, large-scale surveys are expensive to implement, experience dwindling response rates (Luiten et al., Reference Luiten, Hox and de Leeuw2020) and are not conducted with the same frequency or geographic coverage as administrative data (Boyle et al., Reference Boyle, Georgiades, Duncan, Comeau and Wang2019).

Administrative health data (herein administrative data) are used for population monitoring/surveillance, despite not being collected for this purpose (Birkhead et al., Reference Birkhead, Klompas and Shah2015). Collected routinely in Canadian publicly funded health systems and as part of mandated record-keeping processes, administrative data include all individuals who have been in contact with physician-based health services and comprehensive coverage of number and type of diagnoses – including rare conditions often excluded from survey data. Limitations include (a) no standardised and validated approach to classifying individuals with disorder (Hinds et al., Reference Hinds, Lix, Smith, Quan and Sanmartin2016); (b) availability of age, sex and postal code only as demographic characteristics; and (c) disorder classification conditional on service contact (i.e. an individual requires an administrative database record for a diagnostic code to be present) (Gandhi et al., Reference Gandhi, Chiu, Lam, Cairney, Guttmann and Kurdyak2016).

Attempts to validate the use of administrative data to estimate adult mental disorder across jurisdictions have concluded that it is suitable for surveillance, but has limitations (Kisely et al., Reference Kisely, Lin, Gilbert, Smith, Campbell and Vasiliadis2009a, Reference Kisely, Lin, Lesage, Gilbert, Smith, Campbell and Vasiliadis2009b; Doktorchik et al., Reference Doktorchik, Patten, Eastwood, Peng, Chen, Beck, Jetté, Williamson and Quan2019). Studies comparing mood and anxiety disorder prevalence in administrative v. survey data found differences in prevalence and low concordance (O'Donnell et al., Reference O'Donnell, Vanderloo, McRae, Onysko, Patten and Pelletier2016; Edwards et al., Reference Edwards, Thind, Stranges, Chiu and Anderson2020). Purported reasons for differences were that disorders were captured differently depending on the stage of illness and treatment. Other work has found large discrepancies between mental health service contacts from administrative data and both self-reported contacts (Drapeau et al., Reference Drapeau, Boyer and Diallo2011; Palin et al., Reference Palin, Goldner, Koehoorn and Hertzman2011) and chart abstractions (Steele et al., Reference Steele, Glazier, Lin and Evans2004) – pointing to social desirability and recall bias as possible reasons for low levels of agreement. No studies have considered whether between-group differences are consistent, even if overall prevalence estimates differ between administrative and survey data.

This study is the first to compare child/youth mental disorder prevalence and physician-based mental health-related service contact estimates from survey and administrative data. By describing and documenting the level of agreement between these data sources, this work is a first step and a pre-requisite to future work aimed at making recommendations about the appropriate use of different data sources for specific purposes. Study objectives are to evaluate differences in overall prevalence and sub-group estimates of individual and grouped classifications of disorder, and mental health-related service contacts. We address the following questions: (1) Are disorder prevalence and service contact estimates different by data source? (2) What is the level of agreement between classifications of disorder and service contacts? (3) Are patterns of population sub-group estimates based on age, sex and geography consistent? The information generated will help determine how our conclusions may differ depending on data source and the potential usefulness of the respective data in estimating prevalence of child/youth mental disorder and mental health-related service contacts.

Method

Data

This study uses provincially representative, cross-sectional 2014 Ontario Child Health Study (OCHS) data (Statistics Canada, 2017a) linked, in 2018, to Ontario Ministry of Health (MOH; formerly MOHLTC) administrative data. Using the 2014 Canadian Child Tax Benefit (CCTB) file as the sampling frame, households were selected based on a complex cluster sample of areas stratified by urban/rural residency and income. The CCTB is a tax system-based family benefit, for which all parents are assessed, creating a roster of all families with children under 18. Data were collected during home visits in 2015 by trained Statistics Canada interviewers. Survey design, content and data collection details are available elsewhere (Boyle et al., Reference Boyle, Georgiades, Duncan, Comeau and Wang2019). Interviewers informed parents that survey data would be combined with administrative data and, with consent, collected a provincial health number to assist with the linkage. Parents were asked for agreement to share their data with the Ontario MOH. A probability sample of 6537 households (50.8% response) participated, with 10 802 children/youth aged 4–17. Of those, 89.5% (n = 9666) agreed to share and 96.2% of those were linked to administrative data (n = 9301) (sample selection in Fig. 1). Statistics Canada recalculated linked sample survey weights and shared data, weights and health card numbers with the MOH, which extracted the administrative data and provided both administrative and survey data to the research team, who conducted the linkage.

Fig. 1. Diagram of sample selection.

Ontario administrative health records pertain to inpatient, outpatient and other physician services covered by the Ontario Health Insurance Plan (OHIP) which provides nearly universal coverage of Ontario residents (>96%) (Government of Ontario, 2018). Data from 2004 onwards were extracted from the Claims History Database (CHD: physician services billings), Discharge Abstract Database (DAD: inpatient services) and National Ambulatory Care Reporting System (NACRS: day procedures and outpatient services) and combined with the Registered Persons and Client Agency Program Enrolment Databases to determine individual eligibility for OHIP services.

Measures

Mental health disorder

The survey uses the DSM-IV-TR (APA, 2000) diagnostic classification system, while administrative data use the 10th revision of the International Statistical Classification of Diseases and Related Health Problems [ICD10; World Health Organization (WHO), 2004] to classify mental disorders, except for CHD data where the 9th revision (ICD9; WHO, 1978) was used. The diagnostic specifications for child/youth mental disorders were largely unchanged between ICD9 and ICD10. Disorders were selected by aligning groupings across data sources. Given extremely low prevalence in the administrative data, behavioural disorders were dropped. (Only conduct disorder appears in ICD10 and cases were insufficient to meet Statistics Canada disclosure rules.) The final disorder selection includes emotional disorders (consisting of mood and anxiety disorders), attention-deficit/hyperactivity disorder and a combined group of emotional and attention-deficit/hyperactivity disorders.
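As a rough illustration of how such groupings can be aligned in practice, the short Python sketch below maps each analysed group to example MINI-KID modules and ICD9/ICD10 code prefixes. The module names and code lists are illustrative placeholders only and are not the study's case definitions, which follow Cairney et al. (2015) and online Supplementary Table A1.

# Illustrative alignment of disorder groupings across classification systems.
# The ICD9/ICD10 prefixes are common textbook examples, not the exact case
# definitions used in the study.
DISORDER_GROUPS = {
    "mood": {
        "survey_modules": ["major_depressive_episode"],           # MINI-KID (DSM-IV-TR)
        "icd10_prefixes": ["F30", "F31", "F32", "F33", "F34"],    # mood (affective) disorders
        "icd9_prefixes": ["296", "311"],
    },
    "anxiety": {
        "survey_modules": ["generalised_anxiety", "separation_anxiety",
                           "social_phobia", "specific_phobia"],
        "icd10_prefixes": ["F40", "F41", "F93"],
        "icd9_prefixes": ["300"],
    },
    "adhd": {
        "survey_modules": ["adhd"],
        "icd10_prefixes": ["F90"],                                 # hyperkinetic disorders
        "icd9_prefixes": ["314"],
    },
}
# The grouped categories are unions of the individual groups.
DISORDER_GROUPS["emotional"] = {
    key: DISORDER_GROUPS["mood"][key] + DISORDER_GROUPS["anxiety"][key]
    for key in ("survey_modules", "icd10_prefixes", "icd9_prefixes")
}

def code_in_group(icd_code, group, revision=10):
    """Return True if an ICD code falls in a disorder group (prefix match)."""
    prefixes = DISORDER_GROUPS[group]["icd%d_prefixes" % revision]
    return any(icd_code.upper().startswith(p) for p in prefixes)

print(code_in_group("F32.2", "emotional"))            # True: depressive episode
print(code_in_group("314.01", "adhd", revision=9))    # True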

Survey data

In families with two or more eligible children, one randomly ‘selected child’ (SC) was assessed on all survey components, including a structured diagnostic interview. The Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID) was administered to parents about the SC by trained interviewers to assess disorders in the past 6 months. MINI-KID classifications demonstrate convergent and discriminant construct validity and adequate test-retest reliability across disorders, informants and samples (Sheehan et al., Reference Sheehan, Sheehan, Shytle, Janavs, Bannon, Rogers, Milo, Stock and Wilkinson2010; Duncan et al., Reference Duncan, Georgiades, Wang, Van Lieshout, MacMillan, Ferro, Lipman, Szatmari, Bennett, Kata, Janus and Boyle2018). Disorders included mood (major depressive episode), anxiety (generalised anxiety, separation anxiety, social phobia and specific phobia) and attention-deficit/hyperactivity disorder.

Administrative data

Mental disorders were classified according to presence of the corresponding ICD9/10 diagnostic code for an individual in any of the administrative data files in the 6 months preceding the study interview date. Diagnostic codes were identified for mood, depressive, anxiety, emotional and hyperkinetic disorders using case definitions identified by Cairney et al. (Reference Cairney, Gandhi, Guttmann, Iron, Khan, Kurdyak, Lam and Julie Yang2015) following code selections (online Supplementary Table A1) that have shown adequate specificity (97%) and sensitivity (81%) in adults (Steele et al., Reference Steele, Glazier, Lin and Evans2004). The DAD and NACRS identify fewer disorders than the CHD but were included as eligible sources of disorder identification.
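A minimal sketch of this classification rule is shown below, assuming a long-format table of administrative records and a table of interview dates. The column names and the single illustrative code prefix are hypothetical; the real code selections are those in online Supplementary Table A1.

import pandas as pd

# Flag a child/youth for a disorder group if any administrative record (CHD, DAD or
# NACRS) carries a qualifying diagnostic code in the 6 months before the interview date.
ADHD_PREFIXES = ("314",)  # illustrative ICD9 prefix only

records = pd.DataFrame({
    "person_id": [1, 1, 2, 3],
    "service_date": pd.to_datetime(["2015-01-10", "2014-03-02", "2015-04-20", "2015-05-01"]),
    "dx_code": ["314.01", "314.01", "300.00", "314.00"],
})
interviews = pd.DataFrame({
    "person_id": [1, 2, 3],
    "interview_date": pd.to_datetime(["2015-06-01", "2015-06-15", "2015-07-01"]),
})

merged = records.merge(interviews, on="person_id")
in_window = (
    (merged["service_date"] <= merged["interview_date"])
    & (merged["service_date"] >= merged["interview_date"] - pd.DateOffset(months=6))
)
has_code = merged["dx_code"].apply(lambda code: code.startswith(ADHD_PREFIXES))

flagged = merged.loc[in_window & has_code, "person_id"].unique()
interviews["admin_adhd"] = interviews["person_id"].isin(flagged).astype(int)
print(interviews)  # persons 1 and 3 are classified, person 2 is not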

Mental health-related service contacts

Survey data

Service contacts were coded following procedures used by Georgiades et al. (Reference Georgiades, Duncan, Wang, Comeau and Boyle2019) to identify physician- and physician office-based services, as this is the only type of service captured in administrative data. Non-physician-based community and school-based services were excluded. Parents were asked whether they had visited different providers and settings for their child/youth's mental health concerns in the past 6 months. Providers included family doctors, paediatricians, other regular health care providers, other types of physicians or specialists, nurses, psychiatrists or other health professionals. Settings included specialised mental health or addictions agencies funded by MOH, walk-in clinics, urgent care facilities and hospital emergency rooms. An indicator of any physician-based mental health-related service contact included endorsement of contact with at least one provider or in at least one service setting.
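The indicator itself is a simple logical OR over the endorsed items. A sketch with hypothetical item names (stand-ins for the OCHS survey variables) is given below.

import pandas as pd

# Any physician-based mental health-related service contact: endorsement of at least
# one eligible provider OR at least one eligible setting in the past 6 months.
provider_items = ["contact_family_doctor", "contact_paediatrician", "contact_psychiatrist"]
setting_items = ["setting_walk_in_clinic", "setting_emergency_room", "setting_mh_agency"]

survey = pd.DataFrame({
    "person_id": [1, 2, 3],
    "contact_family_doctor": [1, 0, 0],
    "contact_paediatrician": [0, 0, 0],
    "contact_psychiatrist": [0, 0, 0],
    "setting_walk_in_clinic": [0, 0, 0],
    "setting_emergency_room": [0, 1, 0],
    "setting_mh_agency": [0, 0, 0],
})

survey["any_physician_contact"] = (
    survey[provider_items + setting_items].any(axis=1).astype(int)
)
print(survey[["person_id", "any_physician_contact"]])  # persons 1 and 2 flagged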

Administrative data

Mental health-related service contacts were identified as any entry in the administrative databases associated with a mental health-related diagnostic classification in the 6 months prior to the child/youth interview date. Diagnostic classifications included disorders beyond emotional and attention disorders as survey-based service contact questions asked about ‘problems children might have with their emotions, attention or behaviour. For adolescents these problems might also include use of alcohol or drugs’. Diagnostic ICD10 classifications included substance-related disorders, schizophrenia, neurodevelopmental and personality disorders, self-harm and eating disorders, and for ICD9 included psychotic, non-psychotic, substance use disorder, social problems and a set of other mental health-related codes that are appropriate when looking at child/youth mental health-related concerns (Cairney et al., Reference Cairney, Gandhi, Guttmann, Iron, Khan, Kurdyak, Lam and Julie Yang2015; Amartey et al., Reference Amartey, Chiu, Gatov, Guttmann, Lebenbaum, Kurdyak, Saunders, Vigod and Yang2017).

Sub-group definitions

We grouped the sample into child (age 4–11) v. youth (age 12–17) and male v. female based on standard Statistics Canada questions administered to the parent about child/youth age and sex. Calculated based on population size and density, urban–rural residency is a three-category variable of large urban (100 000+), small-medium urban (1000–99 999) and rural residency (<1000 or <400 residents per km2) (Statistics Canada, 2017b).
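A simplified stand-in for this three-category variable is sketched below. The thresholds follow the text, but the actual assignment uses the Statistics Canada Population Centre and Rural Area Classification applied to respondents' geography, so this function is an approximation for illustration only.

def urban_rural_category(population, density_per_km2):
    """Approximate three-category residency classification used in the study."""
    if population >= 100_000 and density_per_km2 >= 400:
        return "large urban"
    if population >= 1_000 and density_per_km2 >= 400:
        return "small-medium urban"
    return "rural"  # population < 1000 or density < 400 residents per km2

print(urban_rural_category(2_700_000, 4_300))  # large urban
print(urban_rural_category(15_000, 900))       # small-medium urban
print(urban_rural_category(350, 12))           # rural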

Analysis

To address question 1, we calculated the prevalence in survey and administrative data of individual and grouped mental disorders, and service contacts. One-sample z-tests for differences in the estimates were conducted. Simple ratios (not odds ratios) were calculated to quantify the size of the differences between estimates, along with bootstrap standard errors and confidence intervals to measure precision. To address question 2, we calculated sensitivity, specificity and agreement between survey and administrative data classifications using Cohen's κ coefficient (Cohen, Reference Cohen1960). This is a chance-corrected agreement measure that indicates agreement is slight (κ = 0.01–0.20), fair (κ = 0.21–0.40), moderate (κ = 0.41–0.60) and substantial or excellent (κ = 0.61–0.80) (Shrout, Reference Shrout1998). Although sensitivity/specificity usually refer to accuracy in comparison to a gold standard, in our analysis there is none. Sensitivity is the likelihood of a child/youth being classified with disorder/service contact in both the survey and administrative data. Specificity is the likelihood of a child/youth not being classified in either the survey or the administrative data. To address question 3, we evaluated whether group differences were consistent across data sources, even though overall prevalence might be different. To do this, we calculated prevalence estimates by sub-group and conducted z-tests for group differences by age and sex. For geography, joint χ² tests were used to identify statistically significant group differences in two-way comparisons. We counted instances where group differences were consistent across data sources. Significant differences were flagged at the p < 0.05 level as the least conservative threshold by which group differences are identified.
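The sketch below shows these calculations on toy data: the one-sample z-test comparing the survey estimate against the administrative estimate, the simple ratio, sensitivity and specificity computed with the survey classification treated as the reference (one reading of the definitions above), and Cohen's κ. It is unweighted and ignores the survey design and bootstrap weights, so it is illustrative only.

import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n = 5000
survey = rng.binomial(1, 0.10, n)             # toy survey-based classification
admin = np.where(survey == 1,
                 rng.binomial(1, 0.30, n),    # imperfect overlap with admin classification
                 rng.binomial(1, 0.02, n))

p_survey, p_admin = survey.mean(), admin.mean()

# One-sample z-test of the survey prevalence against the administrative prevalence.
se = np.sqrt(p_survey * (1 - p_survey) / n)
z = (p_survey - p_admin) / se
p_value = 2 * stats.norm.sf(abs(z))

ratio = p_survey / p_admin                    # simple ratio, not an odds ratio

# Agreement measures from the 2x2 cross-classification.
both = np.mean((survey == 1) & (admin == 1))
neither = np.mean((survey == 0) & (admin == 0))
sensitivity = both / p_survey                 # classified in both, given a survey classification
specificity = neither / (1 - p_survey)        # classified in neither, given no survey classification
p_obs = both + neither
p_exp = p_survey * p_admin + (1 - p_survey) * (1 - p_admin)
kappa = (p_obs - p_exp) / (1 - p_exp)         # Cohen's chance-corrected agreement

print(f"survey={p_survey:.3f} admin={p_admin:.3f} z={z:.2f} p={p_value:.3g} "
      f"ratio={ratio:.2f} sens={sensitivity:.2f} spec={specificity:.2f} kappa={kappa:.2f}")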

Sample for analysis

Analysis was conducted in the representative sample of SCs who were assessed using the MINI-KID (n = 5563) linked to administrative data. A complete case analysis was conducted to avoid masking any impact of missing data – a practical consideration when evaluating data source comparability. Sample loss was less than 1% across the linked dataset. Using the sampling weights developed by Statistics Canada adjusted for the likelihood of being linked, population prevalence estimates were calculated. To account for the complex survey design, mean bootstrap weights were applied to produce accurate standard errors.
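A minimal sketch of design-based estimation with replicate weights is given below; the weights are simulated for illustration and are not the Statistics Canada linkage-adjusted weights.

import numpy as np

rng = np.random.default_rng(1)
n, n_reps = 5563, 500
y = rng.binomial(1, 0.08, n)                            # toy disorder indicator
w = rng.uniform(0.5, 2.0, n)                            # toy main survey weights
rep_w = w[:, None] * rng.exponential(1.0, (n, n_reps))  # toy bootstrap replicate weights

# The point estimate uses the main weight; the standard error is the spread of the
# estimate across replicate weights.
point = np.average(y, weights=w)
rep_estimates = (rep_w * y[:, None]).sum(axis=0) / rep_w.sum(axis=0)
se = rep_estimates.std(ddof=1)

print(f"weighted prevalence = {point:.3f} (bootstrap s.e. = {se:.3f})")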

Results

Table 1 presents sample characteristics for the shared, linked sample of SCs. There were no differences in characteristics between the full OCHS sample, linked sample and the subsample of SCs (not shown).

Table 1. Sample characteristics of linked sample

s.e., standard error.

Table 2 shows estimated prevalence of emotional (mood and anxiety), attention-deficit/hyperactivity disorder and mental health-related service contacts based on survey and administrative data classifications, the results of one-sample z-tests and ratios representing how much larger the prevalence estimate is based on survey v. administrative data – with values closer to 1 representing prevalence estimates closer in size. Disorder prevalence and contacts are significantly higher in survey data, except for mood disorder, which is slightly, but significantly, lower (2.87 v. 3.57%). Ratios are highly variable; the prevalence of emotional disorders is 2.16 times larger based on survey data, whereas the prevalence of attention-deficit/hyperactivity disorder is over 11 times larger. For mood disorder, prevalence was 20% lower in survey data, although the 95% confidence interval for the ratio contains 1, indicating this difference may be due to random chance.

Table 2. Six-month prevalence of DSM-IV-TR disorders and service contacts based on 2014 OCHS survey data (survey) and administrative health data (admin)

s.e., standard error.

Table 3 presents sensitivity (probability of being classified in both the survey and administrative data), specificity (probability of not being classified in either survey data or administrative data) and κ coefficients for agreement between data sources. Specificity is high, ranging from 0.98 to 1.00, and sensitivity is low, ranging from 0.07 to 0.41. According to our criteria, agreement ranges from slight (κ = 0.13 for attention-deficit/hyperactivity disorder) to moderate (κ = 0.46 for service contacts) and average agreement is fair (κ = 0.23).

Table 3. Concordance between individual disorder classifications and service contacts

s.e., standard error.

Table 4 presents prevalence estimates by sub-group in survey and administrative data. The larger estimate in statistically significant sub-group comparisons is bolded (age, sex) or flagged (geography). For half of the 18 comparisons, there were no significant group differences in either data source. In the remaining nine comparisons, there were consistent significant group differences in two comparisons – both sex-based (attention-deficit/hyperactivity disorder and service contacts). In these cases, prevalence estimates were two to ten times larger in survey data. There were no consistent age- or geography-based differences in prevalence. Age differences were found for the prevalence of mood and attention-deficit/hyperactivity disorder in survey data, and for anxiety disorder in administrative data. Sex differences were found for mood and emotional + attention-deficit/hyperactivity disorder in administrative data. Differences in prevalence between large urban and small-medium urban geography were identified in survey data for emotional and anxiety disorder.

Table 4. Six-month prevalence of DSM-IV-TR disorders by age group and sex based on 2014 OCHS survey data (survey) and administrative health data (admin)

LU, large urban centre; SMU, small-medium urban centre; R, rural.

Note: bolded estimates represent the larger estimate in z-tests for statistical differences at p < 0.05; i = SMU v. R χ² test significant at p < 0.05.

Discussion

We found differences in disorder prevalence and prevalence patterns across sub-groups, along with slight to fair agreement between individual classifications by data source. Estimates are consistently higher in survey data, suggesting that administrative data underestimate mental health need in the population. The magnitude of the differences between estimates from the two data sources is inconsistent, making it difficult to draw conclusions about potential systematic bias in survey or administrative data or to make recommendations about the possibility of administrative data replacing surveys for certain purposes. More evidence is needed, as ours is the first such study to date.

Except for mood disorder, overall prevalence estimates were higher in survey data, but the magnitude of the difference varies widely. The higher survey estimates are not surprising given that a selection mechanism for inclusion in administrative data is contact with physician-based services – an important limitation that should be clearly stated and reiterated in child/youth psychiatric epidemiological analyses using only administrative data. Although 70% of children/youth with mental health agency service contacts also report physician-based service contacts (Schraeder et al., Reference Schraeder, Barwick, Cairney, Carter, Kurdyak, Neufeld, Stewart, St. Pierre, Tobon, Vingilis, Zaric and Reid2021), some children/youth with disorders are only seen by community or school-based mental health services, and approximately half are not seen by any sector (Georgiades et al., Reference Georgiades, Duncan, Wang, Comeau and Boyle2019). These children/youth are not captured in administrative data, which could explain the discordance. However, in a sensitivity analysis where survey-based prevalence was calculated conditional on reports of a physician-based service contact (see online Supplementary Table A2), estimates were slightly lower (emotional + attention-deficit/hyperactivity: by 3.0%, emotional: 2.1%, mood: 0.2%, anxiety: 2.1%, attention: 0.9%) but, overall, results were consistent, showing significantly higher estimates in survey data, with the exception of mood disorder. This suggests that the exclusion of disorders counted among children/youth in contact with school or community-based mental health services only partially contributes to observed differences. Other reasons likely include the purpose, process and accuracy of disorder identification, discussed below.

Differences were smaller for disorder groupings and larger for specific disorders, even though common disorders were selected across datasets. This is likely due to different assessment approaches (standardised interview v. clinical diagnosis for administrative purposes) and because prevalence estimates will be more similar and more likely to agree when disorders are grouped together – for example, mood and anxiety disorders grouped under emotional disorders.

Prevalence for mood disorder was similar across survey and administrative data (2.9 v. 3.6%). However, agreement in individual mood disorder classifications was still low (κ = 0.16), which is similar to findings of discordance in adult mood/anxiety disorders between individuals classified by survey v. administrative data (Edwards et al., Reference Edwards, Thind, Stranges, Chiu and Anderson2020; O'Donnell et al., Reference O'Donnell, Vanderloo, McRae, Onysko, Patten and Pelletier2016). These studies did find differences in prevalence but did not report disorder-specific estimates, so it is unknown whether our findings for mood disorder prevalence are consistent with theirs. We encourage researchers to report disorder-specific results where possible.

Agreement between disorder classifications was highly variable, with higher agreement observed for grouped disorders. Service contacts demonstrated the highest level of agreement, suggesting they are better captured in both sources than mental disorder. The low prevalence of disorder and service contacts (<17%) reduces the likelihood of individual agreement (Byrt et al., Reference Byrt, Bishop and Carlin1993), but agreement was still only fair when disorders were grouped and selection was harmonised as far as possible. This raises questions about how well these classifications discriminate between disorder types. Findings of low agreement and lack of disorder specificity align with previous evidence examining adult mental disorders (Kisely et al., Reference Kisely, Lin, Gilbert, Smith, Campbell and Vasiliadis2009a; Edwards et al., Reference Edwards, Thind, Stranges, Chiu and Anderson2020) and chronic illness (Fortin et al., Reference Fortin, Haggerty, Sanche and Almirall2017) that point to issues of recording accuracy and completeness of health records in administrative data, or to the possibility that data sources are capturing people at different stages of illness and treatment. Survey data could capture children/youth suffering from clinical levels of symptom severity whose parents have not recognised their need or started seeking physician-based services. Similarly, administrative data could capture children/youth who are being treated and, as a result, may report symptoms at sub-clinical levels in survey data.

Even in the presence of prevalence differences and low agreement, we examined consistency in sub-group differences in prevalence based on child/youth age, sex and geography across data sources. Only two sex-based group differences were consistent – both attention-deficit/hyperactivity disorder and service contacts were higher in males than females. There were no consistent age- or geography-based differences, even though there were sometimes differences in one data source. This suggests that conclusions drawn about key correlates of mental disorder and service contacts will be different and inconsistent within and across groups, depending on data source. This has implications for users of both survey and administrative data who have objectives related to examining between-group differences. Unlike previous studies comparing administrative data to survey self-report (Muggah et al., Reference Muggah, Graves, Bennett and Manuel2013; Payette et al., Reference Payette, de Moura, Boileau, Bernatsky and Noisel2020), our survey-based disorder classifications are based on a validated, interviewer-administered, standardised diagnostic interview that has demonstrated validity and reliability in general population samples of children/youth. The prevalence estimates produced are consistent with estimates from studies elsewhere (Georgiades et al., Reference Georgiades, Duncan, Wang, Comeau and Boyle2019). Administrative data are subject to reporting bias and inaccuracy in psychiatric diagnoses (Davis et al., Reference Davis, Sudlow and Hotopf2016). Previous adult depression case definition validation efforts have produced suboptimal to moderate levels of sensitivity (Fiest et al., Reference Fiest, Jette, Quan, St Germaine-Smith, Metcalfe, Patten and Beck2014; Doktorchik et al., Reference Doktorchik, Patten, Eastwood, Peng, Chen, Beck, Jetté, Williamson and Quan2019). It is unknown whether case definition validation for child/youth mental disorders would produce more accurate estimates – further work is needed. In contrast, we relied on self-reported use of health services, which is notoriously difficult to capture via self-report questionnaires (Reid et al., Reference Reid, Tobon and Shanley2008). Extensive survey-based questions were asked about mental health-related services, but the extent to which these questions capture the same contacts as administrative data is likely limited (Rhodes et al., Reference Rhodes, Lin and Mustard2002). We made every effort to reconcile differences between data sources in disorder selection and focused service contacts on those that were physician-based. Nonetheless, differences remained and were greater for mental disorder than for service contacts.

In the absence of evidence about data source reliability, some researchers are calling for the development of approaches to synthesise disorder classifications across different data sources as a way of improving estimation (Edwards et al., Reference Edwards, Pananos, Thind, Stranges, Chiu and Anderson2021; Vigo et al., Reference Vigo, Jones, Dove, Maidana, Tallon, Small and Samji2021). This could be more appropriate for mental disorders than chronic conditions given unique challenges associated with case definitions. It will be important to determine the strengths of these combined approaches and whether they can be used to examine between-group differences in prevalence in addition to simple prevalence estimation. Linkage of Statistics Canada population surveys like the Canadian Community Health Survey and the 2014 OCHS to administrative data provides opportunities to improve our understanding of the reasons for differences between data sources for certain research objectives and make recommendations about the types of questions that can be answered with these data. Research funder-stipulated data linkage of publicly funded surveys would increase the availability of linked databases, and ongoing linkages are needed to investigate the impact of changes to health services (e.g. changes to virtual services) and diagnosis coding practices on data source difference. Based on our findings, we recommend: (1) focusing on high-level disorder groupings in prevalence estimation; (2) conducting data linkage where possible to access and compare alternate sources of information; and (3) clear reporting of the limited generalisability of administrative data due to disorder inclusion being conditional on contact with physician-based services.

This study is the first to compare prevalence estimates of, and agreement between, child/youth mental disorder classifications and service contacts in general population survey data v. linked administrative health data. Study strengths include using a large, linked sample of children/youth within the same health jurisdiction and having access to all available health records. The study also has limitations. First, it is restricted to Ontario, and we know that there is provincial variability in the content and completeness of diagnostic coding (Doyle et al., Reference Doyle, Lix, Hemmelgarn, Paterson and Renoux2020). Replication is needed to determine the generalisability of our findings. Second, the burden of mental disorder includes symptom severity and complexity, and their impact on the child's functioning in daily life and with family/peers (Eaton et al., Reference Eaton, Martins, Nestadt, Bienvenu, Clarke and Alexandre2008). These factors could determine whether a family seeks health services, and surveys may identify individuals whose symptoms are not severely affecting their daily lives. This issue is partially addressed in the current study as the diagnostic interview incorporates questions about the impact of mental health problems into the disorder classification. Nonetheless, more work is needed to determine the mechanisms by which children/youth are identified in each data source. Finally, evidence shows disorder prevalence differences depending on informant (Georgiades et al., Reference Georgiades, Duncan, Wang, Comeau and Boyle2019). This study relies on parent informants in survey data as this allowed for a comparison across an age span of 4–17 years. Agreement between youth-reported survey data and administrative data should also be examined.

Conclusion

This study compares survey and administrative data sources for estimating child/youth mental disorder prevalence and physician-based mental health-related service contacts – important estimates for policy formulation, research, advocacy and resource allocation. Except for mood disorder, estimates were consistently higher in survey data, and we found variable differences in the prevalence of child/youth mental disorder and service contacts and in age-, sex- and geography-based patterns of prevalence. Individual agreement was, on average, fair and also variable. Our findings highlight the need for users of both survey and administrative data to contextualise their understanding of prevalence of child/youth mental disorder and service contacts based on the data source being used. Further research is needed to understand who and what is being captured by each source. Researchers should conduct data linkage where possible to access and compare multiple sources of information.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S204579602200018X.

Data

Data access to the 2014 Ontario Child Health Study is available through Statistics Canada Research Data Centres. Data access to the linked dataset will not be shared due to data sharing agreements with Statistics Canada and the Ministry of Health in Ontario.

Acknowledgements

The authors would like to acknowledge Theodora Bogdan and Raha Mahmoudi for technical assistance with editing and formatting the manuscript.

Financial support

This study was supported by research operating grant 125941 from the Canadian Institutes of Health Research (CIHR); Health Services Research Grant 8-42298 from the Ontario Ministry of Health and Long-Term Care (MOHLTC); and funding from MOHLTC, the Ontario Ministry of Children and Youth Services, and the Ontario Ministry of Education. L.D. is supported by a Research Early Career Award from Hamilton Health Sciences Foundation, K.G. by the David R. (Dan) Offord Chair in Child Studies, and J. E. is supported by a CIHR Postdoctoral Fellowship.

Conflict of interest

None.

Ethical standards

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.

References

Amartey, A, Chiu, M, Gatov, E, Guttmann, A, Lebenbaum, M, Kurdyak, P, Saunders, N, Vigod, S and Yang, J (2017) The Mental Health of Children and Youth in Ontario: 2017 Scorecard. Toronto, ON: ICES. Available at https://www.ices.on.ca/Publications/Atlases-and-Reports/2017/MHASEF (Accessed 21 April 2021).
American Psychiatric Association (2000) Diagnostic and Statistical Manual of Mental Disorders, 4th edn: DSM-IV. Washington, DC: American Psychiatric Association.
Birkhead, GS, Klompas, M and Shah, NR (2015) Uses of electronic health records for public health surveillance to advance public health. Annual Review of Public Health 36, 345–359.
Boyle, MH, Georgiades, K, Duncan, L, Comeau, J, Wang, L and the 2014 Ontario Child Health Study Team (2019) The 2014 Ontario child health study – methodology. Canadian Journal of Psychiatry 64, 237–245.
Byrt, T, Bishop, J and Carlin, JB (1993) Bias, prevalence and kappa. Journal of Clinical Epidemiology 46, 423–429.
Cairney, J, Gandhi, S, Guttmann, A, Iron, K, Khan, S, Kurdyak, P, Lam, K and Julie Yang, J (2015) The Mental Health of Children and Youth in Ontario: A Baseline Scorecard. Toronto, ON: ICES. Available at https://www.ices.on.ca/Publications/Atlases-and-Reports/2015/Mental-Health-of-Children-and-Youth (Accessed 21 April 2021).
Cohen, J (1960) A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37–46.
Costello, EJ, He, J and Sampson, NA (2014) Services for adolescents with psychiatric disorders: 12-month data from the National Comorbidity Survey–Adolescent. Psychiatric Services 65, 359–366.
Davis, KA, Sudlow, CL and Hotopf, M (2016) Can mental health diagnoses in administrative data be used for research? A systematic review of the accuracy of routinely collected diagnoses. BMC Psychiatry 16, 111.
Doktorchik, C, Patten, S, Eastwood, C, Peng, M, Chen, G, Beck, CA, Jetté, N, Williamson, T and Quan, H (2019) Validation of a case definition for depression in administrative data against primary chart data as a reference standard. BMC Psychiatry 19, 9.
Doyle, CM, Lix, LM, Hemmelgarn, BR, Paterson, JM and Renoux, C (2020) Data variability across Canadian administrative health databases: differences in content, coding, and completeness. Pharmacoepidemiology and Drug Safety 29, 68–77.
Drapeau, A, Boyer, R and Diallo, FB (2011) Discrepancies between survey and administrative data on the use of mental health services in the general population: findings from a study conducted in Québec. BMC Public Health 11, 837.
Duncan, L, Georgiades, K, Wang, L, Van Lieshout, RJ, MacMillan, HL, Ferro, MA, Lipman, EL, Szatmari, P, Bennett, K, Kata, A, Janus, M and Boyle, MH (2018) Psychometric evaluation of the Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID). Psychological Assessment 30, 916.
Eaton, WW, Martins, SS, Nestadt, G, Bienvenu, OJ, Clarke, D and Alexandre, P (2008) The burden of mental disorders. Epidemiologic Reviews 30, 1–14.
Edwards, J, Thind, A, Stranges, S, Chiu, M and Anderson, KK (2020) Concordance between health administrative data and survey-derived diagnoses for mood and anxiety disorders. Acta Psychiatrica Scandinavica 141, 385–395.
Edwards, J, Pananos, AD, Thind, A, Stranges, S, Chiu, M and Anderson, KK (2021) A Bayesian approach to estimating the population prevalence of mood and anxiety disorders using multiple measures. Epidemiology and Psychiatric Sciences 30, e4.
Erskine, HE, Norman, RE, Ferrari, AJ, Chan, GCK, Copeland, WE, Whiteford, HA and Scott, JG (2016) Long-term outcomes of attention-deficit/hyperactivity disorder and conduct disorder: a systematic review and meta-analysis. Journal of the American Academy of Child and Adolescent Psychiatry 55, 841–850.
Fiest, KM, Jette, N, Quan, H, St Germaine-Smith, C, Metcalfe, A, Patten, SB and Beck, CA (2014) Systematic review and assessment of validated case definitions for depression in administrative data. BMC Psychiatry 14, 289.
Fortin, M, Haggerty, J, Sanche, S and Almirall, J (2017) Self-reported versus health administrative data: implications for assessing chronic illness burden in populations. A cross-sectional study. CMAJ Open 5, E729.
Gandhi, S, Chiu, M, Lam, K, Cairney, JC, Guttmann, A and Kurdyak, P (2016) Mental health service use among children and youth in Ontario: population-based trends over time. Canadian Journal of Psychiatry 61, 119–124.
Georgiades, K, Duncan, L, Wang, L, Comeau, J, Boyle, MH and the 2014 Ontario Child Health Study Team (2019) Six-month prevalence of mental disorders and service contacts among children and youth in Ontario: evidence from the 2014 Ontario Child Health Study. Canadian Journal of Psychiatry 64, 246–255.
Government of Ontario (2018) OHIP for ALL: Healthier Together. A Universal Health Care System. Toronto, ON: Queen's Printer for Ontario.
Hinds, A, Lix, LM, Smith, M, Quan, H and Sanmartin, C (2016) Quality of administrative health databases in Canada: a scoping review. Canadian Journal of Public Health 107, e56–e61.
Kessler, RC, Avenevoli, S and Costello, EJ (2012) Prevalence, persistence, and sociodemographic correlates of DSM-IV disorders in the National Comorbidity Survey Replication–Adolescent supplement. Archives of General Psychiatry 69, 372–380.
Kisely, S, Lin, E, Gilbert, C, Smith, M, Campbell, L-A and Vasiliadis, H-M (2009a) Use of administrative data for the surveillance of mood and anxiety disorders. The Australian and New Zealand Journal of Psychiatry 43, 1118–1125.
Kisely, S, Lin, E, Lesage, A, Gilbert, C, Smith, M, Campbell, L-A and Vasiliadis, H-M (2009b) Use of administrative data for the surveillance of mental disorders in 5 provinces. Canadian Journal of Psychiatry 54, 571–575.
Luiten, A, Hox, J and de Leeuw, E (2020) Survey nonresponse trends and fieldwork effort in the 21st century: results of an international study across countries and surveys. Journal of Official Statistics 36, 469–487.
Merikangas, KR, He, J and Burstein, M (2010) Lifetime prevalence of mental disorders in U.S. adolescents: results from the National Comorbidity Survey Replication–Adolescent supplement (NCS-A). Journal of the American Academy of Child and Adolescent Psychiatry 49, 980–989.
Muggah, E, Graves, E, Bennett, C and Manuel, DG (2013) Ascertainment of chronic diseases using population health data: a comparison of health administrative data and patient self-report. BMC Public Health 13, 18.
O'Donnell, S, Vanderloo, S, McRae, L, Onysko, J, Patten, SB and Pelletier, L (2016) Comparison of the estimated prevalence of mood and/or anxiety disorders in Canada between self-report and administrative data. Epidemiology and Psychiatric Sciences 25, 360–369.
Palin, JL, Goldner, EM, Koehoorn, M and Hertzman, C (2011) Primary mental health care visits in self-reported data versus provincial administrative records. Health Reports 22, 41–47.
Payette, Y, de Moura, CS, Boileau, C, Bernatsky, S and Noisel, N (2020) Is there an agreement between self-reported medical diagnosis in the CARTaGENE cohort and the Québec administrative health databases? International Journal of Population Data Science 5, 1155.
Pitchforth, J, Fahy, K, Ford, T, Wolpert, M, Viner, RM and Hargreaves, DS (2019) Mental health and well-being trends among children and young people in the UK, 1995–2014: analysis of repeated cross-sectional national health surveys. Psychological Medicine 49, 1275–1285.
Ploubidis, GB, Batty, GD, Patalay, P, Bann, D and Goodman, A (2021) Association of early-life mental health with biomarkers in midlife and premature mortality: evidence from the 1958 British Birth Cohort. JAMA Psychiatry 78, 38–46.
Polanczyk, GV, Salum, GA and Sugaya, LS (2015) Annual research review: a meta-analysis of the worldwide prevalence of mental disorders in children and adolescents. Journal of Child Psychology and Psychiatry, and Allied Disciplines 56, 345–365.
Reid, GJ, Tobon, JI and Shanley, DC (2008) What is a mental health clinic? How to ask parents about help-seeking contacts within the mental health system. Administration and Policy in Mental Health and Mental Health Services Research 35, 241–249.
Rhodes, AE, Lin, E and Mustard, CA (2002) Self-reported use of mental health services versus administrative records: should we care? International Journal of Methods in Psychiatric Research 11, 125–133.
Schraeder, KE, Barwick, M, Cairney, J, Carter, J, Kurdyak, P, Neufeld, RW, Stewart, SL, St. Pierre, J, Tobon, J, Vingilis, E, Zaric, G and Reid, GJ (2021) Re-accessing mental health care after age 18: a longitudinal cohort study of youth involved with community-based Child and Youth Mental Health agencies in Ontario. Journal of the Canadian Academy of Child and Adolescent Psychiatry 30, 12–24.
Sheehan, DV, Sheehan, KH, Shytle, RD, Janavs, J, Bannon, Y, Rogers, JE, Milo, KM, Stock, SL and Wilkinson, B (2010) Reliability and validity of the Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID). The Journal of Clinical Psychiatry 71, 313–326.
Shrout, PE (1998) Measurement reliability and agreement in psychiatry. Statistical Methods in Medical Research 7, 301–317.
Statistics Canada (2017a) Microdata User Guide 2014 Ontario Child Health Study (Confidential). Ottawa, ON: Special Surveys Division.
Statistics Canada (2017b) Population Centre and Rural Area Classification 2016. Ottawa: Statistics Canada. Available at https://www.statcan.gc.ca/eng/subjects/standard/pcrac/2016/introduction (Accessed 21 April 2021).
Steele, LS, Glazier, RH, Lin, E and Evans, M (2004) Using administrative data to measure ambulatory mental health service provision in primary care. Medical Care 42, 960–965.
Vigo, D, Jones, W, Dove, N, Maidana, DE, Tallon, C, Small, W and Samji, H (2021) Estimating the prevalence of mental and substance use disorders: a systematic approach to triangulating available data to inform health systems planning. Canadian Journal of Psychiatry 67, 107–116.
Waddell, C, Schwartz, C, Andres, C, Barican, JL and Yung, D (2018) Fifty years of preventing and treating childhood behaviour disorders: a systematic review to inform policy and practice. Evidence-Based Mental Health 21, 45–52.
World Health Organization (1978) International Classification of Diseases, 9th Revision, Basic Tabulation List with Alphabetic Index. Geneva: World Health Organization. Available at https://apps.who.int/iris/handle/10665/39473 (Accessed 21 April 2021).
World Health Organization (2004) ICD-10: International Statistical Classification of Diseases and Related Health Problems, 10th Revision, 2nd Edn. Geneva: World Health Organization. Available at https://apps.who.int/iris/handle/10665/42980 (Accessed 21 April 2021).