Quality of food-frequency questionnaire validation studies in the dietary assessment of children aged 12 to 36 months: a systematic literature review

Amy Lovell; Rhodi Bulloch; Clare R. Wall; Cameron C. Grant

doi:10.1017/jns.2017.12

Published online by Cambridge University Press: 08 May 2017

Amy Lovell*: Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Rhodi Bulloch: Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Clare R. Wall: Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Cameron C. Grant: Affiliation:
Department of Paediatrics: Child and Youth Health, University of Auckland, Auckland, New Zealand; Centre for Longitudinal Research He Ara ki Mua, University of Auckland, Auckland, New Zealand; Starship Children's Hospital, Auckland District Health Board, Auckland, New Zealand
* Corresponding author: A. Lovell, email [email protected]

Abstract

A child's diet is an important determinant of growth and development. However, the accurate assessment of dietary intake in young children remains a challenge. A systematic search of studies validating FFQ methodologies in children 12 to 36 months of age was completed. English-language articles published until March 2016 were searched using three electronic databases (MEDLINE, EMBASE and CINAHL). Quality assessment of the identified studies was carried out using the Reduced Summary Score and the EURopean micronutrient RECommendations Aligned (EURRECA) scoring system. Seventeen studies were included and categorised according to whether they reflected long-term (≥7 d) or short-term (<7 d) intake, or used a biomarker. A total score for each micronutrient was calculated from the mean of the correlation coefficients weighted by the study quality score. At least three validation studies per micronutrient were required for inclusion. Fifteen studies (83 %) that considered validity of the FFQ in assessing nutrient intakes had quality scores from 2·5 to 6·0. Of those, ten (67 %) studies found FFQ to have good correlations in assessing dietary intake (>0·4). Of the nutrients with three or more studies available, FFQ validated using a reference method reflecting short-term intake had a good weighted correlation for Ca (0·51), and acceptable weighted correlations for vitamin C (0·31) and Fe (0·33). Semi-quantitative FFQ were shown to be valid and reproducible when estimating dietary intakes at a group level, and are an acceptable instrument for estimating intakes of Ca, vitamin C and Fe in children 12 to 36 months of age.

Keywords

Food-frequency questionnaires; Infants; Validity; Dietary assessment methods

Type: Systematic Review
Information: Journal of Nutritional Science, Volume 6, 2017, e16

DOI: https://doi.org/10.1017/jns.2017.12
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s) 2017

The accurate description and measurement of dietary intake is a necessary step in determining the nutritional adequacy of diets in individuals or a population⁽ Reference Livingstone and Robson ¹ ⁾. Having valid and reliable assessment tools is essential to increase our understanding of the relationship between dietary intake and health outcomes, and our understanding of the dietary determinants of nutritional status⁽ Reference Kolodziejczyk, Merchant and Norman ² ⁾.

Food and nutrient intakes are estimated via dietary assessment methods that differ according to a study's aims and objectives, skills of the study population, accuracy of the required dietary data, study resources and study design⁽ Reference Willett ³ ⁾. Most epidemiological studies use variations of the FFQ, which can be validated using biomarkers or tools that measure daily dietary intake⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾. The FFQ has an advantage of being an inexpensive method of obtaining data from a large number of participants, with a relatively low respondent burden and can be used to estimate an individual's average consumption over an extended period of time⁽ Reference Willett ³ ^, Reference Vereecken ⁶ ⁾.

There is no definitive ‘gold standard’ in dietary assessment, nor is there a ‘gold standard’ for assessing the validity of FFQ⁽ Reference Cade, Thompson and Burley ⁷ ⁾. Therefore estimation of a tool's relative validity relies upon a comparison with a superior and preferably independent technique, known as comparative validation⁽ Reference Willett ³ ⁾. Here, weighed food records (WFR) and 24-h recalls (24-HR) are commonly used due to their greater precision in the quantification of intake⁽ Reference Willett ³ ⁾. Factors that may affect the validity of a diet questionnaire have been reviewed⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ^, Reference Block and Hartman ⁸ ⁾.

Early childhood is a life phase where the assessment of dietary intake is particularly challenging. Measurement of energy and nutrient intakes in young children is affected by unique respondent and observer considerations, making the collection of accurate and reliable dietary intakes difficult⁽ Reference Livingstone and Robson ¹ ⁾. Young children aged 12 to 36 months have highly variable diets that are characterised by rapidly changing food habits and transitions in dietary patterns, and often not all food served to an infant is consumed in its entirety⁽ Reference Coulston and Boushey ⁹ ^– Reference Ortiz-Andrellucchi, Henríquez-Sánchez and Sánchez-Villegas ¹² ⁾. The acquisition of dietary intake information for children less than 7 years of age is dependent upon surrogate reporters, e.g. parents, caregivers and external caretakers⁽ Reference Livingstone and Robson ¹ ^, Reference Livingstone, Robson and Wallace ¹³ ⁾. Therefore, the accuracy of dietary assessment in this age group depends on an adult's ability to reliably report on the child's intake, with previous evidence suggesting that parents can provide a more reliable report on foods consumed in the home setting than on foods consumed away from home⁽ Reference Livingstone and Robson ¹ ^, Reference Livingstone, Robson and Wallace ¹³ ⁾.

As a consequence of these methodological challenges, the number and type of validated tools available to assess the dietary intake of young children, particularly children 12 to 36 months of age, are limited. The aim of this systematic literature review was to describe and assess the quality of studies reporting on the validity of FFQ as a method for assessing food and nutrient intakes or dietary patterns in 12- to 36-month-old children.

Methods

Protocol registration

The inclusion and exclusion criteria, and analysis methods were specified in advance in a documented protocol. This protocol was not registered with PROSPERO⁽ ¹⁴ ⁾ as it is an assessment of the quality of validation studies and does not report on a health-related outcome.

Eligibility criteria

Studies that evaluated the validity of FFQ in the assessment of dietary intake, food(s), and dietary patterns with a reference dietary assessment tool (e.g. 24-HR, diet records, diet histories, WFR and biomarkers) in healthy children aged 12 to 36 months and met all the inclusion criteria (Fig. 1) were included in the review. Randomised controlled trials were not available; therefore analytical study designs were limited to prospective and retrospective cohort studies. Case series, case reports and case–control studies were excluded due to the high potential for bias.

Fig. 1. Inclusion and exclusion criteria used to select studies for inclusion in the systematic review.

Information sources

Studies were identified via searching online databases, hand-searching reference lists of original articles, and cited reference searches. The search focused on relevant studies published before March 2016 and was limited to those published in English, without limits on time frame or country. Grey literature was also considered.

Search strategy

A literature search was applied to MEDLINE (1946 to present), EMBASE (1980 to present) and CINAHL (1937 to present) electronic databases, and Google Scholar. Medical Subject Headings (MeSH), MeSH major topics and free text terms were developed in the MEDLINE and EMBASE databases under five group headings: (1) infant (12–36 months), e.g. toddler, preschool*, child, infant, newborn*, pre-school*, babies, baby, kindergarten, children under 2, children under 3; (2) diet, e.g. nutrition, dietary pattern, food intake, diet quality, infant nutrition, child nutrition, nutritional assessment, eating pattern, nutritional status, feeding behaviour, food combination, childhood diet, infant food; (3) dietary assessment, e.g. diet surveys, questionnaires, instrument, dietary intake methods, assess*, evaluat*, nutrition surveys; (4) dietary assessment tool, e.g. food frequency questionnaire, FFQ; (5) instrument validation, e.g. validity, reproducibility, correlation coefficient, reliability, validation studies, replication stud*, correlation stud*, repeatability. Key words and combinations were identified in free text, article titles and abstracts, and were used to perform a comprehensive search of the databases. Search terms and strategies were adapted for use in other databases and were peer reviewed. All retrieved articles were sent to RefWorks® (version 4.4.1237; ProQuest LLC) where duplicates were removed.

Study selection

Two reviewers (A. L. and R. B.) determined a study's eligibility in an independent, unblinded and standardised manner. Systematic literature reviews were not included in the analysis. Titles and abstracts were reviewed to assess whether they met the inclusion criteria for full-text review (Fig. 1). Disagreements between reviewers were resolved by consensus, or if the decision on study inclusion or exclusion was unclear, the full text was obtained. In studies where the age range of participants included, but was much wider than, 12 to 36 months, e.g. 2 to 9 years, the reviewers attempted to obtain results from authors specific to the age range of interest. Full-text articles that fulfilled all criteria for inclusion were reviewed in a second screening process as the definitive step for inclusion.

Data collection process

A data extraction sheet based on examples found in the selected literature was developed. One review author (A. L.) extracted key data into a prepared table, which was checked by a co-author (R. B.). Any disagreements were resolved through discussion between the review authors (A. L. and R. B.), and if no agreement could be reached a prearranged third reviewer was asked to arbitrate (C. W.). Direct contact via email was made with four authors to obtain information in addition to that which could be abstracted from the published paper. In all four cases this request was for information within the age range of interest (12 to 36 months) from a study that reported data over a wider age range. One follow-up email was sent if no response was received. No authors responded with data from their studies specific to the age range of interest.

Data items

A concise overview of the seventeen included studies is shown in Table 1 ⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^– Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^– Reference Mills, Skidmore and Watson ³¹ ⁾. The areas of interest included: population characteristics (size, age, location, ethnicity), FFQ characteristics (food groups, food items, consumption interval, administration method, portion estimation, number of FFQ administered, and FFQ re-test interval), reference method used, outcome measures (validity, reproducibility) and the statistics employed to assess validity between two methods or reproducibility of the FFQ.

Table 1. Characteristics of included studies evaluating long-term or short-term nutrient intake, or biomarker, food or food group

3D, three-dimensional; 24HR, 24 h recall; CC, correlation coefficient; DR, diet record; FD, food diary; FR, food record; HFFQ, Harvard Service Food Frequency Questionnaire; ICC, intra-class correlation; LOA, limits of agreement; NA, not applicable; NR, not reported; rec., record; WFR, weighed food record.

* Mean.

† Median.

Synthesis of results

Studies were classified into three categories based on the reference method applied to the validation study. This method has been previously reported and consisted of:

(1) Long-term intake – the reference method covered ≥7 d.
(2) Short-term intake – the reference method covered <7 d.
(3) Biomarker – the reference method was a biomarker.
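
The three categories above can be expressed as a simple rule. The following is a minimal illustrative sketch, not code used in the review; the function name `classify_reference_method` and its parameters are hypothetical, and it assumes the number of days covered by the reference method is known for every non-biomarker study.

```python
from typing import Optional


def classify_reference_method(days_covered: Optional[int],
                              is_biomarker: bool = False) -> str:
    """Return the review category for a validation study's reference method.

    days_covered: number of days of intake captured by the reference
                  method (ignored when the reference is a biomarker).
    """
    if is_biomarker:
        return "biomarker"
    if days_covered is None or days_covered < 1:
        raise ValueError("days_covered must be a positive integer")
    # The review's cut-off: >=7 d is long-term, <7 d is short-term.
    return "long-term" if days_covered >= 7 else "short-term"


print(classify_reference_method(7))                        # e.g. a 7-d WFR
print(classify_reference_method(3))                        # e.g. a 3-d WFR
print(classify_reference_method(None, is_biomarker=True))  # e.g. serum Fe
```

A study that reports both a dietary reference method and a biomarker would need to be classified once per reference method under this sketch.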

Quality assessment

Following classification, the two reviewers (A. L. and R. B.) independently completed quality assessment of the included validation studies using the reduced summary score by Dennis et al.⁽ Reference Dennis, Snetselaar and Nothwehr ³² ⁾ which assessed the quality of the nutrition information from the FFQ, and an additional scoring system developed by the EURopean Micronutrient RECommendations Aligned (EURRECA) network used in studies assessing nutrient intakes with the aim of including, excluding and weighting studies⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ^, Reference Ortiz-Andrellucchi, Henríquez-Sánchez and Sánchez-Villegas ¹² ⁾. These scoring tools evaluated methodological quality of the identified studies and determined the extent to which a study addressed the possibility of bias in their design, conduct and analysis. This dual scoring system approach was used in a previous review of FFQ for assessing dietary intake in adolescents⁽ Reference Tabacchi, Amodio and Di Pasquale ³³ ⁾.

Because of the heterogeneity between the dietary assessment methods used as the reference, study designs, populations, and duration of the study, only a narrative review of the literature was performed. A meta-analysis could not be conducted due to a lack of randomised controlled trials.

The summary score by Dennis et al.⁽ Reference Dennis, Snetselaar and Nothwehr ³² ⁾ scores studies based on objective measures of quality dietary assessment. The reduced summary score with a maximum score of 8 was utilised for simplified quality assessment of the FFQ as seen in Tabacchi et al.⁽ Reference Tabacchi, Amodio and Di Pasquale ³³ ⁾. Validation studies that had a reduced summary score of ≥5 were classified as being ‘high quality’ and scores <5 as ‘low quality’. This scoring tool was used for all included studies. The EURRECA⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾ scoring system was only applied to studies that assessed nutrient intakes. Summary scores range from 0 (poorest quality) to 7 (highest possible score) and are ranked as ‘very good/excellent’ (score ≥5), ‘good’ (score ≥3·5 and <5), ‘acceptable’ (score ≥2·5 and <3·5) and ‘poor’ (score <2·5)⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾. In order to estimate a mean correlation per micronutrient for the included studies, the correlation coefficient from each study was first multiplied by its quality score. Next, the sum of the weighted correlations was divided by the sum of the quality scores to provide a correlation coefficient that was adjusted for the study's methodological quality. Mean weighted correlation coefficients were only calculated for micronutrients with correlations available from three or more studies⁽ Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez ³⁴ ⁾. This allows for concurrent analysis of multiple validation studies and gives an estimate of a mean correlation coefficient per micronutrient for a given dietary assessment method⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾. The intake method was rated as poor when the correlation was <0·30, acceptable between 0·30 and 0·50, good between 0·51 and 0·70, and very good when the correlation was >0·70⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾.
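
The quality-weighted mean correlation described above is the sum of (correlation × quality score) divided by the sum of quality scores. The sketch below illustrates that arithmetic and the rating bands; the function names are hypothetical and the example values are invented for illustration, not taken from any included study. Rounding to two decimal places before rating is an assumption made here to handle values that fall between the published bands (e.g. between 0·50 and 0·51).

```python
def weighted_mean_correlation(studies, min_studies=3):
    """Quality-weighted mean correlation for one micronutrient.

    studies: list of (correlation, quality_score) pairs, one per study.
    Returns None when fewer than `min_studies` studies are available,
    mirroring the review's three-study minimum.
    """
    if len(studies) < min_studies:
        return None
    total_quality = sum(quality for _, quality in studies)
    if total_quality <= 0:
        raise ValueError("quality scores must sum to a positive value")
    weighted_sum = sum(r * quality for r, quality in studies)
    return weighted_sum / total_quality


def rate_correlation(r):
    """Map a correlation onto the bands quoted in the text."""
    r = round(r, 2)  # assumption: rate at two-decimal precision
    if r < 0.30:
        return "poor"
    if r <= 0.50:
        return "acceptable"
    if r <= 0.70:
        return "good"
    return "very good"


def rate_eurreca_score(score):
    """Map a EURRECA summary score (0 to 7) onto its quality band."""
    if not 0 <= score <= 7:
        raise ValueError("EURRECA summary scores range from 0 to 7")
    if score >= 5:
        return "very good/excellent"
    if score >= 3.5:
        return "good"
    if score >= 2.5:
        return "acceptable"
    return "poor"


# Invented example: three studies reporting a correlation for one nutrient.
example = [(0.60, 5.0), (0.40, 3.0), (0.50, 4.0)]
r_weighted = weighted_mean_correlation(example)
print(round(r_weighted, 2), rate_correlation(r_weighted))
```

Because higher-quality studies carry more weight, a strong correlation from a poorly scored study moves the pooled value less than the same correlation from a well-scored study.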

Results

Study selection

A total of 373 articles were identified (Fig. 2). Following removal of duplicates, 236 articles unique by title and abstract remained for review. Application of inclusion and exclusion criteria resulted in fifty-nine articles being selected for full-text review. Thirty-nine studies were included for quality appraisal. All studies were cross-sectional in their design, and thus classified as level IV evidence⁽ ³⁵ ⁾. Following quality appraisal twenty-two studies were excluded, leaving seventeen articles⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^– Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Sochacka-Tatara and Pac ²⁷ ^– Reference Klohe, Clarke and George ²⁹ ^, Reference Mills, Skidmore and Watson ³¹ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾ identified as assessing the validity of an FFQ against a dietary reference instrument in children 12 to 36 months of age.
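
The counts reported at each stage of selection imply the number of records removed between stages. As a quick consistency check, the short sketch below derives those differences from the figures quoted in the paragraph above; the stage labels are shorthand introduced here and are not taken from Fig. 2.

```python
# Stage counts as quoted in the text (labels are shorthand, not from Fig. 2).
stages = [
    ("identified", 373),
    ("unique after duplicate removal", 236),
    ("selected for full-text review", 59),
    ("included for quality appraisal", 39),
    ("included in the review", 17),
]

# Number of records removed between each pair of consecutive stages.
removed = [
    earlier_count - later_count
    for (_, earlier_count), (_, later_count) in zip(stages, stages[1:])
]

for (earlier, _), (later, _), n in zip(stages, stages[1:], removed):
    print(f"{earlier} -> {later}: {n} removed")
```

The final difference (39 − 17 = 22) agrees with the twenty-two studies reported as excluded after quality appraisal.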

Fig. 2. Selection process flow of articles identified that assess validity of FFQ methods in children aged 12–36 months.

Nine of the publications reported results from North American countries (USA and Canada)⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ^– Reference Williams and Innis ²⁰ ^, Reference Rankin, Levy and Warren ²³ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ^, Reference Klohe, Clarke and George ²⁹ ⁾, five from the UK and Europe⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Bel-Serrat, Fernandez Alvira and Pala ³⁰ ⁾, and three from New Zealand⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ^, Reference Metcalf, Scragg and Sharpe ³⁸ ⁾. The number of participants ranged from seventeen⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ⁾ to 240⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ⁾, with two studies presenting data from large cohorts: The Iowa Fluoride Study⁽ Reference Rankin, Levy and Warren ²³ ⁾ and The IDEFICS Study (Identification and prevention of Dietary- and lifestyle-induced health EFfects In Children and infantS)⁽ Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾.

Characteristics of included studies

Characteristics of each of the seventeen included validation studies are described in Table 1. Fourteen studies considered the validity of the FFQ to assess nutrient intakes⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^– Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾, and three studies considered the validity of the FFQ to assess food or food group(s)⁽ Reference Klohe, Clarke and George ²⁹ ^, Reference Mills, Skidmore and Watson ³¹ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. Two studies assessing nutrient intakes also used biomarkers as an additional reference method⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ⁾. Eleven of the included FFQ were semi-quantitative⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^– Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Rankin, Levy and Warren ²³ ^, Reference Sochacka-Tatara and Pac ²⁷ ^– Reference Klohe, Clarke and George ²⁹ ⁾, five were quantitative⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Williams and Innis ²⁰ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾, and one recorded frequency of consumption and not portion sizes⁽ Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. The number of food items ranged from seventy-eight⁽ Reference Marriott, Inskip and Borland ²¹ ⁾ to 191⁽ Reference Williams and Innis ²⁰ ^, Reference Klohe, Clarke and George ²⁹ ⁾ with an average of 113 food items. Those studies that assessed food and/or food group intakes had between seven⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ⁾ and seventy-seven⁽ Reference Vereecken, Covents and Maes ²² ⁾ food groups. Food intake intervals ranged from intake over the previous 7 d⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Rankin, Levy and Warren ²³ ⁾ to over the last year⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾, with the majority describing intake over the last month⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Klohe, Clarke and George ²⁹ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾.

Two studies were grouped according to a reference method that reflected long-term intake (7-d WFR)⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ⁾. Ten studies were grouped according to a reference method that reflected short-term intake where four applied 24-HR⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Sochacka-Tatara and Pac ²⁷ ⁾ and five applied WFR⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^– Reference Rankin, Levy and Warren ²³ ⁾, one of these being online⁽ Reference Vereecken, Covents and Maes ²² ⁾. One study utilised biomarkers as a reference method⁽ Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾. Among the seven studies that used WFR, the number of recorded days varied from 3 to 7 d⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Williams and Innis ²⁰ ^– Reference Vereecken, Covents and Maes ²² ^, Reference Klohe, Clarke and George ²⁹ ⁾. The number of repeated 24-HR ranged from 2⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾ to 3⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Williams and Innis ²⁰ ^, Reference Sochacka-Tatara and Pac ²⁷ ⁾ days of non-consecutive administration. In eleven studies the FFQ was self-administered by a parent or equivalent proxy reporter⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Blum, Wei and Rockett ¹⁷ ^– Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Rankin, Levy and Warren ²³ ^, Reference Sochacka-Tatara and Pac ²⁷ ^– Reference Klohe, Clarke and George ²⁹ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾, and in six studies it was interviewer-administered⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ^, Reference Williams and Innis ²⁰ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾. Methods of portion size estimation ranged from household measures/standard portion sizes⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾ to portion sizes derived from national nutrition survey data⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Klohe, Clarke and George ²⁹ ⁾. Three studies did not describe portion estimation⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ^, Reference Rankin, Levy and Warren ²³ ⁾, and two studies used a unique ‘palm’ measurement⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾. Of the thirteen studies that converted food intakes into nutrient intakes, six reported using national food composition databases (e.g. United States Department of Agriculture)⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Vereecken, Covents and Maes ²² ^– Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾, and two used other food composition databases (e.g. Harvard Nutrient Database)⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ⁾. Although not the primary aim of the validation study, two studies⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ^, Reference Parrish, Marshall and Krebs ¹⁸ ⁾ examined whether there were any differences between sex and care status (i.e. in child care or at home) when comparing mean nutrient intake values.

Statistical analysis

Statistical analyses used in the assessment of FFQ validity, and in some cases reproducibility, are described in Table 1. All included studies calculated differences in means and/or mean comparisons. Pearson or Spearman's correlation coefficients were calculated in all studies. Paired Student's t tests were used to evaluate whether there was any difference between the mean nutrient and food intakes determined by the two assessment methods⁽ Reference Parrish, Marshall and Krebs ¹⁸ ⁾. Factors that affect the validity of a dietary assessment instrument included: population characteristics, acceptability of the reference method data, FFQ design/quantification, quality control and data management⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ^, Reference Tabacchi, Amodio and Di Pasquale ³³ ⁾.

The calculation of weighted correlation coefficients allowed comparison with the other included studies. Here, correlation coefficients between 0·51 and 0·7 are considered good⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ^, Reference Cade, Thompson and Burley ⁷ ⁾. Four studies considered crude correlation coefficients⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ^, Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Rankin, Levy and Warren ²³ ⁾, whilst seven studies adjusted nutrients using energy-adjusted values⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾, and three studies calculated de-attenuated values to account for measurement error⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Mills, Skidmore and Watson ³¹ ⁾ or intra-class correlations⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾. All six studies that performed cross-classification analysis ranked participants by using the same or adjacent quartile. Three of these studies⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Watson, Heath and Taylor ²⁶ ⁾ assessed the classification of participants according to their nutrient intakes and three studies⁽ Reference Klohe, Clarke and George ²⁹ ^, Reference Mills, Skidmore and Watson ³¹ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾ assessed the classification of participants according to their food or food group intakes. Weighted κ was calculated in two studies that considered food intakes⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Rankin, Levy and Warren ²³ ⁾. Here, four categories were used to calculate κ statistics and classify food intake data.
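
Cross-classification by quartile, as described above, compares the quartile each participant falls into under the FFQ with the quartile under the reference method. The sketch below is one plausible way to compute the proportions classified into the same, the same or adjacent, and opposite quartiles. The function names and the rank-based quartile assignment (ties broken by input order) are assumptions made for illustration, and the intake values are invented.

```python
def quartile_ranks(values):
    """Assign each value to a quartile 0..3 by rank.

    Ties are broken by input order (an assumption of this sketch).
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    quartiles = [0] * n
    for rank, idx in enumerate(order):
        quartiles[idx] = rank * 4 // n
    return quartiles


def cross_classification(ffq, reference):
    """Proportion of participants in same, same-or-adjacent and opposite
    quartiles when ranked by the FFQ and by the reference method."""
    if len(ffq) != len(reference):
        raise ValueError("both methods need one value per participant")
    if len(ffq) < 4:
        raise ValueError("at least four participants are needed")
    q_ffq = quartile_ranks(ffq)
    q_ref = quartile_ranks(reference)
    n = len(ffq)
    diffs = [abs(a - b) for a, b in zip(q_ffq, q_ref)]
    same = sum(d == 0 for d in diffs)
    adjacent = sum(d == 1 for d in diffs)
    opposite = sum(d == 3 for d in diffs)
    return {
        "same": same / n,
        "same_or_adjacent": (same + adjacent) / n,
        "opposite": opposite / n,
    }


# Invented intakes for eight participants (arbitrary units).
ffq_intake = [10, 20, 30, 40, 50, 60, 70, 80]
ref_intake = [12, 35, 18, 45, 48, 75, 66, 90]
print(cross_classification(ffq_intake, ref_intake))
```

A high same-or-adjacent proportion with few opposite-quartile classifications suggests the FFQ ranks participants similarly to the reference method, even when absolute intakes differ.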

Two studies⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ⁾ assessed the reproducibility of the FFQ for estimating dietary intake patterns and estimation of reproducibility of nutrient intakes was achieved by calculating correlation coefficients and intra-class correlations. Acceptable intra-class correlations ranged from >0·4⁽ Reference Cade, Thompson and Burley ⁷ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Gibson ³⁹ ⁾ to 0·7⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾ when establishing test–retest reliability of the FFQ. In order to test reproducibility, five⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Klohe, Clarke and George ²⁹ ^, Reference Mills, Skidmore and Watson ³¹ ⁾ studies administered the FFQ on two occasions. Intervals between test and retest ranged from 2 weeks⁽ Reference Klohe, Clarke and George ²⁹ ⁾ to 1 month⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾. One study⁽ Reference Blum, Wei and Rockett ¹⁷ ⁾ administered the FFQ on two occasions, 1 month apart but did not report on the statistical analysis used for reproducibility.

Results of individual studies by validation method used

Included studies were analysed according to the reference method used (i.e. WFR, 24-HR or biomarker) and whether the tool reflected long-term or short-term intake.

FFQ v. 24-h recalls

Five studies⁽ Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾ used 24-HR as their reference method to validate an FFQ. In all studies the FFQ overestimated median/mean nutrient intake estimates but could provide reliable estimates of nutrient intakes in young children with good agreement when compared with the 24-HR (Table 1). Nutrient correlations that were energy-adjusted or de-attenuated (to reduce dependency on within-person variation) were found to have higher correlation coefficients compared with crude values. Cross-classification into low, medium and high consumers was moderate (>30 % classification into the same quartile). One study⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾ assessed repeatability/reproducibility using a 24-HR as a reference tool. Correlations for most nutrients were >0·70, indicating low within-person variation.
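
The review does not state which de-attenuation formula the included studies applied. As an illustration only, the sketch below uses a correction that is widely used in nutritional epidemiology, in which the observed correlation is multiplied by √(1 + λ/n), where λ is the ratio of within-person to between-person variance in the reference method and n is the number of replicate days. The function name and the example values are hypothetical.

```python
import math


def deattenuate(r_observed, within_var, between_var, n_days):
    """Correct an observed FFQ-reference correlation for day-to-day
    (within-person) variation in the reference method.

    Assumed formula: r_corrected = r_observed * sqrt(1 + lambda / n),
    with lambda = within_var / between_var and n = replicate days.
    """
    if between_var <= 0 or within_var < 0:
        raise ValueError("variances must be non-negative, between_var > 0")
    if n_days < 1:
        raise ValueError("n_days must be at least 1")
    variance_ratio = within_var / between_var
    return r_observed * math.sqrt(1 + variance_ratio / n_days)


# Invented example: three recall days, within-person variance twice
# the between-person variance.
print(round(deattenuate(0.40, within_var=2.0, between_var=1.0, n_days=3), 2))
```

The correction always increases the magnitude of the correlation, and more so when few replicate days are available or day-to-day variation is large, which is consistent with de-attenuated values being higher than crude values. With very large variance ratios the corrected value can exceed 1, so results of this kind need cautious interpretation.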

FFQ v. food record (±weighing)

Eleven studies used WFR as their reference method to validate an FFQ⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Iannotti, Zuckerman and Blyer ¹⁶ ^, Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Williams and Innis ²⁰ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Rankin, Levy and Warren ²³ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Klohe, Clarke and George ²⁹ ^, Reference Mills, Skidmore and Watson ³¹ ⁾. Ten studies that estimated nutrient intakes found that the FFQ tended to overestimate intakes (Table 1) but found good correlations (>0·4)⁽ Reference Cade, Thompson and Burley ⁷ ⁾ between the FFQ and WFR for most nutrients, energy intakes and food intakes. The included FFQ mostly indicated a moderate ability to rank infants according to their nutrient intakes, with two studies by Andersen et al.⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ⁾ showing that the ability of the questionnaire to rank infants according to their intakes increased when using nutrient density values over absolute values.

FFQ v. biomarker

Using biomarkers as the reference method was less frequent. Three studies used biomarkers⁽ Reference Andersen, Lande and Trygg ¹⁵ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, ³⁵ ⁾. Two articles⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ⁾ presented validation of an FFQ using biomarkers and a second dietary assessment instrument (24-HR or WFR) as reference methods. The biomarkers analysed included: total lipids, plasma levels of vitamins C, D and E, retinol and β-carotene⁽ Reference Parrish, Marshall and Krebs ¹⁸ ⁾, serum markers of Fe⁽ Reference Williams and Innis ²⁰ ⁾ and fatty acid composition measured in erythrocytes⁽ Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾.

Evaluation of food or food groups

Using a semi-quantitative FFQ, excellent reliability and adequate validity were seen in assessing food choices of low-income children⁽ Reference Klohe, Clarke and George ²⁹ ⁾, whereas another FFQ showed low levels of agreement and limited ability to rank children according to intakes of food groups⁽ Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. More recently, in Otago, New Zealand, a semi-quantitative FFQ displayed good validity (r 0·52) and high reproducibility in the identification of dietary patterns, and in ranking the diets of toddlers when compared with a 5-d WFR. The FFQ overestimated energy and nutrient intakes and could not measure absolute intakes, but could be used to identify toddlers at extreme ends of the intake distribution⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾.

Additional analysis: quality assessment

A summary of the quality assessment of the seventeen included studies is shown in Table 2. Using the reduced summary score⁽ Reference Dennis, Snetselaar and Nothwehr ³² ⁾, one validation study that assessed nutrient intakes received a low quality ranking⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ⁾ and one study that assessed food intake received a low quality ranking⁽ Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. The remaining fifteen studies received high quality rankings. Criteria that reduced the quality of the study included the number of food items in the FFQ (<70 food items is likely to reduce the quality of the nutrition information) and self-administration of the FFQ.

Table 2. Quality scores using methods described by Dennis et al.⁽ Reference Dennis, Snetselaar and Nothwehr ³² ⁾ and the EURopean Micronutrient RECommendations Aligned (EURRECA) scoring tool⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾

NA, not available, fewer than three studies found.

* Dennis et al. ⁽ Reference Dennis, Snetselaar and Nothwehr ³² ⁾ quality level: high (≥5); low (<5).

† EURRECA quality score: very good/excellent (≥5); good (≥3·5 to <5); acceptable/reasonable (≥2·5 to <3·5); poor (<2·5).
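
As an illustration only, the two sets of cut-offs in the footnotes to Table 2 can be written as simple lookup functions. The function names are hypothetical, and the 0 to 7 range check assumes the maximum EURRECA score of 7·0 stated in this review.

```python
def eurreca_category(score):
    """Map a EURRECA quality score (0 to 7) to the bands given in the Table 2 footnote."""
    if not 0 <= score <= 7:
        raise ValueError("EURRECA scores are assumed to lie between 0 and 7")
    if score >= 5:
        return "very good/excellent"
    if score >= 3.5:
        return "good"
    if score >= 2.5:
        return "acceptable/reasonable"
    return "poor"


def dennis_level(score):
    """Map a reduced summary score to the high/low level given in the Table 2 footnote."""
    return "high" if score >= 5 else "low"
```

For example, a study scoring 3·8 on the EURRECA scale falls in the 'good' band under these cut-offs.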

Using the EURRECA scoring system⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾, fourteen studies assessed nutrient intakes, with quality scores ranging from 2·5 to 6·0 (maximum 7·0). The average quality score was 3·8, with a median of 3·5. Table 2 illustrates the classification of the included studies according to their reference method and methodological quality, with three studies⁽ Reference Williams and Innis ²⁰ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Sochacka-Tatara and Pac ²⁷ ⁾ (21 %) rating as very good/excellent, five studies⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾ (36 %) as good quality, five studies⁽ Reference Iannotti, Zuckerman and Blyer ¹⁶ ^, Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Vereecken, Covents and Maes ²² ^, Reference Rankin, Levy and Warren ²³ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾ (36 %) having an acceptable quality, and one study⁽ Reference Rankin, Levy and Warren ²³ ⁾ (7 %) having a poor quality rating. ‘Good’ quality scores were seen in the validation studies where FFQ were compared with a reference method that was reflective of long-term intakes, and a majority (58 %) of validation studies where the FFQ was compared with a reference method that was reflective of short-term intakes were either ‘good’ or ‘very good’. Factors affecting the EURRECA quality assessment score⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ⁾ were the statistical analyses used and data collection via interviewer-administration. 
Calculation of energy-adjusted⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾, de-attenuated (to reduce the dependency on between-person variation)⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Sochacka-Tatara and Pac ²⁷ ⁾, or intra-class correlation coefficients increased quality scores⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾.
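
For readers unfamiliar with de-attenuation, the sketch below shows one widely used form of the correction, in which the observed correlation is multiplied by the square root of (1 + λ/n), where λ is the ratio of within-person to between-person variance in the reference method and n is the number of replicate reference days per child. This is an assumption made for illustration; the included studies do not all state which formula they applied, and the function name is hypothetical.

```python
import math


def deattenuate(r_observed, within_var, between_var, n_replicates):
    """Correct an observed correlation for day-to-day variation in the reference method.

    Illustrative formula: r_true = r_observed * sqrt(1 + (within_var / between_var) / n_replicates).
    """
    if between_var <= 0:
        raise ValueError("between-person variance must be positive")
    if n_replicates < 1:
        raise ValueError("at least one replicate day is required")
    variance_ratio = within_var / between_var
    return r_observed * math.sqrt(1 + variance_ratio / n_replicates)
```

The correction grows as the day-to-day variation increases and shrinks as more reference days are collected, which is one reason why crude and de-attenuated coefficients are difficult to compare across studies.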

Concurrent validation analysis

Table 3. Classification of dietary assessment methods for infants aged 12–36 months according to the weighted mean of the correlations of micronutrients with three or more studies available (separate comparisons of those studies reflecting long-term and short-term intakes or comparison of FFQ with a reference method)

WFR, weighed food record; 24HR, 24 h recall; BM, biomarker; NA, not available, fewer than three studies found.

* Correlation: G, good (0·51–0·70); A, acceptable (0·30–0·50); P, poor (<0·30).

Discussion

In this review, using standardised quality assessment methods, we evaluated seventeen studies reporting on the validity of FFQ as a method for assessing food and nutrient intakes or dietary patterns in 12- to 36-month-old children. From the identified studies⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^– Reference Williams and Innis ²⁰ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Rankin, Levy and Warren ²³ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ^, Reference Watson, Heath and Taylor ²⁶ ^, Reference Sochacka-Tatara and Pac ²⁷ ^– Reference Mills, Skidmore and Watson ³¹ ^, Reference Metcalf, Scragg and Sharpe ³⁸ ⁾, semi-quantitative FFQ were shown to be valid and reproducible instruments in children as young as 1 year of age, generating adequate estimates specifically for Ca, vitamin C and Fe, with results similar to those seen in older children and adolescents⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Vereecken, Covents and Maes ²² ⁾.

FFQ are used to assess dietary intake due to their practicality, relative ease of administration, low participant burden, ability to assess intake over a prolonged period of time, and lower associated costs⁽ Reference Subar ⁴¹ ^, Reference Schatzkin, Kipnis and Carroll ⁴² ⁾. However, there are limited FFQ that have been specifically validated in 12- to 36-month-old children. In the present review, the methodological qualities of FFQ were considered in conjunction with analysis of weighted correlation coefficients where higher weights were given to studies that employed higher quality methodologies⁽ Reference Serra-Majem, Frost Andersen and Henríque-Sánchez ⁵ ^, Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez ³⁴ ⁾. Qualities included data collection methods, administration, seasonality, sample size, supplement use and statistics.

It is estimated that at approximately 7 to 8 years of age children become aware of their own food intake. Prior to this age the cognition and attention span required to perceive time frames, have knowledge of foods, recall food intake, and self-report are not sufficiently developed⁽ Reference Livingstone and Robson ¹ ⁾. Other explicit issues that arise in this age group of interest relate to the change in dietary practices seen across the age range and the variability in information provided by parent or proxy reporter, on foods that are eaten outside of their supervision, especially when the child is in day care.

The ability of FFQ to rank nutrient and energy intakes is improved through providing detailed quality information which can be achieved through interviewer administration⁽ Reference Marriott, Inskip and Borland ²¹ ⁾. The majority (71 %) of the included FFQ were self-administered by a parent or proxy reporter, similar to that seen in reviews conducted in wider age groups⁽ Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez ³⁴ ^, Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso ⁴³ ⁾. Cade et al.⁽ Reference Cade, Thompson and Burley ⁷ ⁾ reported an increase in correlation coefficients when the FFQ was interviewer-administered, with the exception of vitamin C, in comparison with those that were self-administered. This is especially relevant in the age group in question, where all information is obtained from a parent or proxy reporter. There is a need for further studies designed to evaluate the accuracy of parental-reported intakes in larger, ethnically diverse populations, using different dietary assessment methods⁽ Reference Collins, Burrows and Truby ⁴⁴ ⁾.

Estimation of portion size appears to have some advantage over using average or specified portion sizes, with higher measures of agreement between FFQ and reference method (r 0·5–0·6) and higher correlation coefficients when assessing repeatability⁽ Reference Tabacchi, Amodio and Di Pasquale ³³ ⁾. FFQ are seen to commonly overestimate energy intake, which is especially apparent in this population of interest⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Andersen, Lande and Trygg ¹⁵ ^, Reference Blum, Wei and Rockett ¹⁷ ^, Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾. This could be attributed to the fact that parents/caregivers may not adequately take into account the small portion sizes consumed by their children and that young children often ‘taste’ many foods without consuming full portions, leading to the inclusion of too large a portion size for some foods⁽ Reference Andersen, Lande and Arsky ¹¹ ^, Reference Parrish, Marshall and Krebs ¹⁸ ⁾. Many of the included studies assessed wider age ranges, i.e. beyond 12 to 36 months, which, as identified in a recent validation study performed in New Zealand, may act to improve validity of the FFQ as older children are more likely to eat meals that are similar to that of the family member or adult completing the FFQ⁽ Reference Watson, Heath and Taylor ²⁶ ⁾. Improvements in validity and bias could be seen through reducing the number of food items in the FFQ, shortening the reporting period, or adjusting portion sizes to more closely reflect those consumed by a young child⁽ Reference Collins, Burrows and Truby ⁴⁴ ⁾. This unique method has been explored in a study performed in 12- to 24-month-old New Zealand children where the amount of food offered and the amount eaten were recorded separately to encourage parents to differentiate between the two, and portion sizes were described according to the child's ‘palm volume’. 
This FFQ showed acceptable to good validity and high reproducibility in the assessment of dietary patterns and ranking nutrient intakes⁽ Reference Watson, Heath and Taylor ²⁶ ^, Reference Mills, Skidmore and Watson ³¹ ⁾.

In a systematic review by Henríquez-Sánchez et al.⁽ Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso ⁴³ ⁾, an improvement in correlation coefficients (r 0·52) was seen when the number of food items included in the FFQ was greater than 100 (r 0·47). The average number of food items used in the present review was 113. Estimation of supplement use should be considered when evaluating nutrient intake. Information on supplements should be included in dietary assessment with emphasis on the type and dose used. Data from FFQ and reference methods correlated better when supplement intake was captured⁽ Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso ⁴³ ⁾. Supplement use was acknowledged in one study⁽ Reference Williams and Innis ²⁰ ⁾ and seasonality in another⁽ Reference D'Ambrosio, Tiessen and Simpson ²⁴ ⁾, but neither was considered in the statistical analysis.

All studies calculated Pearson or Spearman's correlation coefficients (Table 1). Calculation of correlation coefficients does not measure agreement between the two methods of dietary assessment, only the degree to which the two methods are related⁽ Reference Bland and Altman ⁴⁵ ⁾. Their usefulness increases if used in conjunction with an alternative method such as Bland–Altman analysis, which provides an assessment of how well the FFQ and reference method agree on average⁽ Reference Bland and Altman ⁴⁵ ⁾. Other methods such as limits of agreement can be used to provide information on reliability and the direction and consistency of bias and the magnitude of errors between the two assessment methods⁽ Reference Cade, Thompson and Burley ⁷ ^, Reference Tabacchi, Amodio and Di Pasquale ³³ ⁾. It is difficult to summarise the correlation coefficients, agreement of validity and reproducibility of the included FFQ; therefore the present review should be used as a description of included FFQ, with potential for further meta-analyses.
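
The distinction between correlation and agreement can be made concrete with a minimal Bland–Altman sketch. It assumes paired FFQ and reference estimates for the same children and uses the conventional 95 % limits of agreement (mean difference ± 1·96 standard deviations of the differences); the function name is hypothetical.

```python
from statistics import mean, stdev


def bland_altman(ffq, reference):
    """Return (bias, lower limit, upper limit) for paired FFQ and reference estimates."""
    if len(ffq) != len(reference):
        raise ValueError("ffq and reference must have one value per child")
    diffs = [f - r for f, r in zip(ffq, reference)]
    bias = mean(diffs)   # positive bias means the FFQ estimate is higher on average
    spread = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * spread, bias + 1.96 * spread
```

An FFQ that reads a constant amount above the reference for every child would correlate perfectly with it yet show a non-zero bias here, which is the situation that correlation coefficients alone cannot reveal.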

Using 24-HR as the dietary reference method, FFQ were found to be a suitable tool for ranking children according to nutrient intakes (r 0·46), with stronger correlations in foods consumed more frequently⁽ Reference Sochacka-Tatara and Pac ²⁷ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. This highlights the difficulties with episodically consumed food items, as seen in the high day-to-day variability of a young child's diet⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Bel-Serrat, Mouratidou and Pala ³⁷ ⁾. Unadjusted FFQ nutrient estimates were larger than unadjusted nutrient estimates from multiple 24-HR and additional analysis of children that regularly received meals and snacks from other caregivers alongside parents revealed no apparent compromise or differences in correlations⁽ Reference Parrish, Marshall and Krebs ¹⁸ ⁾.

Using WFR as the reference method to assess long-term intakes, correlations were found to increase using nutrient density values over absolute intakes, but the FFQ had a low to moderate ability to rank children according to intakes of nutrients and foods⁽ Reference Andersen, Lande and Arsky ¹¹ ⁾. WFR are not affected by the same errors, such as portion size estimation, and memory lapses, as the FFQ⁽ Reference Gibson ³⁹ ⁾. The FFQ was found to be a useful tool for estimating short-term energy and nutrient intakes in healthy infants (at a group level)⁽ Reference Marriott, Inskip and Borland ²¹ ^, Reference Vereecken, Covents and Maes ²² ⁾. Marriott et al. ⁽ Reference Marriott, Inskip and Borland ²¹ ⁾ found that differences in micronutrient intakes were partly explained by changes in the consumption of milk between the two dietary assessments and by the different nutrient compositions of cows’ milk and formula⁽ Reference Marriott, Inskip and Borland ²¹ ⁾. This underestimation of Ca intake by the FFQ has been reported in three studies within this age group⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Marriott, Inskip and Borland ²¹ ^, Reference Huybrechts, De Bacquer and Matthys ⁴⁶ ⁾.

The use of FFQ to provide estimates of beverage intake has not been widely investigated. Marshall and Rankin concluded that a quantitative FFQ could be used to provide relative estimates of beverage, Ca, vitamin D and fluoride intakes in this age group⁽ Reference Marshall, Gilmore and Broffitt ¹⁹ ^, Reference Rankin, Levy and Warren ²³ ⁾ and higher correlations were seen at younger ages when the diet was more limited (r 0·85 at 6 months v. r 0·65 at 60 months)⁽ Reference Rankin, Levy and Warren ²³ ⁾.

The present review included correlations from three studies using a biomarker for validation⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ^, Reference Orton, Szabo and Clare-Salzler ²⁸ ⁾. In the assessment of specific nutritional status, Williams & Innis⁽ Reference Williams and Innis ²⁰ ⁾ showed that a semi-quantitative FFQ could be a useful tool in assessing Fe status in infants at a group level (energy adjusted r 0·71), but could result in underestimation of infants deemed to be at high risk of poor Fe status⁽ Reference Parrish, Marshall and Krebs ¹⁸ ^, Reference Williams and Innis ²⁰ ⁾.

Evaluating quality assessment

Where correlations for a given nutrient were available from three or more studies, quality-adjusted correlations were calculated. Higher weighted mean correlations were seen in studies that used WFR as the reference method for Ca, Fe and fibre when compared with other methods. This may be a reflection of the fact that a greater number of studies (60 %) used WFR as a reference method. The highest correlation coefficient weighted by quality was 0·51. There were not sufficient data to conduct the analysis for the remaining micronutrients, and only six out of the ten EURRECA priority nutrients could be assessed. This continues to remain a concern in this age group, where valid nutrient intake estimates could not be calculated. FFQ validation studies that assessed long-term intakes or used biomarkers as the reference tool were based on one or two studies, making them insufficient to reach any conclusion (Table 3).
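
A minimal sketch of the quality-adjusted calculation is given below. It assumes that the weighted mean is the sum of each study's correlation multiplied by its quality score, divided by the sum of the quality scores, and that nutrients with fewer than three studies are reported as not available. The function names and the example values in the test data are hypothetical; the bands follow the footnote to Table 3, and treating values above 0·70 as 'good' is an additional assumption because the footnote does not define that range.

```python
def weighted_mean_correlation(studies, min_studies=3):
    """Quality-weighted mean correlation for one nutrient.

    studies: list of (correlation, quality_score) pairs, one per validation study.
    Returns None when fewer than min_studies studies are available.
    """
    if len(studies) < min_studies:
        return None
    total_weight = sum(quality for _, quality in studies)
    if total_weight <= 0:
        raise ValueError("quality scores must sum to a positive value")
    return sum(r * quality for r, quality in studies) / total_weight


def classify(r):
    """Map a weighted correlation to the Table 3 bands: G, A, P or NA."""
    if r is None:
        return "NA"
    r = round(r, 2)  # the bands are reported to two decimal places
    if r >= 0.51:
        return "G"  # assumption: values above 0.70 are also treated as good
    if r >= 0.30:
        return "A"
    return "P"
```

Because the weights are the quality scores, a single high-scoring study can pull the weighted mean towards its own coefficient when only three or four studies are available.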

Limitations

There was a lack of data available to assess the ability of the FFQ in providing adequate estimates for several of the micronutrients highlighted in the present review (Table 3). The heterogeneity in the study designs, methods, outcomes and assessment tools made comparisons difficult; therefore the data were narratively synthesised and described. Due to natural variation, biomarkers may not always be a suitable option for comparison⁽ Reference Parrish, Marshall and Krebs ¹⁸ ⁾, and few studies validating FFQ using biomarkers were available for inclusion in the present review; such studies would reduce the correlated errors that arise when the reference method is based on self-reporting⁽ Reference Day, Wong and Bingham ⁴⁷ ⁾. Studies that assessed the validity of energy intake measurements using doubly labelled water did not meet our inclusion criteria. Due to the specific age range of interest, several studies that reported over a wider age range were excluded as reviewers were unable to extract these data. Correlation coefficients of the included studies were used for analysis and quality assessment in the present review; this limits the interpretation of the review as correlation coefficients only measure the degree to which the two assessment methods are related in a validation study, and not the agreement between the methods⁽ Reference Altman ⁴⁸ ⁾. De-attenuation and energy adjustment have strong implications for correlation coefficients and make it difficult to compare and draw conclusions. Only validation studies written in English were included for analysis. This may have led to the exclusion of reliable validation studies from other countries.

Conclusion

This systematic literature review presents a summary of the quality of FFQ validation studies in children aged 12 to 36 months. The included studies and quality assessment have provided information on aspects of FFQ design that increase validity, such as the number of items included, portion size estimations, appropriate food choices, administration method, validation and reproducibility methods, pre-testing, supplement use, seasonality and the statistical analyses. Semi-quantitative FFQ were shown to be valid and reproducible when estimating dietary intakes at a group level, and are an acceptable instrument for estimating intakes of Ca, vitamin C and Fe in children 12 to 36 months of age. There is insufficient evidence for the evaluation of the validity of micronutrients such as folate, vitamin D, Zn and Cu in this population. Based on the results of the included studies, meticulously designed and validated FFQ may be acceptable for estimating intakes of a number of important micronutrients in this age group.

Children aged 12 to 36 months would benefit from further validation studies using appropriate population-specific tools addressing areas highlighted in this review that are unique to dietary assessment in young children. Such areas include further development on portion size estimation, capturing irregular eating patterns, overcoming administration errors with the implementation of computer-assisted methods or the development of novel tools to provide evidence for further validation studies of appropriate population-specific tools, alongside the identification, management and primary prevention of diet-related disease processes.

Acknowledgements

There was no financial support.

A. L. and R. B. completed the literature search, screening process and quality assessment. R. B. was the second independent reviewer. A. L. extracted all data, completed the critical appraisal and completed the first draft of the manuscript and contributed to manuscript revision. C. R. W. and C. C. G. helped develop the review protocol and edited the manuscript. All authors approved the submitted version.

The authors would also like to acknowledge Frances Clements, University of Auckland Faculty of Medical and Health Sciences Subject Librarian for her expert assistance in developing the search strategy for this review.

There were no conflicts of interest.

References

1. Livingstone, M & Robson, P (2000) Measurement of dietary intake in children. Proc Nutr Soc 59, 279–293.

2. Kolodziejczyk, JK, Merchant, G & Norman, GJ (2012) Reliability and validity of child/adolescent food frequency questionnaires that assess foods and/or food groups. J Pediatr Gastroenterol Nutr 55, 4–13.

3. Willett, W (2013) Nutritional Epidemiology: Monographs in Epidemiology and Biostatistics, 3rd ed. New York: Oxford University Press.

4. Number not used.

5. Serra-Majem, L, Frost Andersen, L, Henríque-Sánchez, P, et al. (2009) Evaluating the quality of dietary intake validation studies. Br J Nutr 102, S3–S9.

6. Vereecken, CA (2010) A longitudinal study on dietary habits and the primary socialization of these habits in young children. Verh K Acad Geneeskd Belg 72, 295–308.

7. Cade, J, Thompson, R, Burley, V, et al. (2002) Development, validation and utilisation of food-frequency questionnaires – a review. Public Health Nutr 5, 567–587.

8. Block, G & Hartman, AM (1989) Issues in reproducibility and validity of dietary studies. Am J Clin Nutr 50, 1133–1138; discussion 1231–1235.

9. Coulston, AM & Boushey, C (2008) Nutrition in the Prevention and Treatment of Disease. Amsterdam: Academic Press.

10. Serdula, MK, Alexander, MP, Scanlon, KS, et al. (2001) What are preschool children eating? A review of dietary assessment. Annu Rev Nutr 21, 475–498.

11. Andersen, LF, Lande, B, Arsky, GH, et al. (2003) Validation of a semi-quantitative food-frequency questionnaire used among 12-month-old Norwegian infants. Eur J Clin Nutr 57, 881–888.

12. Ortiz-Andrellucchi, A, Henríquez-Sánchez, P, Sánchez-Villegas, A, et al. (2009) Dietary assessment methods for micronutrient intake in infants, children and adolescents: a systematic review. Br J Nutr 102, S87–S117.

13. Livingstone, M, Robson, P & Wallace, J (2004) Issues in dietary intake assessment of children and adolescents. Br J Nutr 92, S213–S222.

14. University of York, Centre for Reviews and Dissemination (2015) International Prospective Register of Systematic Reviews. http://www.crd.york.ac.uk/PROSPERO/ (accessed May 2016).

15. Andersen, L, Lande, B, Trygg, K, et al. (2004) Validation of a semi-quantitative food-frequency questionnaire used among 2-year-old Norwegian children. Public Health Nutr 7, 757–764.

16. Iannotti, RJ, Zuckerman, AE, Blyer, EM, et al. (1994) Comparison of dietary intake methods with young children. Psychol Rep 74, 883–889.

17. Blum, RE, Wei, EK, Rockett, HR, et al. (1999) Validation of a food frequency questionnaire in Native American and Caucasian children 1 to 5 years of age. Matern Child Health J 3, 167–172.

18. Parrish, LA, Marshall, JA, Krebs, NF, et al. (2003) Validation of a food frequency questionnaire in preschool children. Epidemiology 14, 213–217.

19. Marshall, TA, Gilmore, JME, Broffitt, B, et al. (2003) Relative validation of a beverage frequency questionnaire in children ages 6 months through 5 years using 3-day food and beverage diaries. J Am Diet Assoc 103, 714–720.

20. Williams, PL & Innis, SM (2005) Food frequency questionnaire for assessing infant iron nutrition. Can J Diet Pract Res 66, 176–182.

21. Marriott, LD, Inskip, HM, Borland, SE, et al. (2009) What do babies eat? Evaluation of a food frequency questionnaire to assess the diets of infants aged 12 months. Public Health Nutr 12, 967–972.

22. Vereecken, C, Covents, M & Maes, L (2010) Comparison of a food frequency questionnaire with an online dietary assessment tool for assessing preschool children's dietary intake. J Hum Nutr Diet 23, 502–510.

23. Rankin, SJ, Levy, SM, Warren, JJ, et al. (2011) Relative validity of an FFQ for assessing dietary fluoride intakes of infants and young children living in Iowa. Public Health Nutr 14, 1229–1236.

24. D'Ambrosio, A, Tiessen, A & Simpson, JR (2012) Development of a food frequency questionnaire for toddlers of Low-German-speaking Mennonites from Mexico. Can J Diet Pract Res 73, 40–44.

25. Number not used.

26. Watson, EO, Heath, AM, Taylor, RW, et al. (2015) Relative validity and reproducibility of an FFQ to determine nutrient intakes of New Zealand toddlers aged 12–24 months. Public Health Nutr 18, 3265–3271.

27. Sochacka-Tatara, E & Pac, A (2014) Relative validity of a semi-quantitative FFQ in 3-year-old Polish children. Public Health Nutr 17, 1738–1744.

28. Orton, HD, Szabo, NJ, Clare-Salzler, M, et al. (2008) Comparison between omega-3 and omega-6 polyunsaturated fatty acid intakes as assessed by a food frequency questionnaire and erythrocyte membrane fatty acid composition in young children. Eur J Clin Nutr 62, 733–738.

29. Klohe, DM, Clarke, KK, George, GC, et al. (2005) Relative validity and reliability of a food frequency questionnaire for a triethnic population of 1-year-old to 3-year-old children from low-income families. J Am Diet Assoc 105, 727–734.

30. Bel-Serrat, S, Fernandez Alvira, JM, Pala, V, et al. (2011) Relative validation of two dietary assessment methods: SACINA (24-h recall) and food frequency questionnaire. Int J Obes 35, S152.

31. Mills, VC, Skidmore, PM, Watson, EO, et al. (2015) Relative validity and reproducibility of a food frequency questionnaire for identifying the dietary patterns of toddlers in New Zealand. J Acad Nutr Diet 115, 551–558.

32. Dennis, LK, Snetselaar, LG, Nothwehr, FK, et al. (2003) Developing a scoring method for evaluating dietary methodology in reviews of epidemiologic studies. J Am Diet Assoc 103, 483–487.

33. Tabacchi, G, Amodio, E, Di Pasquale, M, et al. (2014) Validation and reproducibility of dietary assessment methods in adolescents: a systematic literature review. Public Health Nutr 17, 2700–2714.

34. Roman-Viñas, B, Ortiz-Andrellucchi, A, Mendez, M, et al. (2010) Is the food frequency questionnaire suitable to assess micronutrient intake adequacy for infants, children and adolescents? Matern Child Nutr 6, 112–121.

35. National Health and Medical Research Council (NHMRC). (2000) How to Use the Evidence: Assessment and Application of Scientific Evidence. Canberra: Biotex.

36. Number not used.

37. Bel-Serrat, S, Mouratidou, T, Pala, V, et al. (2014) Relative validity of the Children's Eating Habits Questionnaire-food frequency section among young European children: the IDEFICS Study. Public Health Nutr 17, 266–276.

38. Metcalf, PA, Scragg, RKR, Sharpe, S, et al. (2003) Short-term repeatability of a food frequency questionnaire in New Zealand children aged 1–14 y. Eur J Clin Nutr 57, 1498–1503.

39. Gibson, RS (2005) Principles of Nutritional Assessment, 2nd ed. New York: Oxford University Press.

40. Treadwell, JR, Tregear, SJ, Reston, JT, et al. (2006) A system for rating the stability and strength of medical evidence. BMC Med Res Methodol 6, 52.

41. Subar, AF (2004) Developing dietary assessment tools. J Am Diet Assoc 104, 769–770.

42. Schatzkin, A, Kipnis, V, Carroll, RJ, et al. (2003) A comparison of a food frequency questionnaire with a 24-hour recall for use in an epidemiological cohort study: results from the biomarker-based Observing Protein and Energy Nutrition (OPEN) study. Int J Epidemiol 32, 1054–1062.

43. Henríquez-Sánchez, P, Sánchez-Villegas, A, Doreste-Alonso, J, et al. (2009) Dietary assessment methods for micronutrient intake: a systematic review on vitamins. Br J Nutr 102, S10–S37.

44. Collins, CE, Burrows, TL, Truby, H, et al. (2013) Comparison of energy intake in toddlers assessed by food frequency questionnaire and total energy expenditure measured by the doubly labeled water method. J Acad Nutr Diet 113, 459–463.

45. Bland, JM & Altman, DG (1999) Measuring agreement in method comparison studies. Stat Methods Med Res 8, 135–160.

46. Huybrechts, I, De Bacquer, D, Matthys, C, et al. (2006) Validity and reproducibility of a semi-quantitative food-frequency questionnaire for estimating calcium intake in Belgian preschool children. Br J Nutr 95, 802–816.

47. Day, NE, Wong, MY, Bingham, S, et al. (2004) Correlated measurement error – implications for nutritional epidemiology. Int J Epidemiol 33, 1373–1381.

48. Altman, DG (1990) Practical Statistics for Medical Research. Boca Raton, FL: CRC Press.

Fig. 1. Inclusion and exclusion criteria used to select studies for inclusion in the systematic review.

Table 1. Characteristics of included studies evaluating long-term or short-term nutrient intake, or biomarker, food or food group

Fig. 2. Selection process flow of articles identified that assess validity of FFQ methods in children aged 12–36 months.
