Hostname: page-component-cd9895bd7-gxg78 Total loading time: 0 Render date: 2024-12-26T13:06:48.711Z Has data issue: false hasContentIssue false

Quality of food-frequency questionnaire validation studies in the dietary assessment of children aged 12 to 36 months: a systematic literature review

Published online by Cambridge University Press:  08 May 2017

Amy Lovell*
Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Rhodi Bulloch
Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Clare R. Wall
Affiliation:
Discipline of Nutrition, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Cameron C. Grant
Affiliation:
Department of Paediatrics: Child and Youth Health, University of Auckland, Auckland, New Zealand Centre for Longitudinal Research He Ara ki Mua, University of Auckland, Auckland, New Zealand Starship Children's Hospital, Auckland District Health Board, Auckland, New Zealand
*
*Corresponding author: A. Lovell, email [email protected]

Abstract

A child's diet is an important determinant of growth and development. Because of this, the accurate assessment of dietary intake in young children remains a challenge. A systematic search of studies validating FFQ methodologies in children 12 to 36 months of age was completed. English-language articles published until March 2016 were searched using three electronic databases (MEDLINE, EMBASE and CINAHL). Quality assessment of the identified studies was carried out using The Reduced Summary Score and EURopean micronutrient RECommendations Aligned (EURRECA) scoring system. Seventeen studies were included and categorised according to whether they reflected long-term (≥7 d) or short-term (<7 d) intake, or used a biomarker. A total score for each micronutrient was calculated from the mean of the correlation coefficients weighted by the study quality score. At least three validation studies per micronutrient were required for inclusion. Fifteen studies (83 %) that considered validity of the FFQ in assessing nutrient intakes had quality scores from 2·5 to 6·0. Of those, ten (67 %) studies found FFQ to have good correlations in assessing dietary intake (>0·4). Of the nutrients with three or more studies available, FFQ validated using a reference method reflecting short-term intake had a good weighted correlation for Ca (0·51), and acceptable weighted correlations for vitamin C (0·31) and Fe (0·33). Semi-quantitative FFQ were shown to be valid and reproducible when estimating dietary intakes at a group level, and are an acceptable instruments for estimating intakes of Ca, vitamin C and Fe in children 12 to 36 months of age.

Type
Systematic Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s) 2017

The accurate description and measurement of dietary intake is a necessary step in determining the nutritional adequacy of diets in individuals or a population( Reference Livingstone and Robson 1 ). Having valid and reliable assessment tools is essential to increase our understanding of the relationship between dietary intake and health outcomes, and our understanding of the dietary determinants of nutritional status( Reference Kolodziejczyk, Merchant and Norman 2 ).

Food and nutrient intakes are estimated via dietary assessment methods that differ according to a study's aims and objectives, skills of the study population, accuracy of the required dietary data, study resources and study design( Reference Willett 3 ). Most epidemiological studies use variations of the FFQ, which can be validated using biomarkers or tools that measure daily dietary intake( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ). The FFQ has an advantage of being an inexpensive method of obtaining data from a large number of participants, with a relatively low respondent burden and can be used to estimate an individual's average consumption over an extended period of time( Reference Willett 3 , Reference Vereecken 6 ).

There is no definitive ‘gold standard’ in dietary assessment, nor is there a ‘gold standard’ for assessing the validity of FFQ( Reference Cade, Thompson and Burley 7 ). Therefore estimation of a tool's relative validity relies upon a comparison with a superior and preferably independent technique, known as comparative validation( Reference Willett 3 ). Here, weighed food records (WFR) and 24-h recalls (24-HR) are commonly used due to their greater precision in the quantification of intake( Reference Willett 3 ). Factors that may affect the validity of a diet questionnaire have been reviewed( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 , Reference Block and Hartman 8 ).

Early childhood is a life phase where the assessment of dietary intake is particularly challenging. Measurement of energy and nutrient intakes in young children is affected by unique respondent and observer considerations, making the collection of accurate and reliable dietary intakes difficult( Reference Livingstone and Robson 1 ). Young children aged 12 to 36 months, have highly variable diets that are characterised by rapidly changing food habits and transitions in dietary patterns, and often not all food served to an infant is consumed in its entirety( Reference Coulston and Boushey 9 Reference Ortiz-Andrellucchi, Henríquez-Sánchez and Sánchez-Villegas 12 ). The acquisition of dietary intake information for children less than 7 years of age is dependent upon surrogate reporters, e.g. parents, caregivers and external caretakers( Reference Livingstone and Robson 1 , Reference Livingstone, Robson and Wallace 13 ). Therefore, the accuracy of dietary assessment in this age group depends on an adult's ability to reliably report on their intake, with previous evidence suggesting that parents can provide a more reliable report on foods consumed in the home setting, rather than away from home( Reference Livingstone and Robson 1 , Reference Livingstone, Robson and Wallace 13 ).

As a consequence of these methodological challenges, the number and type of validated tools available to assess the dietary intake of young children, particularly children 12 to 36 months of age, are limited. The aim of this systematic literature review was to describe and assess the quality of studies reporting on the validity of FFQ as a method for assessing food and nutrient intakes or dietary patterns in 12- to 36-month-old children.

Methods

Protocol registration

The inclusion and exclusion criteria, and analysis methods were specified in advance in a documented protocol. This protocol was not registered with PROSPERO( 14 ) as it is an assessment of the quality of validation studies and does not report on a health-related outcome.

Eligibility criteria

Studies that evaluated the validity of FFQ in the assessment of dietary intake, food(s), and dietary patterns with a reference dietary assessment tool (e.g. 24-HR, diet records, diet histories, WFR and biomarkers) in healthy children aged 12 to 36 months and met all the inclusion criteria (Fig. 1.) were included in the review. Randomised controlled trials were not available; therefore analytical study designs were limited to prospective and retrospective cohort studies. Case series, case reports and case–control studies were excluded due to the high potential for bias.

Fig. 1. Inclusion and exclusion criteria used to select studies for inclusion in the systematic review.

Information sources

Studies were identified via searching online databases, hand-searching reference lists of original articles, and cited reference searches. The search focused on relevant studies published before March 2016 and was limited to those published in English, without limits on time frame or country. Grey literature was also considered.

Search strategy

A literature search was applied to MEDLINE (1946 to present), EMBASE (1980 to present) and CINAHL (1937 to present) electronic databases, and Google Scholar. Medical Subject Headings (MeSH), MeSH major topics, and free text terms were developed under four group headings in MEDLINE and EMBASE databases. The MeSH search terms used in the search were developed under four group headings: (1) infant (12–36 months), e.g. toddler, preschool*, child, infant, newborn*, pre-school*, babies, baby, kindergarten, children under 2, children under 3; (2) diet, e.g. nutrition, dietary pattern, food intake, diet quality, infant nutrition, child nutrition, nutritional assessment, eating pattern, nutritional status, feeding behaviour, food combination, childhood diet, infant food; (3) dietary assessment, e.g. diet surveys, questionnaires, instrument, dietary intake methods, assess*, evaluat*, dietary intake methods, nutrition surveys; (4) dietary assessment tool, e.g. food frequency questionnaire, FFQ; (5) instrument validation, e.g. validity, reproducibility, correlation coefficient, reliability, validation studies, replication stud*, correlation stud*, repeatability. Key words and combinations were identified in free text, article titles and abstracts, and were used to perform a comprehensive search of the databases. Search terms and strategies were adapted for use in other databases and were peer reviewed. All retrieved articles were sent to Refworks® (version 4.4.1237; ProQuest LLC) where duplicates were removed.

Study selection

Two reviewers (A. L. and R. B.) determined a study's eligibility in an independent, unblinded and standardised manner. Systematic literature reviews were not included in the analysis. Titles and abstracts were reviewed to assess whether they met the inclusion criteria for full-text review (Fig. 1). Disagreements between reviewers were resolved by consensus, or if the decision on study inclusion or exclusion were unclear, the full text was obtained. In studies where the age range of participants was included, but was much wider than 12 to 36 months, e.g. 2 to 9 years, the reviewers attempted to obtain results from authors specific to the age range of interest. Full-text articles that fulfilled all criteria for inclusion were reviewed in a second screening process as the definitive step for inclusion.

Data collection process

A data extraction sheet based on examples found in the selected literature was developed. One review author (A. L.) extracted key data into a prepared table, which was checked by a co-author (R. B.). Any disagreements were resolved through discussion between the review authors (A. L. and R. B.), and if no agreement could be reached a prearranged third reviewer was asked to arbitrate (C. W.). Direct contact via email was made with four authors to obtain information in addition to that which could be abstracted from the published paper. In all four cases this request was for information within the age range of interest (12 to 36 months) from a study that reported data over a wider age range. One follow-up email was sent if no response was received. No authors responded with data from their studies specific to the age range of interest.

Data items

A concise overview of the seventeen included studies is shown in Table 1 ( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 Reference Mills, Skidmore and Watson 31 ). The areas of interest included: population characteristics (size, age, location, ethnicity), FFQ characteristics (food groups, food items, consumption interval, administration method, portion estimation, number of FFQ administered, and FFQ re-test interval), reference method used, outcome measures (validity, reproducibility) and the statistics employed to assess validity between two methods or reproducibility of the FFQ.

Table 1. Characteristics of included studies evaluating long-term or short-term nutrient intake, or biomarker, food or food group

3D, three-dimensional; 24HR, 24 h recall; CC, correlation coefficient; DR, diet record; FD, food diary; FR, food record; HFFQ, Harvard Service Food Frequency Questionnaire; ICC, intra-class correlation; LOA, limits of agreement; NA, not applicable; NR, not reported; rec., record; WFR, weighed food record.

* Mean.

† Median.

Synthesis of results

Studies were classified into three categories based on the reference method applied to the validation study. This method has been previously reported and consisted of:

  1. (1) Long-term intake – the reference method covered ≥7 d.

  2. (2) Short-term intake – the reference method covered <7 d.

  3. (3) Biomarker – the reference method was a biomarker.

Quality assessment

Following classification, the two reviewers (A. L. and R. B.) independently completed quality assessment of the included validation studies using the reduced summary score by Dennis et al.( Reference Dennis, Snetselaar and Nothwehr 32 ) which assessed the quality of the nutrition information from the FFQ, and an additional scoring system developed by the EURopean Micronutrient RECommendations Aligned (EURRECA) network used in studies assessing nutrient intakes with the aim of including, excluding and weighting studies( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 , Reference Ortiz-Andrellucchi, Henríquez-Sánchez and Sánchez-Villegas 12 ). These scoring tools evaluated methodological quality of the identified studies and determined the extent to which a study addressed the possibility of bias in their design, conduct and analysis. This dual scoring system approach was used in a previous review of FFQ for assessing dietary intake in adolescents( Reference Tabacchi, Amodio and Di Pasquale 33 ).

Because of the heterogeneity between the dietary assessment methods used as the reference, study designs, populations, and duration of the study, only a narrative review of the literature was performed. A meta-analysis could not be conducted due to a lack of randomised controlled trials.

The summary score by Dennis et al.( Reference Dennis, Snetselaar and Nothwehr 32 ) scores studies based on objective measures of quality dietary assessment. The reduced summary score with a maximum score of 8 was utilised for simplified quality assessment of the FFQ as seen in Tabacchi et al.( Reference Tabacchi, Amodio and Di Pasquale 33 ) Validation studies that had a reduced summary score of ≥5 were classified as being ‘high quality’ and scores <5 as ‘low quality’. This scoring tool was used for all included studies. The EURRECA( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ) scoring system was only applied to studies that assessed nutrient intakes. Summary scores range from 0 (poorest quality) to 7 (highest possible score) and are ranked as ‘very good/excellent’ score ≥5; ‘good’ score 3·5 ≤ and <5; ‘acceptable’ score 2·5 ≤ and <3·5; and ‘poor’ score <2·5( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ). In order to estimate a mean correlation per micronutrient for the included studies, the correlation coefficient from each study was initially multiplied by its quality score. Next, the sum of the weighted correlations was divided by the sum of the quality scores to provide a correlation coefficient that was adjusted for the study's methodological quality. Mean weighted correlation coefficients were only calculated for micronutrients with correlations available from three or more studies( Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez 34 ). This allows for concurrent analysis of multiple validation studies and gives an estimate of a mean correlation coefficient per micronutrient for a given dietary assessment method( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ). The intake method was rated as poor when the correlation was <0·30, acceptable between 0·30 and 0·50, good between 0·51 and 0·70, and correlations >0·70 were very good( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ).

Results

Study selection

A total of 373 articles were identified (Fig. 2). Following removal of duplicates, 236 articles unique by title and abstract remained for review. Application of inclusion and exclusion criteria resulted in fifty-nine articles being selected for full-text review. Thirty-nine studies were included for quality appraisal. All studies were cross-sectional in their design, and thus classified as level IV evidence( 35 ). Following quality appraisal twenty-two studies were excluded, leaving seventeen articles( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 Reference Klohe, Clarke and George 29 , Reference Mills, Skidmore and Watson 31 , Reference Bel-Serrat, Mouratidou and Pala 37 ) identified as assessing the validity of an FFQ against a dietary reference instrument in children 12 to 36 months of age.

Fig. 2. Selection process flow of articles identified that assess validity of FFQ methods in children aged 12–36 months.

Nine of the publications reported results from North American countries (USA and Canada)( Reference Iannotti, Zuckerman and Blyer 16 Reference Williams and Innis 20 , Reference Rankin, Levy and Warren 23 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Orton, Szabo and Clare-Salzler 28 , Reference Klohe, Clarke and George 29 ), five from the UK and Europe( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Marriott, Inskip and Borland 21 , Reference Sochacka-Tatara and Pac 27 , Reference Bel-Serrat, Fernandez Alvira and Pala 30 ), and three from New Zealand( Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 , Reference Metcalf, Scragg and Sharpe 38 ). The number of participants ranged from seventeen( Reference Iannotti, Zuckerman and Blyer 16 ) to 240( Reference Marshall, Gilmore and Broffitt 19 ), with two studies presenting data from large cohorts: The Iowa Fluoride Study( Reference Rankin, Levy and Warren 23 ) and The IDEFICS Study (Identification and prevention of Dietary- and lifestyle-induced health EFfects In Children and infantS)( Reference Bel-Serrat, Mouratidou and Pala 37 ).

Characteristics of included studies

Characteristics of each of the seventeen included validation studies are described in Table 1. Fourteen studies considered the validity of the FFQ to assess nutrient intakes( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Sochacka-Tatara and Pac 27 , Reference Orton, Szabo and Clare-Salzler 28 ), and three studies considered values on the validity of the FFQ to assess food or food group(s)( Reference Klohe, Clarke and George 29 , Reference Mills, Skidmore and Watson 31 , Reference Bel-Serrat, Mouratidou and Pala 37 ). Two studies assessing nutrient intakes also used biomarkers as an additional reference method( Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 ). Eleven of the included FFQ were semi-quantitative( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 Reference Marshall, Gilmore and Broffitt 19 , Reference Marriott, Inskip and Borland 21 , Reference Rankin, Levy and Warren 23 , Reference Sochacka-Tatara and Pac 27 Reference Klohe, Clarke and George 29 ), five were quantitative( Reference Marshall, Gilmore and Broffitt 19 , Reference Williams and Innis 20 , Reference Vereecken, Covents and Maes 22 , Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ), and one recorded frequency of consumption and not portion sizes( Reference Bel-Serrat, Mouratidou and Pala 37 ). The number of food items ranged from seventy-eight( Reference Marriott, Inskip and Borland 21 ) to 191( Reference Williams and Innis 20 , Reference Klohe, Clarke and George 29 ) with an average of 113 food items. Those studies that assessed food and/or food group intakes had between seven( Reference Marshall, Gilmore and Broffitt 19 ) and seventy-seven( Reference Vereecken, Covents and Maes 22 ) food groups. Food intake intervals ranged from intake over the previous 7 d( Reference Marshall, Gilmore and Broffitt 19 , Reference Rankin, Levy and Warren 23 ) to over the last year( Reference Parrish, Marshall and Krebs 18 , Reference Orton, Szabo and Clare-Salzler 28 , Reference Orton, Szabo and Clare-Salzler 28 ), with the majority describing intake over the last month( Reference Blum, Wei and Rockett 17 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Sochacka-Tatara and Pac 27 , Reference Klohe, Clarke and George 29 , Reference Bel-Serrat, Mouratidou and Pala 37 ).

Two studies were grouped according to a reference method that reflected long-term intake (7-d WFR)( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 ). Ten studies were grouped according to a reference method that reflected short-term intake where four applied 24-HR( Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Sochacka-Tatara and Pac 27 ) and five applied WFR( Reference Marshall, Gilmore and Broffitt 19 Reference Rankin, Levy and Warren 23 ), one of these being online( Reference Vereecken, Covents and Maes 22 ). One study utilised biomarkers as a reference method( Reference Orton, Szabo and Clare-Salzler 28 ). Among the seven studies that used WFR, the number of recorded days varied from 3 to 7 d( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Marshall, Gilmore and Broffitt 19 , Reference Williams and Innis 20 Reference Vereecken, Covents and Maes 22 , Reference Klohe, Clarke and George 29 ). The number of repeated 24-HR ranged from 2( Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Bel-Serrat, Mouratidou and Pala 37 ) or 3( Reference Blum, Wei and Rockett 17 , Reference Williams and Innis 20 , Reference Sochacka-Tatara and Pac 27 ) days of non-consecutive administration. Eleven studies were self-administered( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Blum, Wei and Rockett 17 Reference Marshall, Gilmore and Broffitt 19 , Reference Vereecken, Covents and Maes 22 , Reference Rankin, Levy and Warren 23 , Reference Sochacka-Tatara and Pac 27 Reference Klohe, Clarke and George 29 , Reference Bel-Serrat, Mouratidou and Pala 37 ), by a parent or equivalent proxy reporter, and six studies were interviewer administered( Reference Iannotti, Zuckerman and Blyer 16 , Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ). Methods of portion size estimation ranged from household measures/standard portion sizes( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Sochacka-Tatara and Pac 27 , Reference Orton, Szabo and Clare-Salzler 28 ) to portion sizes derived from national nutrition survey data( Reference Blum, Wei and Rockett 17 , Reference Vereecken, Covents and Maes 22 , Reference Klohe, Clarke and George 29 ). Three studies did not describe portion estimation( Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 , Reference Rankin, Levy and Warren 23 ), and two studies used a unique ‘palm’ measurement( Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ). Of the thirteen studies that calculated food intakes into nutrient intakes, six reported using national food composition databases (e.g. United States Department of Agriculture)( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Vereecken, Covents and Maes 22 Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Orton, Szabo and Clare-Salzler 28 ), and two used other food composition databases (e.g. Harvard Nutrient Database)( Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 ). Although not the primary aim of the validation study, two studies( Reference Iannotti, Zuckerman and Blyer 16 , Reference Parrish, Marshall and Krebs 18 ) examined whether there were any differences between sex and care status (i.e. in child care or at home) when comparing mean nutrient intake values.

Statistical analysis

Statistical analyses used in the assessment of FFQ validity, and in some cases reproducibility, are described in Table 1. All included studies calculated differences in means and/or mean comparisons. Pearson or Spearman's correlation coefficients were calculated in all studies. Paired Student's t tests were used evaluate whether there was any difference between the mean nutrient and food intakes determined by the two assessment methods( Reference Parrish, Marshall and Krebs 18 ). Factors that affect the validity of a dietary assessment instrument included: population characteristics, acceptability of the reference method data, FFQ design/quantification, quality control and data management( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 , Reference Tabacchi, Amodio and Di Pasquale 33 ).

The calculation of weighted correlation coefficients allowed comparison with the other included studies. Here, correlation coefficients between 0·51 and 0·7 are considered good( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 , Reference Cade, Thompson and Burley 7 ). Four studies considered crude correlation coefficients( Reference Iannotti, Zuckerman and Blyer 16 , Reference Marshall, Gilmore and Broffitt 19 , Reference Vereecken, Covents and Maes 22 , Reference Rankin, Levy and Warren 23 ), whilst seven studies adjusted nutrients using energy-adjusted values( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference Orton, Szabo and Clare-Salzler 28 ), and three studies calculated de-attenuated values to account for measurement error( Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 , Reference Mills, Skidmore and Watson 31 ) or intra-class correlations( Reference D'Ambrosio, Tiessen and Simpson 24 ). All six studies that performed cross-classification analysis ranked participants by using the same or adjacent quartile. Three of these studies( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Watson, Heath and Taylor 26 ) assessed the classification of participants according to their nutrient intakes and three studies( Reference Klohe, Clarke and George 29 , Reference Mills, Skidmore and Watson 31 , Reference Bel-Serrat, Mouratidou and Pala 37 ) assessed the classification of participants according to their food or food group intakes. Weighted κ was calculated in two studies that considered food intakes( Reference Marshall, Gilmore and Broffitt 19 , Reference Rankin, Levy and Warren 23 ). Here, four categories were used to calculate κ statistics and classify food intake data.

Two studies( Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 ) assessed the reproducibility of the FFQ for estimating dietary intake patterns and estimation of reproducibility of nutrient intakes was achieved by calculating correlation coefficients and intra-class correlations. Acceptable intra-class correlations ranged from >0·4( Reference Cade, Thompson and Burley 7 , Reference Watson, Heath and Taylor 26 , Reference Gibson 39 ) to 0·7( Reference D'Ambrosio, Tiessen and Simpson 24 ) when establishing test–retest reliability of the FFQ. In order to test reproducibility, five( Reference Blum, Wei and Rockett 17 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Klohe, Clarke and George 29 , Reference Mills, Skidmore and Watson 31 ) studies administered the FFQ on two occasions. Intervals between test and retest ranged from 2 weeks( Reference Klohe, Clarke and George 29 ) to 1 month( Reference Blum, Wei and Rockett 17 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ). One study( Reference Blum, Wei and Rockett 17 ) administered the FFQ on two occasions, 1 month apart but did not report on the statistical analysis used for reproducibility.

Results of individual studies by validation method used

Included reviews were analysed according to the reference method used (i.e. WFR, 24-HR or biomarker) and whether the tool reflected long-term or short-term intake.

FFQ v. 24-h recalls

Five studies( Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Sochacka-Tatara and Pac 27 , Reference Bel-Serrat, Mouratidou and Pala 37 ) used 24-HR as their reference method to validate an FFQ. In all studies the FFQ overestimated median/mean nutrient intake estimates but could provide reliable estimates of nutrient intakes in young children with good agreement when compared with the 24-HR (Table 1). Nutrient correlations that were energy-adjusted or de-attenuated (to reduce dependency on between-person variation) were found to have higher correlation coefficients compared with crude values. Cross-classification into low, medium and high consumers was moderate (>30 % classification into the same quartile). One study( Reference D'Ambrosio, Tiessen and Simpson 24 ) assessed repeatability/reproducibility using a 24-HR as a reference tool. Correlations for most nutrients were >0·70, indicating low within-person variation.

FFQ v. food record (±weighing)

Eleven studies used WFR as their reference method to validate an FFQ( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Iannotti, Zuckerman and Blyer 16 , Reference Marshall, Gilmore and Broffitt 19 , Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference Vereecken, Covents and Maes 22 , Reference Rankin, Levy and Warren 23 , Reference Watson, Heath and Taylor 26 , Reference Klohe, Clarke and George 29 , Reference Mills, Skidmore and Watson 31 ). Ten studies that estimated nutrient intakes found that the FFQ tended to overestimate intakes (Table 1) but found good correlations (>0·4)( Reference Cade, Thompson and Burley 7 ) between the FFQ and WFR for most nutrients, energy intakes and food intakes. The included FFQ mostly indicated a moderate ability to rank infants according to their nutrient intakes, with two studies by Andersen et al.( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 ) showing that the ability of the questionnaire to rank infants according to their intakes increased when using nutrient density values over absolute values.

FFQ v. biomarker

Using biomarkers as the reference method was less frequent. Three studies used biomarkers( Reference Andersen, Lande and Trygg 15 , Reference D'Ambrosio, Tiessen and Simpson 24 , 35 ). Two articles( Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 ) presented validation of an FFQ using biomarkers and a second dietary assessment instrument (24-HR or WFR) as reference methods. The biomarkers analysed included: total lipids, plasma levels of vitamins C, D and E, retinol and β-carotene( Reference Parrish, Marshall and Krebs 18 ), serum markers of Fe( Reference Williams and Innis 20 ) and fatty acid composition measured in erythrocytes( Reference Orton, Szabo and Clare-Salzler 28 ).

Evaluation of food or food groups

Using a semi-quantitative FFQ excellent reliability and adequate validity were seen in assessing food choices of low-income children( Reference Klohe, Clarke and George 29 ), with low levels of agreement and limited ability to rank children according to intakes of food groups( Reference Bel-Serrat, Mouratidou and Pala 37 ). More recently, in Otago, New Zealand, a semi-quantitative FFQ displayed good validity (r 0·52) and high reproducibility in the identification of dietary patterns, and in ranking the diets of toddlers when compared with a 5-d WFR. The FFQ overestimated energy and nutrient intakes and cannot measure absolute intakes, but could be used to identify toddlers at extreme ends of intake distribution( Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ).

Additional analysis: quality assessment

A summary of the quality assessment of the seventeen included studies are shown in Table 2. Using the reduced summary score( Reference Dennis, Snetselaar and Nothwehr 32 ), one validation study that assessed nutrient intakes received a low quality ranking( Reference Marshall, Gilmore and Broffitt 19 ) and one study that assessed food intake received a low quality ranking( Reference Bel-Serrat, Mouratidou and Pala 37 ). The remaining fifteen studies received high quality rankings. Criteria that reduced the quality of the study included the number of food items in the FFQ (<70 food items is likely to reduce the quality of the nutrition information), and if the FFQ was self-administered.

Table 2. Quality scores using methods described by Dennis et al.( Reference Dennis, Snetselaar and Nothwehr 32 ) and the EURopean Micronutrient RECommendations Aligned (EURRECA) scoring tool( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 )

NA, not available, fewer than three studies found.

* Dennis et al. ( Reference Dennis, Snetselaar and Nothwehr 32 ) quality level: high (≥5); low (<5).

† EURRECA quality score: very good/excellent (≥5); good (3·5≥ to <5); acceptable/reasonable (2·5≥ to <3·5); poor (<2·5).

Using the EURRECA scoring system( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ), fourteen studies assessed nutrient intakes, with quality scores ranging from 2·5 to 6·0 (maximum 7·0). The average quality score was 3·8, with a median of 3·5. Table 2 illustrates the classification of the included studies according to their reference method and methodological quality, with three studies( Reference Williams and Innis 20 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ) (21 %) rating as very good/excellent, five studies( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Parrish, Marshall and Krebs 18 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 ) (36 %) as good quality, five studies( Reference Iannotti, Zuckerman and Blyer 16 , Reference Marshall, Gilmore and Broffitt 19 , Reference Vereecken, Covents and Maes 22 , Reference Rankin, Levy and Warren 23 , Reference Orton, Szabo and Clare-Salzler 28 ) (36 %) having an acceptable quality, and one study( Reference Rankin, Levy and Warren 23 ) (7 %) having a poor quality rating. ‘Good’ quality scores were seen in the validation studies where FFQ were compared with a reference method that was reflective of long-term intakes, and a majority (58 %) of validation studies where the FFQ was compared with a reference method that was reflective of short-term intakes were either ‘good’ or ‘very good’. Factors affecting the EURRECA quality assessment score( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 ) were the statistical analyses used and data collection via interviewer-administration. Calculation of energy-adjusted( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Orton, Szabo and Clare-Salzler 28 ), de-attentuated (to reduce the dependency on between-person variation)( Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ), or intra-class correlation coefficients increased quality scores( Reference D'Ambrosio, Tiessen and Simpson 24 ).

Concurrent validation analysis

Table 3 displays concurrent analysis of the included studies where a mean correlation coefficient per nutrient for each dietary assessment method was calculated by multiplying the correlation coefficient by their quality assessment score. This was completed for the EURRECA priority micronutrients and those studies that met the criteria of having nutrient correlations from at least three studies( Reference Treadwell, Tregear and Reston 40 ). Micronutrients with a sufficient number of studies to be included (≥3 studies), and where the validation reference method reflected short-term intakes (<7 d), were vitamin B12 ( Reference Blum, Wei and Rockett 17 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ), vitamin C( Reference Blum, Wei and Rockett 17 , Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ), vitamin D( Reference Marshall, Gilmore and Broffitt 19 , Reference Marriott, Inskip and Borland 21 , Reference Sochacka-Tatara and Pac 27 ), Ca( Reference Blum, Wei and Rockett 17 , Reference Marshall, Gilmore and Broffitt 19 Reference Vereecken, Covents and Maes 22 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ), Fe( Reference Blum, Wei and Rockett 17 , Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ) and Zn( Reference Blum, Wei and Rockett 17 , Reference Marriott, Inskip and Borland 21 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ). Fibre ( Reference Blum, Wei and Rockett 17 , Reference Williams and Innis 20 , Reference Vereecken, Covents and Maes 22 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 ) and vitamin E( Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference Marriott, Inskip and Borland 21 , Reference Sochacka-Tatara and Pac 27 ) also had sufficient studies to allow for concurrent analysis. There were insufficient data available for the analysis of two (20 %) micronutrients: folate and Cu. Using the EURRECA scoring tool classifications, correlations were acceptable for vitamin B12 (0·30), vitamin A (0·34) and Ca (0·49) using FFQ v. 24-HR whilst Fe showed a poor correlation (0·29) on validation. Acceptable correlations were seen for vitamin C (0·32) and Fe (0·39), and Ca presented a good correlation (0·51) using FFQ v. WFR. The intake method was rated as ‘good’ when the mean correlation coefficient weighted by the quality criteria score was at least 0·5. The number of studies that used a validation reference method that reflected long-term intakes (>7 d) were insufficient for concurrent analysis (<3 studies per micronutrient).

Table 3. Classification of dietary assessment methods for infants aged 12–36 months according to the weighted mean of the correlations of micronutrients with three or more studies available (separate comparisons of those studies reflecting long-term and short-term intakes or comparison of FFQ with a reference method)

WFR, weighed food record; 24HR, 24 h recall; BM, biomarker; NA, not available, fewer than three studies found.

* Correlation: G, good (0·51–0·70); A, acceptable (0·30–0·50); P, poor (<0·30).

Discussion

In this review, using standardised quality assessment methods, we evaluated seventeen studies reporting on the validity of FFQ as a method for assessing food and nutrient intakes or dietary patterns in 12- to 36-month-old children. From the identified studies( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 Reference Williams and Innis 20 , Reference Marriott, Inskip and Borland 21 , Reference Rankin, Levy and Warren 23 , Reference D'Ambrosio, Tiessen and Simpson 24 , Reference Watson, Heath and Taylor 26 , Reference Sochacka-Tatara and Pac 27 Reference Mills, Skidmore and Watson 31 , Reference Metcalf, Scragg and Sharpe 38 ), semi-quantitative FFQ were shown to be valid and reproducible instruments in children as young as 1 year of age, generating adequate estimates specifically for Ca, vitamin C and Fe, with results similar to those seen in older children and adolescents( Reference Parrish, Marshall and Krebs 18 , Reference Vereecken, Covents and Maes 22 ).

FFQ are used to assess dietary intake due to their practicality, relative ease of administration, low participant burden, ability to assess intake over a prolonged period of time, and lower associated costs( Reference Subar 41 , Reference Schatzkin, Kipnis and Carroll 42 ). However, there are limited FFQ that have been specifically validated in 12- to 36-month-old children. In the present review, the methodological qualities of FFQ were considered in conjunction with analysis of weighted correlation coefficients where higher weights were given to studies that employed higher quality methodologies( Reference Serra-Majem, Frost Andersen and Henríque-Sánchez 5 , Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez 34 ). Qualities included data collection methods, administration, seasonality, sample size, supplement use and statistics.

It is estimated that at approximately 7 to 8 years of age children become aware of their own food intake. Prior to this age the cognition and attention span required to perceive time frames, have knowledge of foods, recall food intake, and self-report are not sufficiently developed( Reference Livingstone and Robson 1 ). Other explicit issues that arise in this age group of interest relate to the change in dietary practices seen across the age range and the variability in information provided by parent or proxy reporter, on foods that are eaten outside of their supervision, especially when the child is in day care.

The ability of FFQ to rank nutrient and energy intakes is improved through providing detailed quality information which can be achieved through interviewer administration( Reference Marriott, Inskip and Borland 21 ).The majority (71 %) of the included FFQ were self-administered by a parent or proxy reporter, similar to that seen in reviews conducted in wider age groups( Reference Roman-Viñas, Ortiz-Andrellucchi and Mendez 34 , Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso 43 ). Cade et al.( Reference Cade, Thompson and Burley 7 ) reported an increase in correlation coefficients when the FFQ was interviewer-administered, with the exception of vitamin C, in comparison with those that were self-administered. This is especially relevant in the age group in question, where all information is obtained from a parent or proxy-reporter. There is a need for further studies designed to evaluate the accuracy of parental-reported intakes in larger, ethnically diverse populations, using different dietary assessment methods( Reference Collins, Burrows and Truby 44 ).

Estimation of portion size appears to have some advantage over using average or specified portion sizes, with higher measures of agreement between FFQ and reference method (r 0·5–0·6) and higher correlation coefficients when assessing repeatability( Reference Tabacchi, Amodio and Di Pasquale 33 ). FFQ are seen to commonly overestimate energy intake, which is especially apparent in this population of interest( Reference Andersen, Lande and Arsky 11 , Reference Andersen, Lande and Trygg 15 , Reference Blum, Wei and Rockett 17 , Reference Parrish, Marshall and Krebs 18 , Reference Marriott, Inskip and Borland 21 , Reference D'Ambrosio, Tiessen and Simpson 24 ). This could be attributed to the fact that parents/caregivers may not adequately take into account the small portion sizes consumed by their children and that young children often ‘taste’ many foods without consuming full portions, leading to the inclusion of too large a portion size for some foods( Reference Andersen, Lande and Arsky 11 , Reference Parrish, Marshall and Krebs 18 ). Many of the included studies assessed wider age ranges, i.e. beyond 12 to 36 months, which, as identified in a recent validation study performed in New Zealand, may act to improve validity of the FFQ as older children are more likely to eat meals that are similar to that of the family member or adult completing the FFQ( Reference Watson, Heath and Taylor 26 ). Improvements in validity and bias could be seen through reducing the number of food items in the FFQ, shortening the reporting period, or adjusting portion sizes to more closely reflect those consumed by a young child( Reference Collins, Burrows and Truby 44 ). This unique method has been explored in a study performed in 12- to 24-month-old New Zealand children where the amount of food offered and the amount eaten were recorded separately to encourage parents to differentiate between the two, and portion sizes were described according to the child's ‘palm volume’. This FFQ showed acceptable to good validity and high reproducibility in the assessment of dietary patterns and ranking nutrient intakes( Reference Watson, Heath and Taylor 26 , Reference Mills, Skidmore and Watson 31 ).

In a systematic review by Henríquez-Sánchez et al.( Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso 43 ), an improvement in correlation coefficients (r 0·52) was seen when the number of food items included in the FFQ was greater than 100 (r 0·47). The average number of food items used in the present review was 113. Estimation of supplement use should be considered when evaluating nutrient intake. Information on supplements should be included in dietary assessment with emphasis on the type and dose used. Data from FFQ and reference methods correlated better when supplement intake was captured( Reference Henríquez-Sánchez, Sánchez-Villegas and Doreste-Alonso 43 ). Supplement use was acknowledged in one study( Reference Williams and Innis 20 ) and seasonality in another( Reference D'Ambrosio, Tiessen and Simpson 24 ), but were not considered in the statistical analysis.

All studies calculated Pearson or Spearman's correlation coefficients (Table 1). Calculation of correlation coefficients does not measure agreement between the two methods of dietary assessment, only the degree in which the two methods are related( Reference Bland and Altman 45 ). Their usefulness increases if used in conjunction with an alternative method such as Bland–Altman which provides an analysis of how well the FFQ and reference method agree on average( Reference Bland and Altman 45 ). Other methods such as limits of agreement can be used to provide information on reliability and the direction and consistency of bias and the magnitude of errors between the two assessment methods( Reference Cade, Thompson and Burley 7 , Reference Tabacchi, Amodio and Di Pasquale 33 ). It is difficult to summarise the correlation coefficients, agreement of validity and reproducibility of the included FFQ; therefore the present review should be used as a description of included FFQ, with potential for further meta-analyses.

Using 24-HR as the dietary reference method, FFQ were found to be a suitable tool for ranking children according to nutrient intakes (r 0·46), with stronger correlations in foods consumed more frequently( Reference Sochacka-Tatara and Pac 27 , Reference Bel-Serrat, Mouratidou and Pala 37 ). This highlights the difficulties with episodically consumed food items, as seen in the high day-to-day variability of a young child's diet( Reference Parrish, Marshall and Krebs 18 , Reference Bel-Serrat, Mouratidou and Pala 37 ). Unadjusted FFQ nutrient estimates were larger than unadjusted nutrient estimates from multiple 24-HR and additional analysis of children that regularly received meals and snacks from other caregivers alongside parents revealed no apparent compromise or differences in correlations( Reference Parrish, Marshall and Krebs 18 ).

Using WFR as the reference method to assess long-term intakes, correlations were found to increase using nutrient density values over absolute intakes, but the FFQ had a low to moderate ability to rank children according to intakes of nutrients and foods( Reference Andersen, Lande and Arsky 11 ). WFR are not affected by the same errors, such as portion size estimation, and memory lapses, as the FFQ( Reference Gibson 39 ). The FFQ was found to be a useful tool for estimating short-term energy and nutrient intakes in healthy infants (at a group level)( Reference Marriott, Inskip and Borland 21 , Reference Vereecken, Covents and Maes 22 ). Marriott et al. ( Reference Marriott, Inskip and Borland 21 ) found that differences in micronutrient intakes were partly explained by changes in the consumption of milk between the two dietary assessments and by the different nutrient compositions of cows’ milk and formula( Reference Marriott, Inskip and Borland 21 ). This underestimation of Ca intake by the FFQ has been reported in three studies within this age group( Reference Marshall, Gilmore and Broffitt 19 , Reference Marriott, Inskip and Borland 21 , Reference Huybrechts, De Bacquer and Matthys 46 ).

The use of FFQ to provide estimates of beverage intake has not been widely investigated. Marshall and Rankin concluded that a quantitative FFQ could be used to provide relative estimates of beverage, Ca, vitamin D and fluoride intakes in this age group( Reference Marshall, Gilmore and Broffitt 19 , Reference Rankin, Levy and Warren 23 ) and higher correlations were seen at younger ages when the diet was more limited (r 0·85 at 6 months v. r 0·65 at 60 months)( Reference Rankin, Levy and Warren 23 ).

The present review included correlations from three studies using a biomarker for validation( Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 , Reference Orton, Szabo and Clare-Salzler 28 ). In the assessment of specific nutritional status, Williams & Innis( Reference Williams and Innis 20 ) showed that a semi-quantitative FFQ could be a useful tool in assessing Fe status in infants at a group level (energy adjusted r 0·71), but could result in underestimation of infants deemed to be at high risk of poor Fe status( Reference Parrish, Marshall and Krebs 18 , Reference Williams and Innis 20 ).

Evaluating quality assessment

Where correlations for a given nutrient were available from three or more studies, quality-adjusted correlations were calculated. Higher weighted mean correlations were seen in studies that used WFR as the reference method for Ca, Fe and fibre when compared with other methods. This may be a reflection of the fact that a greater number of studies (60 %) used WFR as a reference method. The highest correlation coefficient weighted by quality was 0·51. There were not sufficient data to conduct the analysis for the remaining micronutrients, and only six out of the ten EURRECA priority nutrients could be assessed. This continues to remain a concern in this age group, where valid nutrient intake estimates could not be calculated. FFQ validation studies that assessed long-term intakes or used biomarkers as the reference tool were based on one or two studies, making them insufficient to reach any conclusion (Table 3).

Limitations

There was a lack of data available to assess the ability of the FFQ in providing adequate estimates for several of the micronutrients highlighted in the present review (Table 3). The heterogeneity in the study designs, methods, outcomes and assessment tools made comparisons difficult, therefore the data were narratively synthesised and described. Due to natural variation, biomarkers may not always be a suitable option for comparison( Reference Parrish, Marshall and Krebs 18 ) and few studies validating FFQ using biomarkers were available for inclusion in the present review which would act to reduce correlated errors associated when the reference method is based on self-reporting( Reference Day, Wong and Bingham 47 ). Studies that assessed the validity of energy intake measurements using doubly labelled water did not meet our inclusion criteria. Due to the specific range of interest, several studies that reported over a wider age range were excluded as reviewers were unable to extract these data. Correlation coefficients of the included studies were used for analysis and quality assessment in the present review; this limits the interpretation of the review as correlation coefficients only measure the degree to which the two assessment methods are related in a validation study, and not the agreement between the methods( Reference Altman 48 ). De-attenuation and energy adjustment have strong implications for correlation coefficients and make it difficult to compare and draw conclusions. Only validation studies written in English were included for analysis. This may have led to the exclusion of reliable validation studies from other countries.

Conclusion

This systematic literature review presents a summary of the quality of FFQ validation studies in children aged 12 to 36 months. The included studies and quality assessment have provided information on aspects of FFQ design that increase validity, such as the number of items included, portion size estimations, appropriate food choices, administration method, validation and reproducibility methods, pre-testing, supplement use, seasonality and the statistical analyses. Semi-quantitative FFQ were shown to be valid and reproducible when estimating dietary intakes at a group level, and are an acceptable instrument for estimating intakes of Ca, vitamin C and Fe in children 12 to 36 months of age. There is insufficient evidence for the evaluation of the validity of micronutrients such as folate, vitamin D, Zn and Cu in this population. Using the results of the included studies; meticulously designed and validated FFQ may be acceptable in estimating intakes of a number of important micronutrients in this age group.

Children aged 12 to 36 months would benefit from further validation studies using appropriate population-specific tools addressing areas highlighted in this review that are unique to dietary assessment in young children. Such areas include further development on portion size estimation, capturing irregular eating patterns, overcoming administration errors with the implementation of computer-assisted methods or the development of novel tools to provide evidence for further validation studies of appropriate population-specific tools, alongside the identification, management and primary prevention of diet-related disease processes.

Acknowledgements

There was no financial support.

A. L. and R. B. completed the literature search, screening process and quality assessment. R. B. was the second independent reviewer. A. L. extracted all data, completed the critical appraisal and completed the first draft of the manuscript and contributed to manuscript revision. C. R. W. and C. C. G. helped develop the review protocol and edited the manuscript. All authors approved the submitted version.

The authors would also like to acknowledge Frances Clements, University of Auckland Faculty of Medical and Health Sciences Subject Librarian for her expert assistance in developing the search strategy for this review.

There were no conflicts of interest.

References

1. Livingstone, M & Robson, P (2000) Measurement of dietary intake in children. Proc Nutr Soc 59, 279293.Google Scholar
2. Kolodziejczyk, JK, Merchant, G & Norman, GJ (2012) Reliability and validity of child/adolescent food frequency questionnaires that assess foods and/or food groups. J Pediatr Gastroenterol Nutr 55, 413.CrossRefGoogle ScholarPubMed
3. Willett, W (2013) Nutritional Epidemiology: Monographs in Epidemiology and Biostatistics, 3rd ed. New York: Oxford University Press.Google Scholar
4.Number not used.Google Scholar
5. Serra-Majem, L, Frost Andersen, L, Henríque-Sánchez, P, et al. (2009) Evaluating the quality of dietary intake validation studies. Br J Nutr 102, S3S9.Google Scholar
6. Vereecken, CA (2010) A longitudinal study on dietary habits and the primary socialization of these habits in young children. Verh K Acad Geneeskd Belg 72, 295308.Google ScholarPubMed
7. Cade, J, Thompson, R, Burley, V, et al. (2002) Development, validation and utilisation of food-frequency questionnaires – a review. Public Health Nutr 5, 567587.Google Scholar
8. Block, G & Hartman, AM (1989) Issues in reproducibility and validity of dietary studies. Am J Clin Nutr 50, 1133–1138; discussion 12311235.CrossRefGoogle ScholarPubMed
9. Coulston, AM & Boushey, C (2008) Nutrition in the Prevention and Treatment of Disease. Amsterdam: Academic Press.Google Scholar
10. Serdula, MK, Alexander, MP, Scanlon, KS, et al. (2001) What are preschool children eating? A review of dietary assessment 1. Annu Rev Nutr 21, 475498.Google Scholar
11. Andersen, LF, Lande, B, Arsky, GH, et al. (2003) Validation of a semi-quantitative food-frequency questionnaire used among 12-month-old Norwegian infants. Eur J Clin Nutr 57, 881888.Google Scholar
12. Ortiz-Andrellucchi, A, Henríquez-Sánchez, P, Sánchez-Villegas, A, et al. (2009) Dietary assessment methods for micronutrient intake in infants, children and adolescents: a systematic review. Br J Nutr 102, S87S117.Google Scholar
13. Livingstone, M, Robson, P & Wallace, J (2004) Issues in dietary intake assessment of children and adolescents. Br J Nutr 92, S213S222.Google Scholar
14. University of York, Centre for Reviews and Dissemination (2015) International Prospective Register of Systematic Reviews. http://www.crd.york.ac.uk/PROSPERO/ (accessed May 2016).Google Scholar
15. Andersen, L, Lande, B, Trygg, K, et al. (2004) Validation of a semi-quantitative food-frequency questionnaire used among 2-year-old Norwegian children. Public Health Nutr 7, 757764.CrossRefGoogle ScholarPubMed
16. Iannotti, RJ, Zuckerman, AE, Blyer, EM, et al. (1994) Comparison of dietary intake methods with young children. Psychol Rep 74, 883889.Google Scholar
17. Blum, RE, Wei, EK, Rockett, HR, et al. (1999) Validation of a food frequency questionnaire in Native American and Caucasian children 1 to 5 years of age. Matern Child Health J 3, 167172.Google Scholar
18. Parrish, LA, Marshall, JA, Krebs, NF, et al. (2003) Validation of a food frequency questionnaire in preschool children. Epidemiology 14, 213217.Google Scholar
19. Marshall, TA, Gilmore, JME, Broffitt, B, et al. (2003) Relative validation of a beverage frequency questionnaire in children ages 6 months through 5 years using 3-day food and beverage diaries. J Am Diet Assoc 103, 714720.Google Scholar
20. Williams, PL & Innis, SM (2005) Food frequency questionnaire for assessing infant iron nutrition. Can J Diet Pract Res 66, 176182.Google Scholar
21. Marriott, LD, Inskip, HM, Borland, SE, et al. (2009) What do babies eat? Evaluation of a food frequency questionnaire to assess the diets of infants aged 12 months. Public Health Nutr 12, 967972.Google Scholar
22. Vereecken, C, Covents, M & Maes, L (2010) Comparison of a food frequency questionnaire with an online dietary assessment tool for assessing preschool children's dietary intake. J Hum Nutr Diet 23, 502510.Google Scholar
23. Rankin, SJ, Levy, SM, Warren, JJ, et al. (2011) Relative validity of an FFQ for assessing dietary fluoride intakes of infants and young children living in Iowa. Public Health Nutr 14, 12291236.Google Scholar
24. D'Ambrosio, A, Tiessen, A & Simpson, JR (2012) Development of a food frequency questionnaire for toddlers of Low-German-speaking Mennonites from Mexico. Can J Diet Pract Res Spring 73, 4044.Google Scholar
25.Number not used.Google Scholar
26. Watson, EO, Heath, AM, Taylor, RW, et al. (2015) Relative validity and reproducibility of an FFQ to determine nutrient intakes of New Zealand toddlers aged 12–24 months. Public Health Nutr 18, 32653271.CrossRefGoogle ScholarPubMed
27. Sochacka-Tatara, E & Pac, A (2014) Relative validity of a semi-quantitative FFQ in 3-year-old Polish children. Public Health Nutr 17, 17381744.Google Scholar
28. Orton, HD, Szabo, NJ, Clare-Salzler, M, et al. (2008) Comparison between omega-3 and omega-6 polyunsaturated fatty acid intakes as assessed by a food frequency questionnaire and erythrocyte membrane fatty acid composition in young children. Eur J Clin Nutr 62, 733738.CrossRefGoogle ScholarPubMed
29. Klohe, DM, Clarke, KK, George, GC, et al. (2005) Relative validity and reliability of a food frequency questionnaire for a triethnic population of 1-year-old to 3-year-old children from low-income families. J Am Diet Assoc 105, 727734.Google Scholar
30. Bel-Serrat, S, Fernandez Alvira, JM, Pala, V, et al. (2011) Relative validation of two dietary assessment methods: SACINA (24-h recall) and food frequency questionnaire. Int J Obes 35, S152.Google Scholar
31. Mills, VC, Skidmore, PM, Watson, EO, et al. (2015) Relative validity and reproducibility of a food frequency questionnaire for identifying the dietary patterns of toddlers in New Zealand. J Acad Nutr Diet 115, 551558.Google Scholar
32. Dennis, LK, Snetselaar, LG, Nothwehr, FK, et al. (2003) Developing a scoring method for evaluating dietary methodology in reviews of epidemiologic studies. J Am Diet Assoc 103, 483487.Google Scholar
33. Tabacchi, G, Amodio, E, Di Pasquale, M, et al. (2014) Validation and reproducibility of dietary assessment methods in adolescents: a systematic literature review. Public Health Nutr 17, 27002714.CrossRefGoogle ScholarPubMed
34. Roman-Viñas, B, Ortiz-Andrellucchi, A, Mendez, M, et al. (2010) Is the food frequency questionnaire suitable to assess micronutrient intake adequacy for infants, children and adolescents? Matern Child Nutr 6, 112121.Google Scholar
35. National Health and Medical Research Council (NHMRC). (2000) How to Use the Evidence: Assessment and Application of Scientific Evidence. Canberra: Biotex.Google Scholar
36.Number not used.Google Scholar
37. Bel-Serrat, S, Mouratidou, T, Pala, V, et al. (2014) Relative validity of the Children's Eating Habits Questionnaire-food frequency section among young European children: the IDEFICS Study. Public Health Nutr 17, 266276.Google Scholar
38. Metcalf, PA, Scragg, RKR, Sharpe, S, et al. (2003) Short-term repeatability of a food frequency questionnaire in New Zealand children aged 1–14 y. Eur J Clin Nutr 57, 14981503.Google Scholar
39. Gibson, RS (2005) Principles of Nutritional Assessment, 2nd ed. New York: Oxford University Press.Google Scholar
40. Treadwell, JR, Tregear, SJ, Reston, JT, et al. (2006) A system for rating the stability and strength of medical evidence. BMC Med Res Methodol 6, 52.Google Scholar
41. Subar, AF (2004) Developing dietary assessment tools. J Am Diet Assoc 104, 769770.Google Scholar
42. Schatzkin, A, Kipnis, V, Carroll, RJ, et al. (2003) A comparison of a food frequency questionnaire with a 24-hour recall for use in an epidemiological cohort study: results from the biomarker-based Observing Protein and Energy Nutrition (OPEN) study. Int J Epidemiol 32, 10541062.Google Scholar
43. Henríquez-Sánchez, P, Sánchez-Villegas, A, Doreste-Alonso, J, et al. (2009) Dietary assessment methods for micronutrient intake: a systematic review on vitamins. Br J Nutr 102, S10S37.Google Scholar
44. Collins, CE, Burrows, TL, Truby, H, et al. (2013) Comparison of energy intake in toddlers assessed by food frequency questionnaire and total energy expenditure measured by the doubly labeled water method. J Acad Nutr Diet 113, 459463.CrossRefGoogle ScholarPubMed
45. Bland, JM & Altman, DG (1999) Measuring agreement in method comparison studies. Stat Methods Med Res 8, 135160.Google Scholar
46. Huybrechts, I, De Bacquer, D, Matthys, C, et al. (2006) Validity and reproducibility of a semi-quantitative food-frequency questionnaire for estimating calcium intake in Belgian preschool children. Br J Nutr 95, 802816.CrossRefGoogle ScholarPubMed
47. Day, NE, Wong, MY, Bingham, S, et al. (2004) Correlated measurement error – implications for nutritional epidemiology. Int J Epidemiol 33, 13731381.CrossRefGoogle ScholarPubMed
48. Altman, DG (1990) Practical Statistics for Medical Research. Boca Raton, FL: CRC Press.CrossRefGoogle Scholar
Figure 0

Fig. 1. Inclusion and exclusion criteria used to select studies for inclusion in the systematic review.

Figure 1

Table 1. Characteristics of included studies evaluating long-term or short-term nutrient intake, or biomarker, food or food group

Figure 2

Fig. 2. Selection process flow of articles identified that assess validity of FFQ methods in children aged 12–36 months.

Figure 3

Table 2. Quality scores using methods described by Dennis et al.(32) and the EURopean Micronutrient RECommendations Aligned (EURRECA) scoring tool(5)

Figure 4

Table 3. Classification of dietary assessment methods for infants aged 12–36 months according to the weighted mean of the correlations of micronutrients with three or more studies available (separate comparisons of those studies reflecting long-term and short-term intakes or comparison of FFQ with a reference method)