Reproducibility and validity of dietary patterns identified using factor analysis among Chinese populations

Xin Hong; Qing Ye; Zhiyong Wang; Huafeng Yang; Xupeng Chen; Hairong Zhou; Chenchen Wang; Wenjie Chu; Yichao Lai; Liuyuan Sun; Youfa Wang; Fei Xu

doi:10.1017/S000711451600249X

Reproducibility and validity of dietary patterns identified using factor analysis among Chinese populations

Published online by Cambridge University Press: 13 July 2016

Xin Hong ,

Qing Ye ,

Wenjie Chu ,

Yichao Lai and

Liuyuan Sun

...Show all authors

Show author details

Xin Hong: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China
Qing Ye: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China
Zhiyong Wang: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China
Huafeng Yang: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China
Xupeng Chen: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, People’s Republic of China
Hairong Zhou: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, People’s Republic of China
Chenchen Wang: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, People’s Republic of China
Wenjie Chu: Affiliation:
Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, People’s Republic of China
Yichao Lai: Affiliation:
Department of Non-communicable Disease Prevention, Qinhuai District Center for Disease Control & Prevention, Nanjing 210029, People’s Republic of China
Liuyuan Sun: Affiliation:
Department of Non-communicable Disease Prevention, Liuhe District Center for Disease Control & Prevention, Nanjing 211500, People’s Republic of China
Youfa Wang: Affiliation:
Department of International Health, Bloomberg School of Public Health, Johns Hopkins Global Center for Childhood Obesity, Johns Hopkins University, Baltimore, MD 21205, USA
Fei Xu*: Affiliation:
Department of Non-communicable Disease Prevention, Nanjing Municipal Center for Disease Control & Prevention, 2, Zizhulin, Nanjing 210003, People’s Republic of China Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, People’s Republic of China
*: *Corresponding author: F. Xu, email [email protected]

Article contents

Abstract
Methods
Results
Discussion
Footnotes
References

Rights & Permissions

Abstract

In the present study, we evaluated the reproducibility and validity of dietary patterns among Chinese adult populations. A random subsample of 203 participants (aged 31–80 years) from a community-based nutrition and health survey was enrolled. An eighty-seven-item FFQ was administered twice (FFQ1 and FFQ2) 1 year apart; four 3 consecutive day, 24-h dietary recalls (24-HDR, as a reference method) were performed between the administrations of the two FFQ every 3 months. Dietary patterns from three separate dietary sources were derived using factor analysis based on twenty-eight predefined food groups. Comparisons between dietary pattern scores were made by using Pearson’s or intraclass correlation coefficients (ICC), cross-classification analysis, weighted κ statistic and Bland–Altman plots; the four major dietary patterns identified from FFQ1, FFQ2 and 24-HDR were similar. Regarding reproducibility, ICC for z-scores between FFQ1 and FFQ2 were all >0·6 for dietary patterns. The ‘animal and plant protein’ pattern had the highest ICC of 0·870. For validity, the adjusted Pearson’s correlation coefficients for dietary pattern z-scores between two FFQ and the mean of four 3 consecutive day 24-HDR ranged from 0·387 for the ‘Chinese traditional’ pattern to 0·838 for the ‘animal and plant protein’ pattern. More than 75 % of the participants were classified into the same or adjacent quartile, and <5 % were misclassified into opposite quartiles. The weighted κ ranged from 0·259 to 0·680. Bland–Altman plots indicated that no significant deviation was found between two dietary assessment methods. Our findings indicate a good reasonable reproducibility and a reasonable validity of dietary patterns derived by factor analysis in China.

Keywords

Reliability Validity Dietary patterns Factor analyses China

Type: Full Papers
Information: British Journal of Nutrition , Volume 116 , Issue 5 , 14 September 2016 , pp. 842 - 852

DOI: https://doi.org/10.1017/S000711451600249X [Opens in a new window]
Copyright: Copyright © The Authors 2016

Epidemiological studies have suggested that dietary pattern analysis is a useful method for studying the role of diet in relation to health outcomes or disease risk. Dietary pattern analysis has been used increasingly as an alternative method to traditional analysis because it takes into account the diet’s overall effects, reflecting more closely the real-world habits⁽ Reference Hu ¹ ^, Reference Newby and Tucker ² ⁾.

The following three main approaches have been used to define dietary patterns: factor analysis, cluster analysis and dietary indices. Factor analysis is a multivariate statistical reduction technique that aggregates specific food groups based on analyses of the correlation–covariance matrix of a number of food items⁽ Reference DiBello, Kraft and McGarvey ³ ⁾. The continuous nature of factor analysis has been seen to be advantageous over other methods⁽ Reference Reedy, Wirfält and Flood ⁴ ⁾. Factor analysis was therefore commonly used to derive dietary patterns.

However, several subjective or arbitrary decisions can be made during factor analysis, including the consolidation of food items into food groups, the number of factors to extract, the method of rotation and the labelling of the components⁽ Reference Martinez, Marshall and Sechrest ⁵ ⁾. Furthermore, dietary patterns can be population specific, such that it is essential to identify dietary patterns in a specific study population of interest⁽ Reference Hu ¹ ^, Reference Villegas, Yang and Gao ⁶ ⁾, such as the Chinese population. In the past two decades, China has experienced a significant nutrition transition from the traditional Chinese diet to a Western diet pattern, with an increase in consumption of red meats, eggs and oils and a decrease in fruit and vegetable intakes.

Most dietary pattern studies have used FFQ to estimate dietary intakes, as they are easy to administer, comparatively inexpensive, and they can assess long-term dietary habits in large populations. However, FFQ are sensitive to the diverse lifestyle, eating habits and dietary preferences in the population concerned⁽ Reference Shim, Oh and Kim ⁷ ⁾. Dietary recalls may be superior to FFQ and have been frequently used as a reference method in many Chinese validation studies⁽ Reference Villegas, Yang and Liu ⁸ ^– Reference Zhuang, Yuan and Lin ¹⁰ ⁾. To date, some foreign studies⁽ Reference Nanri, Shimazu and Ishihara ¹¹ ^– Reference Loy and Jan Mohamed ¹⁸ ⁾ have been conducted to examine the validity of dietary patterns derived from FFQ using factor analysis. Unfortunately, no similar studies have been reported in China, with different culture-specific dietary habits.

The purpose of the present study was to examine the reproducibility and validity of dietary patterns derived from factor analysis among Chinese populations. The reproducibility was assessed by comparing the dietary pattern scores between two FFQ administered 1 year apart, and the validity was assessed by comparing the dietary pattern scores between FFQ and 24-h dietary recalls (24-HDR) as a reference method at 3-month intervals during the period of 1 year.

Methods

Study population

The present study was conducted using a subsample of the community-based, cross-sectional, nutrition and health survey in Nanjing, the capital of Jiangsu Province of China. The detailed study recruitment methods have been described previously⁽ Reference Ye, Hong and Wang ¹⁹ ⁾. In brief, a multi-stage random sampling method was adopted. First, we randomly selected two districts (one urban and one suburban). Next, three streets/towns from each chosen district were randomly selected. Finally, one community from each chosen street/town was randomly selected. This resulted in a total number of six communities. Of 2030 participants of the nutrition and health survey, a random sample of 250 members was invited to participate in the present study. Sample size of the present study was calculated according to subjects per food group ratios of 7:1⁽ Reference Loy and Jan Mohamed ¹⁸ ⁾. Inclusion criteria were as follows: local resident for at least 5 years, aged 30 years or above, free of chronic non-communicable diseases requiring a special diet and not on a weight-reduction diet. Among the 250 selected residents, 248 were eligible to participate and 223 agreed to take part in the survey (response rate: 89·9 %).

The Ethics Board of Nanjing Municipal Center for Disease Control and Prevention reviewed and approved the study protocol, and written informed consent was obtained from each participant before inclusion.

Study design

The study design with time frame is shown in Fig. 1. The study stared in June 2014 and ended in May 2015. Each participant completed the same FFQ twice – the first FFQ (FFQ1) was administrated at baseline and the second FFQ (FFQ2) was administrated 1 year later; four 3 consecutive day 24-HDR were collected between the administrations of the two FFQ every 3 months during a period of 1 year (a total of twelve 24-HDR). The first 3 consecutive day 24-HDR was performed 1 month after FFQ1, and the last 3 consecutive day 24-HDR was performed 1 month before FFQ2. We excluded participants who failed to provide two completed FFQ (n 6), did not complete four 3-d 24-HDR (n 9) or had extreme values for total energy intake (<2092 kJ/d (<500 kcal/d) or >20 920 kJ/d (>5000 kcal/d), n 5). Finally, 203 subjects (81·9 %) were included in the data analysis.

Fig. 1 Study design and time frame used in the present study. 24-HDR, 24-h dietary recalls; m24-HDR, mean of four 3 consecutive day 24-HDR.

Dietary assessment

A semi-quantitative FFQ was used to estimate habitual dietary intakes over the previous year. The reproducibility and validity of the FFQ used in this study have been published elsewhere⁽ Reference Ye, Hong and Wang ¹⁹ ⁾. The FFQ included eighty-seven food items and twelve food categories (grains and products; red meat (pork, beef, mutton); poultry meat; fish and shrimp; eggs; dairy products; soya-based foods; vegetables; fruits; beverages; alcohol; snacks/desserts), which covered 90 % of the commonly consumed foods in Nanjing. For each food item, participants were asked to recall the frequency of consumption (daily, weekly, monthly, annually or never) and the amount of consumption each time in a common unit of weight in China (1 liang=50 g) or in millilitre over the past 12 months. For seasonal vegetables and fruits, participants were asked to recall how often they ate these foods during the season. Individual consumption of food items was converted to grams per day for further analysis.

Owing to the small number of subjects (n 203) relative to the number of food items, and to reduce the complexity of the data, we collapsed the initial eighty-seven food items into twenty-eight predefined food groups (Appendix 1). The grouping scheme was based on the similarity of nutrient profiles or culinary usage among the foods⁽ Reference Hu, Rimm and Smith-Warner ¹³ ^, Reference Qin, Melse-Boonstra and Yuan ²⁰ ⁾.

In total, four 3 consecutive day (including 2 weekdays and 1 weekend day in a usual week) 24-HDR were collected at intervals of 3 months during the 1-year period. Each participant was asked to provide the name and amount of all foods consumed during the previous 24 h. If the previous day was a special day, such as feast or travel days, food consumption of the day before the 24 h was recorded or another day was chosen to interview the participant by telephone. The amounts of different food items that were mixed in one dish were recorded, respectively. The recalled food items were assigned to the corresponding food groups as defined by the FFQ. The Chinese Food Composition Tables ⁽ Reference Yang ²¹ ⁾ were used to estimate the intake of energy (kJ/d (kcal/d)) and key nutrients from each food group consumed by 24-HDR. All values obtained for key nutrient intake were adjusted for total energy intake using the regression residual method⁽ Reference Willett and Stampfer ²² ⁾. The mean of four 3 consecutive day 24-HDR (m24-HDR) data was used as the standard to measure the validity of the FFQ.

Trained interviewers from the local Center for Disease Control and Prevention administered the two FFQ and four 3-d 24-HDR by face-to-face interviews. All diet information were collected and checked after completion. Any implausible or ambiguous information would be further verified and obtained from the participants. Each participant had the same interviewer during the study period.

Dietary pattern analysis

Exploratory factor analysis (FA) was used to identify major dietary patterns based on a set of twenty-eight predefined food groups; FA was performed separately for FFQ1, FFQ2 and m24-HDR food groups. Factors were rotated with varimax rotation to maintain uncorrelated factors and enhance interpretability. A combined evaluation of the eigenvalues, scree plot test and factor interpretability was used in determining the number of retained factors⁽ Reference Nanri, Shimazu and Ishihara ¹¹ ^, Reference Shi, Hu and Yuan ²³ ⁾. Factor loadings were interpreted as correlation coefficients between food groups and dietary patterns. Food groups with positive loadings contributed to the dietary pattern, and food groups with negative loadings were inversely associated with the dietary pattern. Food groups with absolute factor loadings ≥0·30 were considered as significantly contributing to the pattern⁽ Reference Hatcher ²⁴ ⁾. The patterns were labelled according to food groups with high loadings in each dietary pattern. The sum of the squares of the respective factor loadings over all retained factors represented the percentage of variance that was explained by the final factors. Factor scores for each pattern were calculated as the sum of the products of the factor loading coefficients and the standardised daily intake of each food group⁽ Reference Hatcher ²⁴ ⁾.

Statistical analyses

The Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy (>0·6) and Bartlett’s test of sphericity (P<0·05) were used to determine the data suitability for FA. Three methods were used to examine the reliability and validity of dietary patterns.

First, the reproducibility and validity were assessed by comparing dietary pattern scores between FFQ1 and FFQ2, and between two FFQ and m24-HDR, respectively, using Pearson’s or intraclass correlation coefficients (ICC), cross-classification analysis and weighted κ (Kw) statistic. Cross-classification (quartiles method) analysis was conducted to classify the participants into same, adjacent, one quartile apart or opposite quartiles. The inter-rater agreement of the two assessment methods was analysed by Kw. ICC>0·4 indicated good agreement⁽ Reference Gibson ²⁵ ⁾. Pearson’s correlation coefficients of 0·5–0·7 were considered good⁽ Reference Willett and Lenart ²⁶ ⁾. Values of Kw >0·4 indicated moderate agreement⁽ Reference Willett and Lenart ²⁶ ⁾.

Second, a Bland–Altman plot was constructed to assess the agreement of dietary pattern scores between different dietary sources. The plots showed the difference between each individual’s z-scores derived from the mean of two FFQ (mFFQ) and m24-HDR (mFFQ−m24-HDR) against their averages ((mFFQ+m24-HDR)/2)⁽ Reference Bland and Altman ²⁷ ⁾. The mean differences and the 95 % limits of agreement (LOA, calculated as mean differences ±1·96 sd) were used to summarise agreement at the population level.

Third, Pearson’s correlation coefficients were used to compare energy-adjusted nutrient intakes estimated by m24-HDR with dietary pattern scores derived from FFQ and m24-HDR.

All statistical analyses were performed using SPSS software version 20.0 (IBM) and MedCalc version 11.4. All tests were two-tailed, and a P value <0·05 was considered statistically significant.

Results

Study sample characteristics

Among 203 participants, about 48·8 % were males and 92·2 % were married. Their mean age was 50·4 (sd 12·2) years (range 31–80 years); the mean BMI was 23·1 (sd 2·8) kg/m²; and 79·5 % had educational qualification of junior high school or above. The proportion of current smokers and drinkers was 22·0 and 28·8 %, respectively. There were no differences in baseline characteristics between the subsample (n 203) and the entire population (n 2030) (Appendix 2).

Dietary patters identified in the two FFQ and mean of four 3 consecutive day 24-HDR

The KMO measure of sampling adequacy was 0·734 for FFQ1, 0·806 for FFQ2 and 0·640 for m24-HDR, and P values for Bartlett’s test of sphericity were all <0·001. Using FA, four major dietary patterns were extracted from FFQ1, FFQ2 and m24-HDR (Table 1). These four derived patterns were relatively similar from three dietary sources. Factor 1, which loaded heavily on poultry meats, fish and shrimp, bean curd, livestock meats, dry bean and other soyabean products, was labelled the ‘animal and plant protein’ pattern. Factor 2, with high loadings for nuts, sweets and desserts and snacks, was labelled the ‘nuts and sweets’ pattern. Factor 3, which was rich in other grains and products, potatoes, fresh vegetables, fried food, high-fat dairy products, wheat and products, rice and products, and pickled vegetables, was labelled the ‘Chinese traditional’ pattern. Factor 4, characterised by higher intake of sodas, juice, beer, wine, processed meats and liquor, was labelled the ‘beverage and alcohol’ pattern. Overall, the total percentage of variance explained by the four patterns derived from FFQ1, FFQ2 and m24-HDR was 40·0, 44·9, 32·4 %, respectively. In addition, four similar dietary patterns were also identified in the overall samples (Appendix 3).

Table 1 Factor-loading matrix for four major dietary patternsFootnote * identified from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

* The test for suitability of factor analysis: the Kaiser–Meyer–Olkin measure of sampling adequacy was 0·734 for FFQ1, 0·806 for FFQ2 and 0·640 for m24-HDR, and P values for Bartlett’s test of sphericity were all <0·001.

† The factor loadings were >0·30.

Correlations and agreement between dietary pattern z-scores

Regarding reproducibility, ICC for dietary pattern z-scores between FFQ1 and FFQ2 were >0·6 for all four patterns. The ‘animal and plant protein’ pattern had the highest ICC of 0·870 (Table 2). For validity, the adjusted Pearson’s correlation coefficients for dietary pattern z-scores between two FFQ and m24-HDR ranged from 0·387 for the ‘Chinese traditional’ pattern to 0·838 for the ‘animal and plant protein’ pattern.

Table 2 Correlation coefficients for dietary pattern z-scores derived from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)Footnote *

* All correlations were statistically significant (P<0·001).

† Values were intraclass correlation coefficients.

‡ Values were Pearson’s correlation coefficients adjusted for energy intake using the residual method.

When the four dietary pattern scores were categorised into quartiles, the ranges of agreement rates for the same or adjacent quartile classifications were 75·6–95·5 %, when derived from the two FFQ and the m24-HDR. Extreme misclassification into opposite quartiles was <5·0 % (Table 3). The Kw ranged from 0·259 to 0·680.

Table 3 Percentage agreement and κ statistic for dietary pattern z-scores derived from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

Kw, weighted κ.

The Bland–Altman plots of all dietary patterns are presented in Fig. 2–5. The mean agreement between the dietary pattern z-scores derived from the mFFQ and the m24-HDR were not significantly different from zero in all comparisons. The mean differences were 0·0 (95 % LOA −1·03, 1·04) for the ‘animal and plant protein’ pattern,−0·0 (95 % LOA −1·7, 1·6) for the ‘nuts and sweets’ pattern, −0·1 (95 % LOA −2·0, 1·8) for the ‘Chinese traditional’ pattern and −0·2 (95 % LOA −1·9, 1·5) for the ‘beverage and alcohol’ pattern between mFFQ and m24-HDR.

Fig. 2 Bland–Altman plots for ‘animal and plant protein’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 3 Bland–Altman plots for ‘nuts and sweets’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 4 Bland–Altman plots for ‘Chinese traditional’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 5 Bland–Altman plots for ‘beverage and alcohol’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Correlations between dietary pattern z-scores and nutrient intakes

Correlations between energy-adjusted nutrient intakes from the dietary recalls and dietary pattern scores derived from FFQ1, FFQ2 and m24-HDR are shown in Table 4. The majority of statistically significant correlations were consistent for the FFQ and m24-HDR. In particular, the ‘animal and plant protein’ pattern was positively correlated with intakes of protein, carbohydrates, fibre, vitamin A, retinol, thiamine, riboflavin, niacin, vitamin E, Ca, P, K, Mg, Fe, Zn, Se and Cu, and was negatively correlated with intakes of total fat and cholesterol. In contrast, the ‘Chinese traditional’ pattern was negatively correlated with intakes of vitamin A, carotene, niacin, vitamin C, Ca, P, Na, Zn and Mn. The ‘beverage and alcohol’ patterns were positively correlated with intakes of total fat and cholesterol and negatively correlated with intakes of retinol, thiamine and Se.

Table 4 Pearson’s correlation coefficients between dietary pattern scores and energy-adjusted nutrient intakes from the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

* P<0·05.

Discussion

To our knowledge, the present study is perhaps the first one to assess the reproducibility and validity of dietary patterns identified by FA derived from FFQ in comparison with dietary recalls in a Chinese population. In a random subsample of 203 subjects, four major dietary patterns were identified using FA – that is, the ‘animal and plant protein’ pattern, the ‘nuts and sweets’ pattern, the ‘Chinese traditional’ pattern and the ‘beverage and alcohol’ pattern. These four derived patterns were qualitatively similar across three sources of dietary data obtained from the two FFQ and the means of twelve 24-HDR. For all dietary patterns, factor loadings of the FFQ and m24-HDR food groups were partly different. This might be due to methodological differences between dietary assessment methods⁽ Reference Willett ²⁸ ^, Reference Livingstone and Black ²⁹ ⁾, random statistical variation and different assessment periods as mentioned previously⁽ Reference Okubo, Murakami and Sasaki ¹² ^, Reference Hu, Rimm and Smith-Warner ¹³ ^, Reference Khani, Ye and Terry ¹⁵ ^, Reference Asghari, Rezazadeh and Hosseini-Esfahani ¹⁷ ^, Reference Loy and Jan Mohamed ¹⁸ ⁾. The patterns identified in the present study were similar to previous findings in China⁽ Reference Luo, Chen and Miu ³⁰ ^– Reference Odegaard, Koh and Butler ³² ⁾.

The correlations of the dietary pattern z-scores between FFQ1 and FFQ2 revealed good reliability, and the correlations of the dietary pattern z-scores between the two FFQ and the m24-HDR represented a reasonable comparative validity of four major dietary patterns derived by FA using the data of FFQ in a Chinese population. In this study, the 24-h recall method was adopted as a reference method. For reducing the effect of difference in seasonal food availability and seasonal food preferences, twelve 24-HDR (one for 3-month intervals) were collected, which covered variability in food consumption during different seasons. Moreover, 3 consecutive day 24-HDR were administered for 2 weekdays and 1 weekend day in a usual week. Therefore, the influence of different diets between weekdays and weekends could be taken into consideration.

Although the methods of reproducibility and validity of dietary patterns were different, the obtained correlations in the present study were similar to those reported by other studies. In the first such study reported by Hu et al.⁽ Reference Hu, Rimm and Smith-Warner ¹³ ⁾ in 1999, the corrected correlations between the two FFQ and two 1-week diet records (DR) ranged from 0·45 to 0·74 for the prudent and the Western patterns among 127 US males. The correlations for the factor scores between the two FFQ were 0·70 for the prudent pattern and 0·67 for the Western pattern. In 879 Danish men and 927 Danish women⁽ Reference Togo, Heitmann and Sorensen ¹⁴ ⁾, three (green, sweet and traditional) for men and two (green and sweet-traditional) patterns for women were identified in data from a FFQ and a 7-d DR, with corrected correlations ranging between 0·34 and 0·61. Khani et al.⁽ Reference Khani, Ye and Terry ¹⁵ ⁾ provided results with uncorrected correlations ranging between 0·41 and 0·73 for healthy, Western and drinker patterns identified using a FFQ and four 1-week DR in a random subgroup of 362 Swedish women. The coefficients of reproducibility were 0·63 (healthy pattern), 0·68 (Western pattern) and 0·73 (drinker pattern). Among 585 pregnant women in the UK⁽ Reference Crozier, Inskip and Godfrey ¹⁶ ⁾, the correlation coefficients ranged between 0·35 and 0·67 for the dietary patterns derived from a FFQ and a 4-d food diary. In a subsample of 244 men and 254 women in Japan⁽ Reference Nanri, Shimazu and Ishihara ¹¹ ⁾, Pearson’s correlation coefficients between the two FFQ ranged from 0·55 for the Western pattern in men and the prudent pattern in women to 0·77 for the traditional Japanese pattern in men. The corresponding values between 1-week DR and the FFQ ranged from 0·32 for the Western pattern in men to 0·63 for the traditional pattern in women. In 132 Iranian populations⁽ Reference Asghari, Rezazadeh and Hosseini-Esfahani ¹⁷ ⁾, the ICC between factors scores of the two FFQ were 0·72 for the traditional and 0·80 for the Western pattern, and corrected correlations between FFQ2 and twelve 24-HDR were 0·48 for the traditional and 0·75 for the Western pattern. Loy & Jan Mohamed⁽ Reference Loy and Jan Mohamed ¹⁸ ⁾ found that Pearson’s correlation coefficients between FFQ and three 24-HDR for healthy and less-healthy patterns were 0·59 and 0·63, respectively, in 162 Malay pregnant women.

When the dietary pattern scores were classified into quartiles, a higher percentage of participants being classified into the same or adjacent quartile (>75 %) and a low percentage into opposite quartile (<5 %) were shown in four dietary patterns in the present study, which demonstrated moderate agreement and lower misclassification between two FFQ and m24-HDR. The weighted κ statistic, which overcame agreement by chance, depicted fair-to-good agreement for dietary patterns.

The Bland–Altman plot is a better method to illustrate the exact agreement between two different dietary assessment methods, which estimates the mean agreement and the 95 % LOA⁽ Reference Bland and Altman ²⁷ ⁾. A wide LOA indicates that the potential for large differences between methods and agreement is considered poor. The mean agreement was approximately equal to 0 for four patterns between FFQ and m-24HR in this study. The 95 % LOA for four dietary patterns were acceptable, in accordance with the results of previous studies⁽ Reference Okubo, Murakami and Sasaki ¹² ^, Reference Crozier, Inskip and Godfrey ¹⁶ ^– Reference Loy and Jan Mohamed ¹⁸ ⁾. Although the 95 % LOA in the ‘Chinese traditional’ pattern was wider than those in other patterns, these differences were marginal.

The correlation coefficients, κ statistics and percentage of agreement were higher and the 95 % LOA were slight narrower for the ‘animal and plant protein’ pattern compared with the other three patterns; meanwhile, the percentage of misclassification was lower for the ‘animal and plant protein’ pattern than others. This may due to the fact that the ‘animal and plant protein’ pattern was rich in some usual food groups during 1 year (such as red meat, poultry meat, fish and shrimp, eggs, soya foods) and that the other three patterns included infrequent (nuts, sweets and desserts, and snacks in the ‘nuts and sweets’ pattern) or seasonal food groups (fresh fruits and vegetables in the ‘Chinese traditional’ pattern, and beer, wine and liquor in the ‘beverage and alcohol’ pattern).

Examining nutrient profiles is a useful way to compare dietary patterns from different dietary methods. Nutrient intakes are informative because they describe the product of a dietary pattern. As expected, correlations of our study were weaker between the FFQ and the m24-HDR; however, the directions of associations were consistent.

A major strength of the present study was the fact that there were no differences in baseline characteristics between the subsample in the present study and the entire population in the cross-sectional nutrition and health study; four similar dietary patterns were also identified in the overall sample. Therefore, as the subsample in the reproducibility and validity study were representative, the results can be generalised to the entire population. In addition, a high recruitment rate (81·9 %) and detailed data collected by trained interviewers were included.

There are several limitations to the present study. First, the sample size was relative small (n 203), which might have led to inadequate study power. However, some studies⁽ Reference Loy and Jan Mohamed ¹⁸ ^, Reference Floyd and Widaman ³³ ⁾ have suggested that the generally accepted sample size is seven participants per food group for FA. Second, in the absence of an absolute gold standard for dietary assessment, we chose dietary recalls as a reference method. This method was advantageous in its ability to collect actual intake on specific days. However, dietary recalls might also be subject to recall bias, erroneous recording and potential changes in eating behaviour, leading to over-estimating or under-estimating food intake. Therefore, we attempted to minimise weakness by checking dietary recalls by following-up incomplete or ambiguous information directly with respondents. Moreover, four 3 consecutive day 24-HDR were shown to be sufficient to capture seasonal variations in food intake. Third, the analysis of reproducibility and validity was confined to adults aged 31–80 years. It is unclear whether our findings can be applied to children, adolescents and younger adults. Finally, the total variance explained by the four dietary patterns derived from FFQ1, FFQ2 and m24-HDR was 40·0, 44·9, 32·4 %, respectively, suggesting the existence of minor dietary patterns, which were less interpretable and highly variable; therefore, they were not presented in this study.

In conclusion, our study indicated a good reproducibility and a reasonable validity of the major dietary patterns identified by FA using data from a FFQ and dietary recalls among Chinese populations, suggesting that FFQ data provided useful information on dietary patterns. Dietary pattern might be used in nutrition epidemiology as a complementary approach to traditional analysis and is appropriate to examine the diet–disease association.

Acknowledgements

The authors are grateful to all the dedicated fieldworkers who took part in the survey and all participants who facilitated the survey implementation at each community.

The present study was supported by Nanjing Municipal Medical Science and Technique Development Foundation, China (grant no. 2012-YKK12166).

X. H., Q. Y. and F. X. contributed to the study design and data analysis; X. H., Q. Y., F. X., Z. W., H. Y., X. C., H. Z., C. W., W. C., Y. L. and L. S. contributed to data collection; Z. W., H. Y., X. C., H. Z., C. W., W. C., Y. L. and L. S. were responsible for manuscript revision; Y. W. was responsible for power calculation and language editing; and X. H., Q. Y., F. X. and Y. W. contributed to manuscript writing.

The authors declare that there are no conflicts of interest.

Appendix 1

The twenty-eight food groups used in the dietary pattern analysis

Appendix 2

Comparison of participants in the reliability and validity study with those in the cross-sectional survey (Mean values and standard deviations)

Appendix 3

Factor-loading matrix for the four major dietary patternsFootnote * identified using factor analysis in the overall samples (n 2030)

Footnotes

* The test for suitability of factor analysis: the Kaiser–Meyer–Olkin measure of sampling adequacy was 0·702 for FFQ1 and P values for Bartlett’s test of sphericity was <0·001.

† The factor loadings were >0·30.

References

1. Hu, FB (2002) Dietary pattern analysis: a new direction in nutritional epidemiology. Curr Opin Lipidol 13, 3–9.Google Scholar

2. Newby, PK & Tucker, KL (2004) Empirically derived eating patterns using factor or cluster analysis: a review. Nutr Rev 62, 177–203.CrossRef Google Scholar PubMed

3. DiBello, JR, Kraft, P, McGarvey, ST, et al. (2008) Comparison of 3 methods for identifying dietary patterns associated with risk of disease. Am J Epidemiol 168, 1433–1443.Google Scholar

4. Reedy, J, Wirfält, E, Flood, A, et al. (2010) Comparing 3 dietary pattern methods – cluster analysis, factor analysis, and index analysis – with colorectal cancer risk: the NIH-AARP diet and health study. Am J Epidemiol 171, 479–487.Google Scholar

5. Martinez, ME, Marshall, JR & Sechrest, L (1998) Invited commentary: factor analysis and the search for objectivity. Am J Epidemiol 148, 17–19.CrossRef Google Scholar PubMed

6. Villegas, R, Yang, G, Gao, YT, et al. (2010) Dietary patterns are associated with lower incidence of type 2 diabetes in middle-aged women: the Shanghai women’s health study. Int J Epidemiol 39, 889–899.Google Scholar

7. Shim, JS, Oh, K & Kim, HC (2014) Dietary assessment methods in epidemiologic studies. Epidemiol Health 36, e2014009 (Review).Google Scholar

8. Villegas, R, Yang, G, Liu, D, et al. (2007) Validity and reproducibility of the food-frequency questionnaire used in the Shanghai men’s health study. Br J Nutr 97, 993–1000.CrossRef Google Scholar PubMed

9. Xia, W, Sun, C, Zhang, L, et al. (2011) Reproducibility and relative validity of a food frequency questionnaire developed for female adolescents in Suihua, North China. PLoS ONE 6, e19656.Google Scholar

10. Zhuang, M, Yuan, Z, Lin, L, et al. (2012) Reproducibility and relative validity of a food frequency questionnaire developed for adults in Taizhou, China. PLOS ONE 7, e48341.Google Scholar

11. Nanri, A, Shimazu, T, Ishihara, J, et al. (2012) Reproducibility and validity of dietary patterns assessed by a food frequency questionnaire used in the 5-year follow-up survey of the Japan. Public health center-based prospective study. J Epidemiol 22, 205–215.Google Scholar

12. Okubo, H, Murakami, K, Sasaki, S, et al. (2010) Relative validity of dietary patterns derived from a self-administered diet history questionnaire using factor analysis among Japanese adults. Public Health Nutr 13, 1080–1089.CrossRef Google Scholar PubMed

13. Hu, FB, Rimm, E, Smith-Warner, SA, et al. (1999) Reproducibility and validity of dietary patterns assessed with a food-frequency questionnaire. Am J Clin Nutr 69, 243–249.Google Scholar

14. Togo, P, Heitmann, BL, Sorensen, TI, et al. (2003) Consistency of food intake factors by different dietary assessment methods and population groups. Br J Nutr 90, 667–678.Google Scholar

15. Khani, BR, Ye, W, Terry, P, et al. (2004) Reproducibility and validity of major dietary patterns among Swedish women assessed with a food-frequency questionnaire. J Nutr 134, 1541–1545.CrossRef Google Scholar PubMed

16. Crozier, SR, Inskip, HM, Godfrey, KM, et al. (2008) Dietary patterns in pregnant women: a comparison of food frequency questionnaires and 4d prospective diaries. Br J Nutr 99, 869–875.CrossRef Google Scholar

17. Asghari, G, Rezazadeh, A, Hosseini-Esfahani, F, et al. (2012) Reliability, comparative validity and stability of dietary patterns derived from an FFQ in the Tehran lipid and glucose study. Br J Nutr 108, 1109–1117.Google Scholar

18. Loy, SL & Jan Mohamed, HJ (2013) Relative validity of dietary patterns during pregnancy assessed with a food frequency questionnaire. Int J Food Sci Nutr 64, 668–673.Google Scholar

19. Ye, Q, Hong, X, Wang, Z, et al. (2016) Reproducibility and validity of an FFQ developed for adults in Nanjing, China. Br J Nutr 115, 887–894.Google Scholar

20. Qin, Y, Melse-Boonstra, A, Yuan, B, et al. (2012) Zinc biofortification of rice in China: a simulation of zinc intake with different dietary patterns. Nutrients 4, 517–528.Google Scholar

21. Yang, Y (2005) Chinese Food Composition Table 2004. Beijing: Peking University Medical Press.Google Scholar

22. Willett, W & Stampfer, MJ (1986) Total energy intake: implications for epidemiologic analyses. Am J Epidemiol 124, 17–27.Google Scholar

23. Shi, Z, Hu, X, Yuan, B, et al. (2008) Vegetable-rich food pattern is related to obesity in China. Int J Obes (Lond) 32, 975–984.Google Scholar

24. Hatcher, LA (1994) Step-By-Step Approach to Using SAS for Factor Analysis and Structural Equation Modeling. Cary, NC: SAS Institute.Google Scholar

25. Gibson, RS (2005) Principles of Nutritional Assessment, 2nd ed. New York, NY: Oxford University Press.Google Scholar

26. Willett, W & Lenart, E (1998) Reproducibility and validity of food-frequency questionnaires. In Nutritional Epidemiology, 2nd ed. pp 101–147 [W Willett, editor]. Oxford: Oxford University Press.CrossRef Google Scholar

27. Bland, JM & Altman, DG (1986) Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1, 307–310.Google Scholar

28. Willett, WC (1998) Nutritional Epidemiology, 2nd ed. New York: Oxford University Press.Google Scholar

29. Livingstone, MBE & Black, AE (2003) Markers of the validity of reported energy intake. J Nutr 133, 895S–920S.Google Scholar

30. Luo, YZ, Chen, XW, Miu, GZ, et al. (2009) Association between hypertension and dietary patterns in residents of Jiangyin city. Chin J Public Health 25, 314–316 (In Chinese).Google Scholar

31. Dai, X, He, P, Zhang, YF, et al. (2010) Dietary pattern of Shanghai community-based middle and aged women. J Hyg Res 39, 472–477 (In Chinese).Google Scholar

32. Odegaard, AO, Koh, WP, Butler, LM, et al. (2011) Dietary patterns and incident type 2 diabetes in Chinese men and women: the Singapore Chinese health study. Diabetes Care 34, 880–885.Google Scholar

33. Floyd, F & Widaman, K (1995) Factor analysis in the development and refinement of clinical assessment instruments. Psychol Assess 7, 286–299.Google Scholar

Fig. 1 Study design and time frame used in the present study. 24-HDR, 24-h dietary recalls; m24-HDR, mean of four 3 consecutive day 24-HDR.

Table 1 Factor-loading matrix for four major dietary patterns* identified from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

Table 2 Correlation coefficients for dietary pattern z-scores derived from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)*

Table 3 Percentage agreement and κ statistic for dietary pattern z-scores derived from FFQ1, FFQ2 and the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

Fig. 2 Bland–Altman plots for ‘animal and plant protein’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 3 Bland–Altman plots for ‘nuts and sweets’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 4 Bland–Altman plots for ‘Chinese traditional’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Fig. 5 Bland–Altman plots for ‘beverage and alcohol’ pattern z-scores derived from the mean of two FFQ (mFFQ) and mean of four 3 consecutive day 24-HDR (m24-HDR).

Table 4 Pearson’s correlation coefficients between dietary pattern scores and energy-adjusted nutrient intakes from the mean of four 3 consecutive day 24-HDR (m24-HDR) in the subsamples (n 203)

* Factor-loading matrix for the four major dietary patterns* identified using factor analysis in the overall samples (n 2030)

Article contents

Reproducibility and validity of dietary patterns identified using factor analysis among Chinese populations

Abstract

Keywords

Methods

Study population

Study design

Dietary assessment

Dietary pattern analysis

Statistical analyses

Results

Study sample characteristics

Dietary patters identified in the two FFQ and mean of four 3 consecutive day 24-HDR

Correlations and agreement between dietary pattern z-scores

Correlations between dietary pattern z-scores and nutrient intakes

Discussion

Acknowledgements

Appendix 1

Appendix 2

Appendix 3

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests