Introduction
Gut microbial dysbiosis or imbalance has been associated with many diseases, including obesity, inflammatory bowel disease, cancer, neurodegeneration disorders, and others (Chen and Devaraj, Reference Chen and Devaraj2018; Parekh et al., Reference Parekh, Balart and Johnson2015). Diet may impact health via its modulation of the gut microbiota, and the structure and function of the colonic microbiome may play a critical role in mediating dietary effects on health and disease (Pallister et al., Reference Pallister, Jackson, Martin, Zierer, Jennings, Mohney, MacGregor, Steves, Cassidy, Spector and Menni2017; Valdes et al., Reference Valdes, Walter, Segal and Spector2018; Zierer et al., Reference Zierer, Jackson, Kastenmüller, Mangino, Long, Telenti, Mohney, Small, Bell, Steves, Valdes, Spector and Menni2018). In parallel with the rapid advances in understanding the role of host microbes in health and disease, there has been a growing appreciation of how dietary patterns, as opposed to individual components or nutrients, impact health. Our efforts have focused upon an empirical strategy to define novel dietary patterns that predict host blood biomarker concentrations that are strongly associated with disease risk (Shi et al., Reference Shi, Aroke, Jin, Lee, Hussan, Zhang, Manson, LeBlanc, Barac, Arcan, Clinton, Giovannucci and Tabung2021; Tabung et al., Reference Tabung, Smith-Warner, Chavarro, Wu, Fuchs, Hu, Chan, Willett and Giovannucci2016; Tabung et al., Reference Tabung, Smith-Warner, Chavarro, Wu, Fuchs, Hu, Chan, Willett and Giovannucci2016). Employing data from large prospective epidemiologic studies, we have defined novel dietary pattern indices based upon the ability to impact biomarkers of hyperinsulinaemia or chronic inflammation and designated as the Empirical Dietary Index for Hyperinsulinaemia (EDIH) and Empirical Dietary Inflammatory Pattern (EDIP), respectively (Shi et al., Reference Shi, Aroke, Jin, Lee, Hussan, Zhang, Manson, LeBlanc, Barac, Arcan, Clinton, Giovannucci and Tabung2021; Tabung et al., Reference Tabung, Smith-Warner, Chavarro, Wu, Fuchs, Hu, Chan, Willett and Giovannucci2016; Tabung et al., Reference Tabung, Smith-Warner, Chavarro, Wu, Fuchs, Hu, Chan, Willett and Giovannucci2016).
EDIH and EDIP have been examined in a series of epidemiological cohort studies with findings validating initial associations with risk and prognosis of multiple chronic metabolic diseases (Wang et al., Reference Wang, Song, Eliassen, Wang, Fung, Clinton, Rimm, Hu, Willett, Tabung and Giovannucci2023) including obesity (Tabung et al., Reference Tabung, Satija, Fung, Clinton and Giovannucci2019), type 2 diabetes (Jin et al., Reference Jin, Shi, Aroke, Lee, Joseph, Donneyong, Conwell, Hart, Zhang, Clinton, Cruz-Monserrate, Brasky, Jackson, Tinker, Liu, Phillips, Shadyab, Nassir, Bao and Tabung2021; Lee et al., Reference Lee, Li, Li, Liu, Wu, Bhupathiraju, Rimm, Rexrode, Manson, Willett, Hu, Tabung and Giovannucci2020), cardiovascular disease (Li et al., Reference Li, Lee, Hu, Tabung, Li, Bhupathiraju, Rimm, Rexrode, Manson, Willett, Giovannucci and Hu2020), and several cancers (Aroke et al., Reference Aroke, Folefac, Shi, Jin, Clinton and Tabung2020; Jin et al., Reference Jin, Shi, Lee, Rexrode, Manson, Balasubramanian, Zhang, Neuhouser, Lopez-Pentecost, Thomson, Zick, Felix, Stover, Sardesai, Esnakula, Mo, Clinton and Tabung2023; Lee et al., Reference Lee, Fung, Tabung, Colditz, Ghobrial, Rosner, Giovannucci and Birmann2019; Tabung et al., Reference Tabung, Liu, Wang, Fung, Wu, Smith-Warner, Cao, Hu, Ogino, Fuchs and Giovannucci2018). A key gap in our knowledge regarding the mechanisms whereby EDIH and EDIP may impact health and disease risk concerns the complex interactions with the faecal microbiome. Changes in microbial structure and function in response to diet (Pallister et al., Reference Pallister, Jackson, Martin, Zierer, Jennings, Mohney, MacGregor, Steves, Cassidy, Spector and Menni2017; Zierer et al., Reference Zierer, Jackson, Kastenmüller, Mangino, Long, Telenti, Mohney, Small, Bell, Steves, Valdes, Spector and Menni2018) may influence host metabolism and downstream functional effects, including modulation of inflammation and insulin resistance (Valdes et al., Reference Valdes, Walter, Segal and Spector2018).
This study employs a cohort with dietary assessment, serum biomarker assessment, and analysis of the faecal microbiome composition. Our study objectives were to identify the microbiota profiles associated with the EDIH and EDIP dietary patterns and assess their associations with circulating biomarkers of insulinaemia and inflammation. In addition, given that these dietary scores were being computed in a non-US population (TwinsUK cohort) for the first time, we conducted a construct validation of EDIH and EDIP dietary scores by evaluating their associations with circulating markers of insulin and inflammation. Quantifying how these variables are related can provide a basis for new hypotheses to inform dietary pattern intervention studies targeting the gut microbiome to reduce hyperinsulinaemia and chronic inflammation, to improve metabolic health, and reduce disease burden.
Methods
Study population
TwinsUK is a national adult twin registry in the United Kingdom initiated in 1992 and recruited more than 15,000 male and female community-dwelling twins aged 18–100 years, into the cohort (Spector and Williams, Reference Spector and Williams2006). TwinsUK is a multidisciplinary platform providing deeply phenotyped and genotyped data for health- and social-related research with multiple visits and prospective follow-up (Verdi et al., Reference Verdi, Abbasian, Bowyer, Lachance, Yarand, Christofidou, Mangino, Menni, Bell, Falchi, Small, Williams, Hammond, Hart, Spector and Steves2019). After about 7,000 twins enrolled at baseline (1992–2004), more than half of them finished a follow-up visit between April 2004 and May 2007. From August 2007 to April 2012, the second wave of follow-up visits invited 3,125 women with at least one previous clinical visit. The third wave of follow-up visits was performed between May 2012 and May 2018, including 5,151 participants from the earlier waves. Since February 2019, a further wave of follow-up visits has been ongoing. Multiple questionnaires and clinical samples were collected during the baseline and follow-up visits.
The baseline food frequency questionnaire (FFQb) was collected between 1993 and 2001 from 4,472 participants. The FFQb collection included a maximum of three time points per individual and 5,414 records. The first wave of follow-up visits (follow-up 1) happened between 2004 and 2007, and no dietary data were collected. The second FFQ (FFQ2), corresponding to the second wave of follow-up visits (follow-up 2), data were collected in 2007 from 3,370 participants at only one time point. The third FFQ3 (follow-up 3) was from 2014 to 2018 among 5,440 participants, with two time points for some participants. Circulating biomarkers, clinical characteristics, and other longitudinal data were collected multiple times during the follow-up periods, including stool samples in the third wave of follow-up visits. In the current study, we conducted cross-sectional analyses linking dietary scores from FFQ3 with faecal microbiome data, as well as cross-sectional and longitudinal analyses examining diet and biomarkers at all three time points for data collection (Supplementary Figure 1). Ethical approval for the TwinsUK cohort study was obtained from St. Thomas’ Hospital Research Ethics Committee, and written informed consent was obtained from all study participants.
Diet assessment in TwinsUK and calculation of EDIH and EDIP scores
In TwinsUK, habitual diet was estimated using a 131-item FFQ previously used in the European Prospective Investigation into Cancer and Nutrition study (Teucher et al., Reference Teucher, Skinner, Skidmore, Cassidy, Fairweather-Tait, Hooper, Roe, Foxall, Oyston, Cherkas, Perks, Spector and MacGregor2007). FFQ measurement characteristics have been evaluated and validated against urinary nitrogen, potassium, serum vitamin C, and carotenoids (Teucher et al., Reference Teucher, Skinner, Skidmore, Cassidy, Fairweather-Tait, Hooper, Roe, Foxall, Oyston, Cherkas, Perks, Spector and MacGregor2007). For participants with multiple clinic visits and FFQ data during one time point, we used the mean value as the final food intake value for each individual. The FFQ data estimated the frequency of intake of individual food items in servings per week. While most foods had the same portion size definition across the three FFQs, there were differences for a few food items, for example, the portion size for low-fat cottage cheese was medium serving in FFQb and tablespoons in FFQ2. Therefore, to improve the comparability of scores across the three FFQs and with external cohorts, we employed a standardised serving size strategy using the serving size information from the Nutrition Data System for Research (NDSR) software of the University of Minnesota (Schakel, Reference Schakel2001). NDSR is a dietary analysis program designed for foods (servings per day) and nutrient intake analyses. We used the 168 standardised serving-size food subgroups within the nine major food categories defined in the NDSR 2017 version. The serving size for each food subgroup was assigned based on the recommendations made by the 2000 Dietary Guidelines for Americans or Food and Drug Administration (FDA) serving sizes. We converted each food item in the TwinsUK FFQs into the NDSR serving sizes per day and assigned them to the appropriate NDSR food subgroups. We created two additional food groups for pizza and cream soup since they were disaggregated into ingredient levels in NDSR but treated as a whole food in the EDIH/EDIP score estimation process. We then calculated EDIH and EDIP scores based on the 170 standardised food subgroup servings and the components of each dietary pattern in TwinsUK, listed in Supplementary Table 1.
Microbiome data assessment
During the third follow-up, faecal samples were collected for microbiome analysis (Verdi et al., Reference Verdi, Abbasian, Bowyer, Lachance, Yarand, Christofidou, Mangino, Menni, Bell, Falchi, Small, Williams, Hammond, Hart, Spector and Steves2019). Faecal sample collection, bacterial DNA extraction, amplification, and sequencing have previously been described in detail (Bowyer et al., Reference Bowyer, Jackson, Le Roy, Ni Lochlainn, Spector, Dowd and Steves2019; Goodrich et al., Reference Goodrich, Davenport, Beaumont, Jackson, Knight, Ober, Spector, Bell, Clark and Ley2016). Briefly, after 16 sRNA V4 variable regions were sequenced using the Illumina MiSeq platform, amplicon sequence variants (ASVs) were generated using the DADA2 pipeline (Callahan et al., Reference Callahan, McMurdie, Rosen, Han, Johnson and Holmes2016). Samples achieving a sequence depth of less than 10,000 were excluded. Taxonomy was assigned to the ASV sequences using the SILVA reference database, and reverse transcript sequences were included (Wells et al., Reference Wells, Adebayo, Bowyer, Freidin, Finckh, Strowig, Lesker, Alpizar-Rodriguez, Gilbert, Kirkham, Cope, Steves and Williams2020). Sequences that were unassigned at the Kingdom/Phylum level were removed. A cross-sectional analysis was conducted to identify the microbiome profile for each of the dietary scores computed from FFQ3. Among the 3,345 participants with 27,650 ASVs recorded, 1,610 provided dietary data on the FFQ3 and were included. The Shannon index and Pielou’s evenness index were calculated using the R package, “vegan” to determine the alpha diversity of the microbiome community (Oksanen Reference Oksanen2017).
Functional analysis of predicted metagenomes
Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt2) pipeline version 2.4.1 (Douglas et al., Reference Douglas, Maffei, Zaneveld, Yurgel, Brown, Taylor, Huttenhower and Langille2020) was used to predict the functions of EDIH- and EDIP-enriched microbiota, following the pipeline guidelines and Metacyc dataset (Caspi et al., Reference Caspi, Billington, Keseler, Kothari, Krummenacker, Midford, Ong, Paley, Subhraveti and Karp2020).
Assessment of circulating biomarker data
We obtained circulating biomarker data at baseline, follow-up 1, follow-up 2, and follow-up 3 and assessed fasting insulin, glucose, and C-reactive protein (CRP; Menni et al., Reference Menni, Fauman, Erte, Perry, Kastenmüller, Shin, Petersen, Hyde, Psatha, Ward, Yuan, Milburn, Palmer, Frayling, Trimmer, Bell, Gieger, Mohney, Brosnan, Suhre, Soranzo and Spector2013; Sas et al., Reference Sas, Vaez, Jamshidi, Nolte, Kamali, Spector, Riese and Snieder2017). The homeostatic model assessment of insulin resistance (HOMA-IR) was calculated from fasting insulin and fasting glucose data using the standard formula [HOMA-IR = insulin (μIU/ml) × glucose (mmol/L)/22.5 (Wallace et al., Reference Wallace, Levy and Matthews2004). If one participant had multiple biomarker data in the same period, we used the mean value in this period as the final biomarker value.
Statistical analysis
To describe participants’ characteristics, all categorical variables were presented using frequencies (%), and all continuous variables were presented using means (standard deviations) across quintiles of the dietary indices (EDIH and EDIP) at each of the three time points for dietary data collection, but using FFQ3 as the primary analysis. The Shannon index and Pielou’s evenness index were normalised via log transformation using natural logs. We estimated the percentage difference in alpha diversity per 1 standard deviation increments in the dietary index using multivariable-adjusted linear regression analyses, adjusting for all the covariates listed below. In addition, we estimated the absolute alpha diversity indices in quintiles of EDIH and EDIP via back-transformation of the log-transformed values, and the corresponding relative alpha diversity using the lowest quintile as the reference.
For microbiome profile analyses, we aggregated the microbiome data at the genus level and conducted a centred log-ratio transformation after adding one pseudo count for genera expressed in >90% of participants. We identified microbiome profiles for adherence to each dietary pattern using elastic net regression. First, we randomised the dataset into 7:3 ratio (70% for training and 30% for testing) and used an elastic net regression model within a 10-fold cross-validation framework to regress EDIH and EDIP on the 143 genera in the training dataset. Then, we applied the trained model to the testing dataset to calculate a dietary index-related microbiome profile score. The score was calculated as the weighted sum of the selected microbial genera, with weights equal to the elastic net regression coefficients. In the training dataset, we calculated the microbiome scores using the elastic net regression model with a leave-one-out validation approach to avoid over-fitting. We calculated the Spearman correlation coefficient between dietary index-related microbiome profile scores, individual genera, the dietary indices, and dietary index food components.
We conducted two levels of functional analyses: First, we used multivariable-adjusted linear regression to assess the association between dietary index-related microbiome profile scores and circulating biomarkers of insulinaemia and inflammation. Second, we used PICRUSt2 to predict the functional pathways of the metagenomes. In this analysis, we excluded pathways with more than 90% zeros and then did a probit transformation before multivariable-adjusted linear regression analyses used above. A cut-off value of 0.1 for FDR p value was used to screen significant pathways associated with EDIH and EDIP. For the top 10 positively and negatively associated pathways, a z-score normalised abundance in each quintile was used to generate a heatmap for data visualisation.
In addition, we used similar multivariable-adjusted linear regression analyses to assess the associations between dietary indices and biomarkers detected at different time points. To determine how closely twins share the same dietary pattern, sensitivity analyses were conducted for biomarker, microbiome alpha diversity, and the microbiome score/biomarker analysis in twin pairs only, using a mixed-effects regression model by specifying twin pairs as a random effect to account for potential within-twin pair correlation.
The following covariates were assessed and included in the multivariable-adjusted models: total energy intake (kcal/day, continuous); age at FFQ time points (years, continuous); sex (male, female); self-reported racial/ethnic group (White, Black, Asian, and other); smoking status (never, current, past smoker); number of nutrient supplements used (continuous); occupation (unemployed, retired, permanently disabled, highly paid [doctor, pharmacist, professor, lawyer]), medium-paid profession (teacher, social worker, nurse, and similar), low-paid profession (waitress, cashier, cleaner, and similar); educational levels (less than elementary school, elementary school, high school, college, and higher); postmenopausal status (menopaused, not menopaused, men); hormone replacement therapy (yes/no); mean fasted hours (hours, continuous); nonsteroidal anti-inflammatory drug use (yes/no). All analyses were additionally adjusted for BMI (BMI = weight (kg)/(height (m)2, continuous). For twin-pair sensitivity analyses, zygosity was additional adjusted. Data on these covariates were collected by self-administered questionnaires on demographics, medical history, and lifestyle factors at baseline and follow-up periods. All analyses were conducted using SAS® version 9.4 (SAS Institute, Cary, NC) and R.
Results
Participants’ characteristics
Participant characteristics in quintiles of EDIH and EDIP dietary patterns computed from FFQ3 are shown in Table 1. Participants in the highest quintile (most hyperinsulinaemic or pro-inflammatory, respectively) compared with the lowest quintile of EDIH or EDIP were predominantly non-White, had higher BMI, and lower level of education. They also reported lower intake of whole grains, higher protein, fat, and sodium intake, and lower potassium intake. Similar trends of characteristics were observed in FFQ2 (Supplementary Table 2) and FFQb (Supplementary Table 3) dietary score quintiles.
a Dietary indices were adjusted for total energy intake using the residual method.
b Values presented are means ± SD for continuous variables and percentages for categorical variables.
c P value for differences of participant characteristics across quintiles. P values were calculated using chi square test for categorical variables and ANOVA for continuous variables.
d In TwinsUK, race/ethnicity was self-identified. Non-white ethnicity group included Black, Asian, mixed and others.
e The food group variables (servings/day) in TwinsUK were as follows: processed meat (Bacon, corned beef, spam, luncheon meats, ham, and sausages); red meat (roast, steak, mince, stew or casserole beef, savoury pies, for example, meat pie, pork pie, pasties, steak and kidney pie, sausage roll, roast, chops, stew or slices pork, beefburgers, meat soups, and roast, chops or stew lamb); sugar-sweetened beverages all regular (not diet) soft drinks and sweetened fruit drinks; wine (white wine); coffee or tea (all types); green leafy vegetables (green salad, lettuce, cucumber, celery, spinach, broccoli, spring green, kale, and watercress).
f Nutrient values are nutrient densities, presented per 1000 kcal of total energy intake.
Dietary indices in relation to microbiota alpha diversity
In cross-sectional analyses using FFQ3 dietary data, higher EDIH and EDIP, reflecting more hyperinsulinaemic or more pro-inflammatory dietary patterns, were associated with lower microbiota alpha diversity, as the Shannon index decreased 3.2% and 2.3% comparing the highest quintiles to the lowest quintiles for EDIH and EDIP, respectively. Similarly, Pielou’s evenness index also decreased 2.1% and 1.4% for the same comparisons. For both dietary patterns scores, the associations were slightly attenuated with additional adjustment for BMI (Table 2). Similar results were found for longitudinal assessments of alpha diversity with the EDIH and EDIP scores calculated from FFQ2 and FFQb (Supplementary Table 4).
a Values are beta-coefficients from linear regression models, adjusted for total energy intake, BMI, age, race, smoking status, supplement use, occupation, education, postmenopausal status, hormone replacement therapy, and nonsteroidal anti-inflammatory drug use.
b BMI was additional adjusted.
c The bolded numbers represent statistically significant findings (i.e., FDR p value <0.05).
Dietary indices-related microbiome profile scores
Dietary index-related microbiome scores at the genus level were developed using FFQ3 data. The elastic net regression retained 32 and 15 genera to compute the EDIH- and EDIP-related microbiome scores, respectively. For EDIH, Caproiciproducens (β = −0.034), Intestinimonas (β = −0.025), and Ruminococcaceae UCG-008 (β = −0.024) showed the largest inverse associations (i.e., low insulinaemic), whereas Adlercreutzia (β = 0.057), Negativibacillus (β = 0.023), and Turicibacter (β = 0.019) showed the largest positive associations (i.e., hyperinsulinaemic), (Supplementary Table 5). For EDIP, Ruminococcaceae_UCG-008 (β = −0.020), Lachnospiraceae_UCG-008 (β = −0.015), and Defluviitaleaceae_UCG-011 (β = −0.012) showed the largest inverse associations (low inflammatory), while Streptococcus (β = 0.010), Eisenbergiella (β = 0.008), and Negativibacillus (β = 0.008) showed the largest positive associations (Supplementary Table 6). Six genera were associated with both EDIH and EDIP, four inversely (Ruminococcaceae_UCG-008, Lachnospiraceae_UCG-008, Defluviitaleaceae_UCG-011, Anaeroplasma), and two positively (Negativibacillus, Streptococcus).
The microbiome profile scores showed similar correlations (as the dietary scores) with dietary score food components. That is, microbiome profile scores were positively correlated with food components contributing to higher dietary scores, while inversely correlated with food components contributing to lower dietary scores. Single genera showed the same positive or inverse correlations with microbiome profile and dietary scores. For the microbiota associated with both EDIH and EDIP, the Lachnospiraceae were positively associated with green leafy vegetables, coffee, wine, and whole fruit, while Negativibaccilus, streptococcus, and Adlercreutzia seem to positively associate with French fries, red/processed meat, poultry, regular sodas, and diet sodas (Figure 1, Supplementary Tables 5–7).
Microbiome functional analysis
Higher EDIH- and EDIP-related microbiome profile scores, reflecting microbes associated with a more hyperinsulinaemic or more pro-inflammatory microbiome profile, were significantly associated with higher concentration of insulin, glucose, and HOMA-IR in cross-sectional analyses at follow-up 3. However, the results were attenuated for glucose when additionally adjusted for BMI (Table 3). In addition, the two microbiome scores were significantly associated with higher concentration of insulin and HOMA-IR in longitudinal analysis at follow-up 2. However, we did not observe significant associations with baseline dietary scores assessed on average 12 year earlier.
a Values are beta coefficients from the multivariable-adjusted linear regression and the bolded numbers represent statistically significant findings (i.e., FDR p value <0.05).
b Values were adjusted for total energy intake, age, sex, race, smoking status, supplement use, occupation, education, postmenopausal status, nonsteroidal anti-inflammatory drug use, and hormone replacement therapy.
c Abbreviations: HOMA-IR, Homeostatic model assessment of insulin resistance.
In the predicted pathways analysis, multivariable-adjusted linear regression identified 14 pathways inversely associated with EDIH and 3 pathways positively associated. Among the 14 inverse pathways, 4 of them, involved in the biosynthesis of fatty acids (fatty acid, stearate, palmitoleate, oleate), were still significantly associated after additionally adjusting for BMI. The mevalonate pathway is the major positively associated pathways for EDIH. Multivariable-adjusted linear regression identified 33 pathways inversely associated with EDIP and 121 pathways positively associated. Higher EDIP score was associated with down-regulation of pathway abundances for the biosynthesis of nucleotide sugars (e.g., CMP-legionaminate, GDP-d-glycero-α-d-manno-heptose and dTDP-l-rhamnose), and amino acid (L-glutamate and L-glutamine), nitrogen compound metabolism, adenosylcobalamin salvage, D-fructuronate degradation; and up-regulation of pathway abundances for the biosynthesis of inosine-5′-phosphate, fatty acid ((5Z)-dodecenoate, palmitoleate), and geranylgeranyl diphosphate (GGPP), degradation of aromatic compounds (e.g., catechol and toluene) and lactose, mixed acid fermentation, TCA cycle, and mevalonate pathway (Figure 2).
Dietary index scores and circulating biomarker concentrations
In the construct validation sub-study, we found that higher EDIH and EDIP scores were significantly associated with higher concentrations of insulin, HOMA-IR, and CRP but not glucose in TwinsUK in cross-sectional analyses. Longitudinal analyses confirmed that EDIH and EDIP score assessed multiple years earlier were still associated with future unfavourable plasma biomarker profiles (Supplementary Figure 2).
Sensitivity analysis among twin pairs
We had 1,520; 1,039; and 1,075 twin pairs at FFQb, FFQ2, and FFQ3 twin-pair analyses. The mixed-effects regression results showed very similar results for alpha diversity and microbiome scores when compared with the main analysis results (Supplementary Tables 8 and 9). Paired t tests did not show any significant differences in dietary scores, alpha diversity scores, and microbiome profile scores between twin pairs (data not shown).
Discussion
Dietary patterns with a high potential to contribute to insulin hypersecretion and chronic systemic inflammation, based on higher EDIH and EDIP scores, have been associated with multiple metabolic diseases in previous studies (Wang et al., Reference Wang, Song, Eliassen, Wang, Fung, Clinton, Rimm, Hu, Willett, Tabung and Giovannucci2023). To explore the potential mechanisms that may underlie such associations, we identified microbiome diversity changes and differences in the abundance of specific microbes linked to hyperinsulinaemic and pro-inflammatory dietary patterns for the first time. Specifically, more hyperinsulinaemic and pro-inflammatory dietary patterns were associated with lower faecal microbial diversity and the abundances of specific microbes, essential biosynthetic and degradation processes in metabolic pathways, providing insights on which microbes may be depleted and/or enriched in a dietary pattern intervention to lower the insulinaemic or inflammatory activity of the diet. The related microbiome profile scores of the dietary patterns were also predictive of the plasma biomarker constructs of EDIH and EDIP. In the construct validation study, we determined that more hyperinsulinaemic or pro-inflammatory dietary patterns (higher EDIH and EDIP) were significantly associated with higher concentrations of insulin, HOMA-IR, and CRP, but not glucose. These findings from the three cross-sectional analyses were confirmed in the four longitudinal analyses, suggesting that dietary patterns assessed multiple years earlier, may impact future plasma biomarker profiles.
A major objective of the dietary patterns approach to nutrition research is to examine their impact on health outcomes; however, one limitation of the prevailing dietary patterns is that most are not designed to optimise disease prediction, and for this reason, metabolic dietary patterns, including the EDIH and EDIP, have been proposed (Shi et al., Reference Shi, Aroke, Jin, Lee, Hussan, Zhang, Manson, LeBlanc, Barac, Arcan, Clinton, Giovannucci and Tabung2021, Wang et al., Reference Wang, Song, Eliassen, Wang, Fung, Clinton, Rimm, Hu, Willett, Tabung and Giovannucci2023). This approach to dietary patterns is based on the premise that a dietary pattern predictive of a biological marker in a disease pathway (e.g., hyperinsulinaemia, lipids, chronic systemic inflammation) may be more predictive of disease outcomes if the pathway is a strong determinant of the disease. Indeed, previous studies found that both EDIH and EDIP were more predictive of: (i) type 2 diabetes risk than the glycaemic index and glycaemic load (Jin et al., Reference Jin, Shi, Aroke, Lee, Joseph, Donneyong, Conwell, Hart, Zhang, Clinton, Cruz-Monserrate, Brasky, Jackson, Tinker, Liu, Phillips, Shadyab, Nassir, Bao and Tabung2021; Lee et al., Reference Lee, Li, Li, Liu, Wu, Bhupathiraju, Rimm, Rexrode, Manson, Willett, Hu, Tabung and Giovannucci2020), (ii) colorectal cancer risk (Wang et al., Reference Wang, Song, Eliassen, Wang and Giovannucci2022), and risk of major chronic diseases (Wang et al., Reference Wang, Song, Eliassen, Wang, Fung, Clinton, Rimm, Hu, Willett, Tabung and Giovannucci2023), than existing dietary pattern indices such as the Healthy Eating Index (HEI), Dietary Approaches to Stop Hypertension diet, and so forth.
Multiple previous studies have investigated the associations of several indices of dietary quality and the gut microbiome, and generally found that higher dietary quality is associated with higher microbiota diversity, which can improve immune function and digestion and reduce the risk of metabolic and inflammatory diseases. A study in the Multi-Ethnic Cohort found that higher dietary quality based on four different dietary quality indices was associated with higher alpha diversity and specific taxa (Maskarinec et al., Reference Maskarinec, Hullar, Monroe, Shepherd, Hunt, Randolph, Wilkens, Boushey, Le Marchand, Lim and Lampe2019). Moreover, diet quality scores are associated with higher abundances of specific microbes that are potentially beneficial for health improvement (Bowyer et al., Reference Bowyer, Jackson, Pallister, Skinner, Spector, Welch and Steves2018; Maskarinec et al., Reference Maskarinec, Hullar, Monroe, Shepherd, Hunt, Randolph, Wilkens, Boushey, Le Marchand, Lim and Lampe2019). Our results align with previous studies that have found fibre-fermenting bacteria, including Faecalibacterium, Lachnospira, and Ruminococcus, to be associated with higher dietary quality. These are among three of the six genera or family (for Faecalibacterium and Negativibacillus) that were inversely associated with higher EDIH or EDIP in the current study. Although we did not analyse other dietary patterns in this study, a previous study among 2,070 individuals in the TwinsUK cohort showed that two commonly used dietary quality indices – the HEI-2010 and the Mediterranean Dietary Score – were associated with higher alpha diversity (Bowyer et al., Reference Bowyer, Jackson, Pallister, Skinner, Spector, Welch and Steves2018). The EDIH and EDIP assess dietary quality based on the insulinaemic or inflammatory potential of the diet, and lower EDIH/EDIP scores correlate positively with the conventional indices of dietary quality (Wang et al., Reference Wang, Song, Eliassen, Wang, Fung, Clinton, Rimm, Hu, Willett, Tabung and Giovannucci2023; Wang et al., Reference Wang, Song, Eliassen, Wang and Giovannucci2022). Although EDIH, EDIP, and HEI are all food-based dietary patterns, EDIH and EDIP are specifically designed based on biological mechanisms, whereas HEI is aimed at evaluating adherence to the Dietary Guidelines for Americans.
We created microbiome profile scores to better reflect the role of the dietary pattern in the microbiota composition and further demonstrate function by relating these microbiome profile scores with circulating markers of inflammation and insulin response, or through predicting the pathways of metagenomes. Based on the current study findings, to improve metabolic and overall health, a dietary intervention to reduce inflammatory or insulinaemic activity of the diet may therefore tip the overall balance of the gut microbial composition to greater levels of Lachnospiraceae, Faecalibacterium, Oscillospira, Ruminococcaceae_UCG-014, Marvinbryantia, Intestinimonas, and Anaerostipes, genera, and reduced levels of Porphyromonas, Eisenbergiella, Escherichia/Shigella, Adlercreutzia, Eggerthella, Fusobacterium, and Bilophila. Functional analysis revealed that EDIH and EDIP might regulate essential biosynthetic and biodegradation processes involved in nutrient assimilation, energy production, and the synthesis of critical biomolecules necessary for cellular structure and function. Several lipid metabolism pathways were found in the functional analysis. Higher EDIH was associated with reduced biosynthesis of monounsaturated fatty acid (MUFA), oleic acid, and palmitic acid. We also observed that higher EDIP was associated with increased palmitate biosynthesis. The increased palmitic acid may attenuate the insulin signalling pathway and promote insulin resistance through decreasing the function of the endoplasmic reticulum (ER) and mitochondria. In addition, palmitic acid can activate pro-inflammatory pathways through Toll-like receptor 4 pathways. Another MUFA, oleic acid, has the potential to attenuate these unhealthy functions caused by increased palmitic acid (Palomer et al., Reference Palomer, Pizarro-Delgado, Barroso and Vázquez-Carrera2018). An increased mevalonate pathway was associated with both EDIH and EDIP. The mevalonate pathway was upregulated by both hyperinsulinaemic and pro-inflammatory dietary patterns and produces essential regulators of cellular metabolism, such as lipoproteins, dolichol, ubiquinone, and cholesterol-derived products (Guerra et al., Reference Guerra, Recio, Aranda-Tavío, Guerra-Rodríguez, García-Castellano and Fernández-Pérez2021). The dysregulation of mevalonate was associated with multiple metabolic diseases, including cardiovascular disease, inflammatory bowel disease, and cancers (Guerra et al., Reference Guerra, Recio, Aranda-Tavío, Guerra-Rodríguez, García-Castellano and Fernández-Pérez2021, Pereira et al., Reference Pereira, Matuszewska, Glogova and Petrik2022). Mevalonate is essential for GGPP biosynthesis (Guo et al., Reference Guo, Wan, He, Lin, Cao, Qiu, Zhang, Zhao, Niu, Jin, Huang, Wang, Tan, Thomas, Zhang, Chen, Wong, Hu and Ji2022). The inhibition of GGPP pathways showed anticancer effects in several cancers.
Our study has several strengths. We note that the EDIH and EDIP scores were developed, validated, and applied in several large cohorts, all in the United States, and this is the first time the dietary scores have been applied in a non-US population, using a robust methodology for standardizing food serving size definitions. We took advantage of the comprehensive data assessments in TwinsUK and implemented a robust study design comprised of multiple cross-sectional and longitudinal studies, yielding highly concordant results. The elastic net regression with 10-fold cross validation or leave-one-out validation approach we used here can effectively deal with highly correlated variables and, at the same time, maintain the quality of model selection compared with multiple linear regressions for each microbe as the outcome variable. However, our study is not without limitations, which include the potential for measurement error in the diet and lifestyle variables. The potential for confounding by unmeasured variables or residual confounding by inadequately measured variables may not be completely removed even as we adjusted for several potential confounding factors. Microbiome studies are generally limited by the use of relative abundances rather than absolute concentrations (Goodrich et al., Reference Goodrich, Davenport, Beaumont, Jackson, Knight, Ober, Spector, Bell, Clark and Ley2016). Multiple software (bioinformatic pipelines) and algorithms are used in different studies with different limitations and biases (Nearing et al., Reference Nearing, Douglas, Hayes, MacDonald, Desai, Allward, Jones, Wright, Dhanani, Comeau and Langille2022; Prodan et al., Reference Prodan, Tremaroli, Brolin, Zwinderman, Nieuwdorp and Levin2020). We regressed the microbiome profile score on the aggregated genus level but not on the species, which may miss some signals in the microbial community. Therefore, metagenomic and metatranscriptomic data may be needed in future studies. Twin studies offer valuable insights into environment–genetic interactions within microbiomes (Goodrich et al., Reference Goodrich, Davenport, Beaumont, Jackson, Knight, Ober, Spector, Bell, Clark and Ley2016). However, in our study, we primarily focused on individual-level microbiome analysis, although we briefly explored twin-pair analysis in the smaller sample size. Future research, integrating microbiome and genetic data, is essential to further elucidate these interactions and their impacts on microbiome composition and variation.
Conclusion
Dietary patterns that exert metabolic effects on insulin and inflammation may influence disease risk and prognosis by modulating gut microbial composition and function via multiple biosynthetic and biodegradation pathways, providing insights on which microbes may be depleted and/or enriched in a dietary pattern intervention to reduce the insulinaemic or inflammatory activity of the diet.
Abbreviations
- EDIH
-
Empirical Dietary Index for Hyperinsulinaemia
- EDIP
-
Empirical Dietary Inflammatory Pattern (EDIP)
- FFQ
-
food frequency questionnaires
- HOMA-IR
-
homeostasis model assessment of insulin resistance
- CRP
-
C-reactive protein
- SD
-
standard deviation
- NDSR
-
Nutrition Data System for Research
- EPIC
-
European Prospective Investigation into Cancer and Nutrition
- FDA
-
Food and Drug Administration
- ASVs
-
amplicon sequence variants
- FDR
-
false discovery rate
- BMI
-
body mass index
- SCFA
-
Short-chain fatty acids
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/gmb.2024.14.
Acknowledgements
Funding: TwinsUK was funded by the Wellcome Trust, Medical Research Council, Versus Arthritis, European Union Horizon 2020, Chronic Disease Research Foundation (CDRF), Zoe Ltd, the UK National Institute for Health and Care Research (NIHR) Clinical Research Network (CRN), and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. CM was funded by the Chronic Disease Research Foundation.
Disclosure statement
We declare no competing interests.
Author contribution
Conceptualisation: F.K.T. and N.S. Methodology: N.S. and F.K.T. Formal analysis: N.S., S.N., and H.R. Data curation: F.K.T., N.S., and C.M. Writing-original draft: N.S. and F.K.T. Writing-review and editing: F.K.T., N.S., C.M., M.C.P., D.S., P.M.W., C.J.S., and S.K.C. Supervision: F.K.T. Funding acquisition: C.M.
Data availability statement
The data used in this study are held by the Department of Twin Research at King’s College London. The data can be released to bona fide researchers using our normal procedures, which are overseen by the Wellcome Trust and its guidelines, as part of our core funding (https://twinsuk.ac.uk/resources-for-researchers/access-our-data/).