We were interested to read the paper by Mullie et al.( Reference Mullie, Deforche and Mertens 1 ) published in July 2018 issue of British Journal of Nutrition. The aim of the authors was to investigate the reproducibility of glycaemic index (GI) and glycaemic load (GL) using data obtained from a prospective cohort study with two nutritional assessments in 2002 and in 2012. For GI and GL, the reproducibility and cross-classification of quintiles between the two measurement periods were tested with Pearson’s correlation coefficient( Reference Mullie, Deforche and Mertens 1 ).
It is important to bear in mind that in all fields of research, validity (accuracy) and reliability (precision) are two important methodological issues that are completely different, and these issues are assessed using appropriate tests( Reference Sabour 2 ).
Authors investigated the association of dietary GI and dietary GL in 2012 with dietary GL and dietary GI in 2002, respectively( Reference Mullie, Deforche and Mertens 1 ). It refers to validity and has nothing to do with reliability( Reference Bland and Altman 3 ). Also, authors investigated the association of dietary GI and dietary GL in 2012 with dietary GI and dietary GL in 2002, respectively (that refers to reproducibility) using Pearson’s correlation coefficient( Reference Mullie, Deforche and Mertens 1 ). The Pearson correlation coefficients between 2002 and 2012 were 0·27 for GI and 0·41 for GL. For quantitative variables, applying Pearson’s correlation and paired t test are among common mistakes in reliability (reproducibility) analysis( Reference Sabour 2 ).
Pearson’s correlation assumes that the relationships between variables are linear. So, it just measures linearity but fails to detect non-reliability (departure from the 45° line). Applying Pearson’s correlation, high correlation coefficients are possible with far from reproducibility (reliability)( Reference Sabour 2 ). Therefore, Pearson correlation coefficient is not an appropriate statistical test for reproducibility analysis. Briefly, weighted κ are good estimates for quantitative variables, intra-class correlation coefficient or Bland–Altman plot, and for qualitative variables with more than two categories( Reference Sabour 2 – Reference Sabour 6 ).
The authors mentioned that low 10-year reproducibility should be taken into account in prospective cohort studies with only one nutritional assessment at baseline.
In terms of reproducibility (reliability), such a sweeping conclusion may provide a misleading message. In conclusion, for reliability analysis, appropriate tests as well as correct interpretation should be applied. Otherwise, misinterpretation cannot be avoided.
Acknowledgements
This research received no specific grant from any funding agency, commercial or not-for-profit sectors.
S. I. wrote the paper. S. I. and S. S. share responsibility for final content.
There are no conflicts of interest.