Introduction
Diabetes is one of the significant causes of mortality and morbidity all around the world. The increasing prevalence of type 2 diabetes mellitus (T2DM) is accelerating the burden on healthcare systems (Safiri et al., Reference Safiri, Karamzad, Kaufman, Bell, Nejadghaderi, Sullman, Moradi-Lakeh, Collins and Kolahi2022; GBD 2021 Diabetes Collaborators, 2023). Numerous studies have shown that identifying individuals at high risk for diabetes and implementing lifestyle interventions can significantly reduce the risk of developing diabetes. In Finland, the Finnish Diabetes Risk Score (FINDRISC) has been used in primary healthcare, pharmacies, and nationwide advertising campaigns to identify high-risk individuals. Targeted interventions for these individuals led to a 58% reduction in diabetes incidence within 3 years (Saaristo et al., Reference Saaristo, Moilanen, Korpi-Hyövälti, Vanhala, Saltevo, Niskanen, Jokelainen, Peltonen, Oksa, Tuomilehto and Uusitupa2010; Rintamäki et al., Reference Rintamäki, Rautio, Peltonen, Jokelainen, Keinänen-Kiukaanniemi, Oksa, Saaristo, Puolijoki, Saltevo, Tuomilehto and Uusitupa2021).
In the context of primary prevention, various strategies have been employed to screen for diabetes risk factors and identify individuals at high risk within the general population. While the American Diabetes Association recommends fasting blood glucose (FBG) and oral glucose tolerance tests (OGTTs) to detect high-risk individuals and those with asymptomatic diabetes (American Diabetes Association Professional Practice Committee, 2021), screening the entire population using these methods is challenging, time-consuming, and not cost-effective. As a result, there has been a shift towards more cost-effective and non-invasive risk assessment tools that can be easily applied in primary care settings (Stern et al., Reference Stern, Williams and Haffner2002; Fagg and Valabhji, Reference Fagg and Valabhji2019). Hence, various predictive risk scores have been developed to assess an individual’s risk of diabetes.
FINDRISC is one of the widely recommended and internationally accepted non-invasive risk assessment tools. It estimates an individual’s likelihood of developing diabetes over a 10-year period by considering eight diabetes-associated predictors. The primary objective of this score is to identify individuals at high risk of diabetes and provide them lifestyle counselling to prevent the early onset of diabetes and reduce the likelihood of associated complications (Lindström and Tuomilehto, Reference Lindström and Tuomilehto2003). Since countries differ in the distribution of risk factors, lifestyle, and genetic composition, it is necessary to assess the applicability of these risk scores within specific populations (Glümer et al., Reference Glümer, Borch-Johnsen and Colagiuri2005). FINDRISC score has been validated in many European countries, including Sweden (Hellgren et al., Reference Hellgren, Petzold, Björkelund, Wedel, Jansson and Lindblad2012), Greece (Makrilakis et al., Reference Makrilakis, Liatis, Grammatikou, Perrea, Stathi, Tsiligros and Katsilambros2011), Spain (Soriguer et al., Reference Soriguer, Valdés, Tapia, Esteva, Ruiz de Adana, Almaraz, Morcillo, Rodríguez and Rojo-Martinez2012), Italy (Franciosi et al., Reference Franciosi, De Berardis, Rossi, Sacco, Belfiglio, Pellegrini, Tognoni, Valentini and Nicolucci2005), Bulgaria (Tankova et al., Reference Tankova, Chakarova, Atanassova and Dakovska2011), and Slovenia (Štiglic et al., Reference Štiglic, Fijačko, Stožer, Sheikh and Pajnkihar2016).
In this study, we aimed to validate the FINDRISC questionnaire in Turkish population and to investigate the development of country-specific model for Turkish people.
Method
Study design and data source
This retrospective cohort study utilized data from the Türkiye Chronic Diseases and Risk Factors Survey (TurCDRFs) (Ünal et al., Reference Ünal, Ergor, Dinc, Kalaca and Sozman2013) to evaluate the applicability of the FINDRISC in the Turkish population. The TurCDRF survey was a nationwide effort conducted by the Turkish Ministry of Health in 2011 to determine the prevalence of chronic diseases and associated risk factors. Data were collected by family physicians (FPs) from their registered populations, aged 15 years and older. Participants were invited to Family Health Centers (FHCs) for face-to-face interviews, physical examinations, and laboratory tests. The data were then entered into an electronic system (Ünal et al., Reference Ünal, Ergor, Dinc, Kalaca and Sozman2013).
The participants of the 2011 survey were electronically tracked through national databases until 2017 to monitor the development of chronic diseases, hospitalizations, medical prescriptions, and survival outcomes (Kara et al., Reference Kara, Keskinkilic, Ekinci, Altunay Ozkan, Sarioglu, Arikan and Navruz Kapusuz2021). The national data sources linked to baseline 2011 database included the Social Security Institution (SSI) and the National Death Notification System (NDNS). The SSI covers health expenses in Türkiye, including treatments, medicines, laboratory, and imaging tests. The NDNS records causes of death using ICD-10 codes, as determined by physicians.
Study population and diabetes diagnosis
The initial dataset consisted of 18477 participants aged 15 years and older in 2011. After excluding 4997 individuals with a prior history of T2DM and 1231 with missing data on FINDRISC variables, a total of 12249 participants were included in the study. Participants who were initially healthy in 2011 and were diagnosed with T2DM based on ICD-10 codes (E11–E14) at least twice during the 6-year follow-up period were classified as incident T2DM cases. Single entries of ICD-10 codes (E11–E14) were not considered sufficient for T2DM classification unless the individual had been hospitalized, received treatment, or had diabetes listed as a cause of death. Individuals with diabetes listed as the cause of death were also classified as incident cases.
Data collection and preparation for the analysis
FINDRISC was developed to identify high-risk individuals for diabetes and predict their 10-year diabetes risk using a questionnaire with eight measures: age (categorized, years), BMI (categorized, kg/m2), waist circumference (WC) (categorized, cm), physical activity (>30 minutes/day), daily intake of fruits/vegetables/berries, history of medication for hypertension, self-reported history of high blood glucose, and family history of diabetes. Scores on the FINDRISC questionnaire range from 0 to 26 points, with risk categories typically classified as low (0–6 points), slightly elevated (7–11 points), moderate (12–14 points), high (15–20 points), or very high (21–26 points). Higher scores indicate a greater risk of developing diabetes, making FINDRISC a valuable tool for individualized preventive interventions (Lindström and Tuomilehto, Reference Lindström and Tuomilehto2003; Saaristo et al., Reference Saaristo, Peltonen, Lindström, Saarikoski, Sundvall, Eriksson and Tuomilehto2005).
In the baseline TurCDRF survey, sociodemographic characteristics, lifestyle factors, and both family and personal medical histories were collected by FPs. Additionally, anthropometric measurements, including weight, height, and waist and hip circumference, were taken. Sociodemographic characteristics included age, gender, educational level, place of residence, and marital status. Age was categorized into four groups (<45, 45–54, 55–64, and ≥ 65 years) in accordance with the FINDRISC questionnaire. Educational level was classified into two categories: less than high school and high school or university. Place of residence was categorized as urban for populations exceeding 20000 and rural for populations of 20000 or less.
Lifestyle factors included smoking status, alcohol use, physical activity, type of bread consumed, type of oil/butter consumed, and additional salt intake. Smoking status was categorized as current smoking (regular or occasional) or not (non-smoker or ex-smoker). In the original TurCDRF dataset, physical activity was assessed in terms of intensity, frequency, and duration. Participants were categorized into three groups: adequate physical activity (engaging in moderate or vigorous activity at least five times a week for 30 minutes), moderate physical activity (1–4 times a week for 30 minutes), and low physical activity (less than once a week for 30 minutes or none). In alignment with the FINDRISC questionnaire, ‘Yes’ to engaging in 30 minutes of daily exercise was considered ‘adequate’, while ‘No’ was classified as ‘moderate to low’ physical activity. Daily consumption of vegetables and fruits was recorded as either ‘0 portions’ (no daily consumption) or ‘one or more portions’ (daily consumption). Participants reported their bread preferences, including white, whole-wheat, rye, and oat bread. For the type of oil/butter consumed, options included butter, margarine, olive oil, and other types such as sunflower, corn, soy, or hazelnut. Salt intake habits were assessed based on whether participants added salt to their meals at the dining table without tasting first. Participants’ weight, height, and WC were also measured by FPs during data collection in 2011.
Data on personal and family medical histories were based on self-report. Hypertension was assessed with the questions, ‘Have you ever been told by a physician that you have high blood pressure?’ and ‘Do you use any drugs regularly for this condition?’ (yes/no). Some variables in our database were reconstructed to align with the FINDRISC scoring system. For example, the FINDRISC questionnaire assesses a history of high blood glucose with the question, ‘Have you ever been found to have high blood glucose?’ (e.g., in a health examination, during an illness, or during pregnancy). In our study, this question was not self-reported; instead, individuals with impaired FBG levels detected through blood measurements taken in 2011 were considered to have answered ‘yes’. Additionally, while the FINDRISC questionnaire assigns different points for first-degree and second-degree relatives in the family history of diabetes, our study considered only first-degree relatives.
Statistical analysis
Categorical data were presented as number and percentages (n, %). In the comparison of categorical variables between incident T2DM group and the people who did not develop T2DM, Pearson’s chi-square test with Yates’ continuity correction was used. Receiver operating characteristics (ROC) curve analysis was used to find the optimal cut-off (Method: Youden index) of continuous FINDRISC score in the prediction of T2DM. For the external validation of FINDRISC in Turkish population, a multivariable logistic regression model with FINDRISC variables was built. To develop a country-specific model, we generated two separate models due to highly correlation between waist-to-hip ratio (WHR) and WC: WHR model and WC model. The least absolute shrinkage and selection operator (LASSO) algorithm was utilized to select potential variables that might impact the diagnosis of T2DM. In addition to FINDRISC variables, candidate variables for WHR model were gender, place of residence, education, marital status, current smoking, alcohol use, type of bread consumed, type of oil/butter consumed, additional salt intake, and WHR. For WC model, WHR was replaced by WC. The optimal regularization parameter (λ) was estimated by 10-fold cross-validation. Prior to selection procedure, missing data on place of residence, education, marital status, current smoking, alcohol use, type of bread consumed, type of oil/butter consumed, additional salt intake, and WHR were imputed using logistic regression or polytomous logistic regression. Discrimination indexes of three models (area under the curve [AUC] and Brier index) were compared. Risk measure was odds ratio (OR) with its 95% confidence interval (95% CI). Statistical analysis and visualizations were performed with R version 4.3.1 (A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/).
Ethical considerations
An official permission to use anonymized data was obtained from the Ministry of Health of Turkiye (Date: 28.08.2022 No: E-96867468-151.01). The study was approved with decision number 2022/30-15 dated September 21, 2022, by Non-Interventional Research Ethics Committee of Dokuz Eylul University.
Results
The study included a total of 12249 participants, of whom 505 developed incident T2DM during the study period. The 6-year cumulative incidence was 4122.8 per 100000 people aged ≥15 years. Among individuals in the high and very high-risk groups, the proportion developing T2DM was 10.9% and 17.8%, respectively. The optimal FINDRISC cut-off score for predicting T2DM was found to be 8.5 (AUC, 95% CI: 0.75, 0.73–0.77; sensitivity: 78.4%, specificity: 59.0%, positive predictive value (PPV): 7.6%, negative predictive value (NPV): 98.5%).
Figure 1 presents violin plots illustrating the distribution of FINDRISC scores for age group, BMI, WC, and WHR, further stratified by gender. The plots reveal that FINDRISC scores were generally lower in younger age groups, with the distribution widening as age increased, reflecting greater variability in diabetes risk among older individuals. In the older age categories, females showed higher risk scores. Additionally, median FINDRISC scores increased with higher BMI categories. Scores were also higher in categories with larger WC and elevated WHR, with females exhibiting higher scores in these categories.

Figure 1. Violin plots of FINDRISC scores for age group (A), BMI (B), waist circumference (C), and waist-to-hip ratio (D), stratified by gender. Comparisons by gender are labelled as follows: **p ≤ 0.01, ***p ≤ 0.001, ****p ≤ 0.0001.
Table 1 presents descriptive characteristics of the study population and incident T2DM cases by groups. A higher proportion of females (4.7%) developed T2DM compared to males (3.4%). The incidence of T2DM also increased with age, with the highest rates observed in individuals aged 55–64 years (8.1%) and those over 64 years (7.2%). Additionally, participants with a family history of diabetes, elevated blood glucose levels, higher WC, high WHR, and elevated BMI were more likely to develop T2DM.
Table 1. General characteristics of the population included in the study and incident type 2 diabetes mellitus cases by groups

*Over total n, **row percentage.
Results from the multivariable logistic regression model, which included only FINDRISC variables, are summarized in Table 2. Age was a significant predictor across all age subgroups. Participants with a BMI over 30 kg/m2 had significantly higher odds of developing T2DM (OR: 1.94, 95% CI: 1.36–2.77). A WC exceeding 102 cm in males or 88 cm in females was also strongly associated with an increased risk of T2DM (OR: 2.22, 95% CI: 1.57–3.15). Additionally, the use of antihypertensive medication (OR: 1.91, 95% CI: 1.52–2.38), high blood glucose levels (OR: 1.80, 95% CI: 1.47–2.19), and a family history of diabetes (OR: 1.43, 95% CI: 1.18–1.73) were significantly associated with a higher likelihood of developing T2DM (Table 2).
Table 2. Univariate and multivariate logistic regression models for type 2 diabetes mellitus with FINDRISC variables

BMI: body mass index, DM: diabetes mellitus, OR: odds ratio, aOR: adjusted odds ratio.
LASSO regression was used to identify potential predictors of diabetes in the Turkish population, incorporating models separately using WHR and WC, along with other study variables. In both the WHR and WC models, significant predictors included age category, education, marital status, family history of diabetes, antihypertensive medication use, BMI >30 kg/m2 category, and high blood glucose levels. The ORs (95% CI) for each variable are presented in Table 3. High WHR was significantly associated with an increased risk of T2DM (OR: 1.40, 95% CI: 1.14–1.73), while being in a high WC category (>102 cm for males, >88 cm for females) was also a strong predictor of T2DM (OR: 2.21, 95% CI: 1.57–3.13) in the WC model.
Table 3. Prediction models with selected variables for identifying type 2 diabetes mellitus in Turkish population

Variable selection process was based on the least absolute shrinkage and selection operator (LASSO) algorithm.
The potential variables for the WHR model included gender, age category, place of residence, education, marital status, family history of diabetes, current smoking, alcohol use, antihypertensive medication use, type of bread consumed, type of oil/butter consumed, additional salt intake, intake of vegetables/fruits/berries, BMI category, WHR, and high blood glucose.
The potential variables for the WC model were the same as those in the WHR model, with the substitution of waist circumference for the WHR.
The FINDRISC score demonstrated good performance in predicting T2DM within a 6-year period, with an AUC of 0.76 (95% CI: 0.74–0.78). Both the WHR and WC models showed similar predictive performance, with an AUC of 0.77 (95% CI: 0.74–0.79) and a Brier score of 0.038 for both models (Figure 2).

Figure 2. Receiver operating characteristic (ROC) curve analysis of predicted probabilities for type 2 diabetes mellitus, with model discrimination indexes. Multivariable model with only FINDRISC variables (A), waist-to-hip ratio (WHR) model with LASSO-selected variables (B), and waist circumference (WC) model with LASSO-selected variables (C).
Discussion
In this study, the performance of FINDRISC score was demonstrated well in Turkish population, showing an AUC of 0.76 (sensitivity: 73.5%, specificity: 67.6%). When comparing the performance of the FINDRISC score across various populations, previous studies have reported AUC values ranging from 0.63 to 0.87 (Lindström and Tuomilehto, Reference Lindström and Tuomilehto2003; Omech et al., Reference Omech, Mwita, Tshikuka, Tsima, Nkomazna and Amone-P’Olak2016; Lim et al., Reference Lim, Chia and Koay2020). These values generally fell below the performance observed in the Finnish population (AUC: 0.85). However, FINDRISC demonstrated similar performance in the majority of studies, including Malaysia (AUC: 0.76) (Lim et al., Reference Lim, Chia and Koay2020), Spain (AUC: 0.75) (Salinero-Fort et al., Reference Salinero-Fort, Burgos-Lunar, Lahoz, Mostaza, Abánades-Herranz, Laguna-Cuesta, Estirado-de Cabo, García-Iglesias, González-Alegre, Fernández-Puntero and Montesano-Sánchez2016), Norway (AUC: 0.77) (Jølle et al., Reference Jølle, Midthjell, Holmen, Carlsen, Tuomilehto, Bjørngaard and Åsvold2019), and a European cohort (AUC: 0.74) (Gabriel et al., Reference Gabriel, Acosta, Florez, Anillo, Navarro, Boukichou, Acosta-Reyes, Barengo, Lindström, Tuomilehto and Aschner2021).
Two separate county-specific models, one incorporating WHR and the other incorporating WC, also demonstrated good performance in predicting 6-year T2DM, each with an AUC of 0.77. The WHR model had a sensitivity of 71.1% and specificity of 70.5%, while the WC model had a sensitivity of 75.1% and specificity of 67.7%. A study from China developed a simplified prediction model for Chinese population and found an AUC of 0.67 (sensitivity: 84.2%, specificity: 39.8%), while FINDRISC score showed an AUC of 0.66. In contrast to our findings, both the Chinese simplified diabetes risk score and the FINDRISC score demonstrated relatively lower performance (Gao et al., Reference Gao, Dong, Pang, Nan, Wang, Ren, Zhang, Tuomilehto and Qiao2010). This difference may be attributed to variations in population characteristics, study designs – often involving cross-sectional studies – and sample sizes.
In this study, we found similar performance between the FINDRISC and country-specific models in predicting T2DM within the Turkish population. Therefore, both the FINDRISC score and our models appear suitable for determining diabetes risk in the Turkish population. When estimating diabetes risk, commonly considered predictors include age, gender, BMI, WC, physical activity, dietary habits, hypertension, and a family history of diabetes (Lindström and Tuomilehto, Reference Lindström and Tuomilehto2003). However, we expanded on this by incorporating sociodemographic factors such as marital status, educational level, and place of residence, along with additional lifestyle-related variables like smoking, alcohol consumption, type of bread consumed, type of oil/butter consumed, and additional salt intake.
In a meta-analysis comparing the predictive abilities of BMI, WC, and WHR for diabetes incidence, it was found that despite the high correlation between BMI and WC, and the relatively low correlation of WHR with these measures, all three variables showed similar predictive performance in estimating diabetes incidence (Vazquez et al., Reference Vazquez, Duval, Jacobs and Silventoinen2007). In our study, we observed that WC (94–102 (M); 80–88 (F), OR: 1.66, 95% Cl: 1.20–2.30; >102 (M); 88 (F) OR: 2.21, 95% Cl: 1.57–3.13) contributed to a higher increase in diabetes risk compared to WHR (OR: 1.40, 95% Cl: 1.14–1.73). However, when we assessed the overall performance of the models, both WC and WHR models demonstrated equivalent predictive power.
Our findings indicated that being married and having a higher level of education were associated with an increased risk of diabetes. The relationship between marriage and diabetes is complex. Previous studies have shown that the risk of diabetes was higher in single, divorced men, and widowed women (Kposowa et al., Reference Kposowa, Aly Ezzat and Breault2021). However, in our study, the increased risk among married individuals may be explained by the possibility that spouses provide support, leading to better healthcare access and higher diagnosis rates among married individuals.
In the literature, lower educational levels are often associated with an increased risk of diabetes (Sezer et al., Reference Sezer, Lafçi, Korkmaz and Dağdeviren2021). However, our study found that individuals with higher educational background had a higher risk of developing diabetes. A study conducted in the USA also found that higher educational levels were associated with an increased the risk of T2DM (Zhang et al., Reference Zhang, Zhang, Zhang, Hu and Chen2014). This paradoxical finding may be attributed to the fact that individuals with higher education are more aware of health issues and may seek medical care more frequently, leading to higher diagnosis rates. Therefore, targeted interventions may be needed for disadvantaged groups, such as those with lower educational attainment and participants with low social support.
In this study, multivariable logistic regression analysis was conducted in externally validate FINDRISC variables. Both physical activity (OR: 0.96, 95% Cl: 0.77–1.22) and daily intake of fruit and vegetables (OR: 0.87, 95% Cl: 0.49–1.43) were not statistically significant predictors of T2DM. Simplified versions of the FINDRISC model have been evaluated and found to perform similarly or better in different populations. For instance, simplified versions of FINDRISC were successfully implemented in Spain (Salinero-Fort et al., Reference Salinero-Fort, Burgos-Lunar, Lahoz, Mostaza, Abánades-Herranz, Laguna-Cuesta, Estirado-de Cabo, García-Iglesias, González-Alegre, Fernández-Puntero and Montesano-Sánchez2016) (BMI, history of high blood glucose, and antihypertensive medication use), an European cohort (Mavrogianni et al., Reference Mavrogianni, Lambrinou, Androutsos, Lindström, Kivelä, Cardon, Huys, Tsochev, Iotova, Chakarova and Rurik2019) (age, BMI, antihypertensive medication use, and history of high blood glucose), and Burkina Faso (Traoré et al., Reference Traoré, Paré, Dabourou, Guira, Sagna, Kamouni, Zoungrana, Bognounou, Tiéno and Drabo2021) (age, daily physical activity, antihypertensive medication use, WC, and BMI). These results indicate that there may be additional factors or modifications specific to our population that should be integrated into the risk assessment model for more accurate predictions.
This is the first study to evaluate FINDRISC performance in Turkish population using a large cohort. Additionally, it contributes to the development of a diabetes risk score specifically modelled for the Turkish population. However, there are several limitations to consider. First, T2DM diagnoses were based on admissions to healthcare facilities in this cohort. As a result, some diagnoses might have been missed, particularly among individuals who do not regularly attend clinics. Second, while the FINDRISC score was originally developed to predict a 10-year risk of diabetes, our study only evaluated a 6-year time frame. This shorter assessment period may have influenced the accuracy of risk predictions. Furthermore, the TurCDRF study did not collect information on second-degree family history, which may have led to the misclassification of individuals with a second-degree family history of diabetes. As a result, we may have underestimated the impact of family history on diabetes risk in the study population. Lastly, the study does not account for genetic predisposition to diabetes. Incorporating genome-wide association studies in future research could provide more personalized and accurate diabetes risk predictions.
Conclusion
In conclusion, the assessment of the FINDRISC score’s performance in the Turkish population has confirmed its effectiveness in predicting T2DM. Additionally, the country-specific models exhibited discrimination indexes similar to FINDRISC. Reevaluating and tailoring diabetes risk scores for diverse populations have the potential to improve the accuracy of risk assessment. Consequently, both the FINDRISC score and our models serve as suitable tools for identifying T2DM risk in the Turkish population.
Acknowledgements
The study used data from the national Turkiye Chronic Diseases and Risk Factors Survey commissioned by the MoH Turkiye.
Funding statement
No funding was received for conducting this study.
Competing interests
The authors have no relevant financial or non-financial interests to disclose.