Skip to main content

Diabetic peripheral neuropathy detection of type 2 diabetes using machine learning from TCM features: a cross-sectional study

Abstract

Aims

Diabetic peripheral neuropathy (DPN) is the most common complication of diabetes mellitus. Early identification of individuals at high risk of DPN is essential for successful early intervention. Traditional Chinese medicine (TCM) tongue diagnosis, one of the four diagnostic methods, lacks specific algorithms for TCM symptoms and tongue features. This study aims to develop machine learning (ML) models based on TCM to predict the risk of diabetic peripheral neuropathy (DPN) in patients with type 2 diabetes mellitus (T2DM).

Methods

A total of 4723 patients were included in the analysis (4430 with T2DM and 293 with DPN). TFDA-1 was used to obtain tongue images during a questionnaire survey. LASSO (least absolute shrinkage and selection operator) logistic regression model with fivefold cross-validation was used to select imaging features, which were then screened using best subset selection. The synthetic minority oversampling technique (SMOTE) algorithm was applied to address the class imbalance and eliminate possible bias. The area under the receiver operating characteristic curve (AUC) was used to evaluate the model's performance. Four ML algorithms, namely logistic regression (LR), random forest (RF), support vector classifier (SVC), and light gradient boosting machine (LGBM), were used to build predictive models for DPN. The importance of covariates in DPN was ranked using classifiers with better performance.

Results

The RF model performed the best, with an accuracy of 0.767, precision of 0.718, recall of 0.874, F-1 score of 0.789, and AUC of 0.77. With a value of 0.879, the LGBM model appeared to be the best regarding recall Age, sweating, dark red tongue, insomnia, and smoking were the five most significant RF features. Age, yellow coating, loose teeth, smoking, and insomnia were the five most significant features of the LGBM model.

Conclusions

This cross-sectional study demonstrates that the RF and LGBM models can screen for high-risk DPN in T2DM patients using TCM symptoms and tongue features. The identified key TCM-related features, such as age, tongue coating, and other symptoms, may be advantageous in developing preventative measures for T2DM patients.

Peer Review reports

Introduction

Type 2 diabetes mellitus (T2DM) has become a worldwide epidemic due to a complex interplay of genetic predisposition, excessive food intake, and environmental factors [1, 2]. It is estimated that around 463 million adults aged 20–79 have been diagnosed with diabetes, resulting in an alarming incidence rate of 9.3% [1]. Notably, China has the highest number of diabetic patients globally, with approximately 116 million affected adults, the majority of whom have T2DM [3]. 2DM is a chronic metabolic disorder characterized by high blood sugar levels and resistance to insulin. Poor control of blood sugar can lead to complications affecting the heart, kidneys, and nervous system [4]. Among these complications, diabetic peripheral neuropathy (DPN) is one of the most prevalent chronic conditions associated with diabetes. It often leads to sensory and motor dysfunction in the limbs along with sleep disturbances, depression, reduced quality of life, and impaired social functioning [5]. As the disease progresses, approximately half of individuals with diabetes will develop DPN making it the primary cause for foot ulcers as well as disability and amputation rates [5,6,7]. The significant morbidity and disability rates associated with DPN impose substantial physical and economic burdens on individuals affected by this condition [8, 9]. Unfortunately, DPN tends to manifest gradually without noticeable symptoms until it reaches an irreversible stage [10, 11]. Numerous academic studies have emphasized the importance of achieving better control over blood sugar levels in patients with T2DM in order to reduce long-term morbidity related to neuropathy [12,13,14]. Therefore there is an urgent need for increased awareness and early intervention strategies aimed at preventing or mitigating the debilitating consequences caused by DPN in individuals living with T2DM.

Nerve conduction studies are currently considered as highly reliable diagnostic tools for identifying DPN. These tests not only help diagnose DPN [15, 16]. However, conducting these studies can be costly, time-consuming, labor-intensive, and impractical for routine clinical care. Currently, standard clinical assessments utilize scored clinical evaluations and bedside tests to diagnose DPN in routine clinical practice. Unfortunately, by the time these basic tests detect neuropathy, it is often too advanced to reverse or halt its progression [10]. Early detection and diagnosis of diabetes and its chronic complications are crucial for effective patient treatment and recovery. The ancient Chinese medical text Huangdi Neijing documented the early diagnosis and treatment of diabetes over 2000 years ago [17]. In traditional Chinese medicine (TCM), diabetes is referred to as "Xiaoke" disease which is believed to result from "Yin deficiency", leading to excessive thirst and urination. TCM has accumulated extensive diagnostic experience with a systematic procedure and standard for diagnosing this condition [18, 19]. TCM primarily relies on tongue diagnosis to identify imbalances in Yin-Yang energy [19], which aids in early-stage disease detection. TCM stands out due to its personalized approach towards syndrome differentiation and treatment [20]. By incorporating elements of TCM into a clinical prediction model, an accurate digital risk assessment can be provided that highlights the benefits of individualized therapy. The knowledge gained from syndrome differentiation and treatment in TCM can serve as evidence-based support for preventing and treating DPN in patients with type 2 diabetes mellitus.

In our previous study, we utilized LASSO regression to screen variables and subsequently developed a clinical nomogram. The findings indicated that TCM indicators, including age, smoking, sweating, and purple tongue, demonstrated strong predictive efficacy for DPN [21]. Building upon these results, the current study employs the SMOTE algorithm to address data imbalance between the two groups. We established predictive models for DPN using four machine learning methods and evaluated their performance using the AUC. The final results indicate that the RF and LGBM algorithms exhibit superior performance, and the feature importance ranking is used to optimize the limitations of the variables in the previous study. Finally, the 5 most important features of predicted DPN were screened out and ranked.

Managing DPN should prioritize individualization; however, physicians often encounter challenges during the decision-making process due to increasing patient numbers, variations among individuals, as well as the complex nature of DPN itself. In medicine, machine learning techniques have emerged as a promising strategy for addressing these challenges effectively [22]. ML offers several advantages compared to traditional statistical methods, such as the ability to learn from multiple data sources, improved identification of variables associated with clinical outcomes, enhanced predictive performance, better modeling of complex relationships, and resilience against data noise [23]. The objective of this study is to develop a straightforward ML-based screening model for DPN that integrates clinical signs from TCM and laboratory indicators. We analyzed data from 4723 subjects in order to achieve this goal.

Materials and methods

Study Population

This cross-sectional study was conducted between January 2019 and October 2020 at the Endocrinology Department and TCM Surgery Department of The Second Affiliated Hospital of Tianjin University of TCM, Tianjin, China. Prior to inclusion in the study, all participants provided written informed consent. T2DM was confirmed based on a previous diagnosis reported at the Second Affiliated Hospital of Tianjin University of TCM, or fasting plasma glucose (FPG) levels ≥ 7.0 mmol/L and/or random plasma glucose (RPG) levels ≥ 11.1 mmol/L. DPN was confirmed based on screening methods using a 128 Hz tuning fork and 10 g monofilaments [6, 10]. Exclusion criteria for the study included the following: (1) age below 18 years old; (2) failure to complete the questionnaire; (3) pregnant or lactating women; (4) inability to cooperate with the complete tongue image collector; (5) incomplete clinical data. After rigorous data filtration, a total of 4723 subjects were included in the study. Clinical trial number: not applicable.

Data Collection

During the interview, trained investigators from the School of Health Sciences and Engineering at Tianjin University of TCM administered the standardized questionnaire (Information Record Form of TCM Clinical Four Diagnostics) developed by Shanghai University of TCM [24]. Studies related to the use of this questionnaire have been published [21]. The questionnaire included seven physicochemical indexes: age, sex, BMI, FPG, random plasma glucose, and smoking status. BMI was calculated by dividing weight in kilograms by the square of height in meters. Clinical symptom information, such as "fatigue", "sigh", "irritability", "forgetfulness", "insomnia", "sweating", "loose teeth", "dry skin", "dry mouth", "polydipsia", "thirst without drinking much", "polyuria", and "frequency of urine", was gathered using the questionnaire, which was filled out by the investigators with responses coded as "yes" (1) or "no" (0). Figure 1 illustrates the flowchart for the study.

Fig. 1
figure 1

The flowchart of this study. LASSO, least absolute shrinkage and selection operator; LGBM, light gradient boosting machine; SVC, supporting vector classifier; LR, logistic regression; RF, random forest

Collection of tongue features

Two investigators identified tongue features and recorded the findings in the questionnaires. Using the Tongue Diagnosis Analysis-1 (TFDA-1) instrument developed by the national key research and development plan, tongue images were collected [25]. Tongue images were taken in the morning (8 a.m. to 9:30 a.m.) when there was no food and in a fixed light source room, where subjects were asked to rinse their mouths before they were taken. The researchers first set the shooting parameters, instructed the subjects to maintain emotional stability, rested their chin on the corresponding position on the tongue diagnostic machine, and easily stretched out their tongue to relax. The researcher then clicked on the middle of the tongue on the instrument screen to complete the acquisition. Two experienced TCM experts were invited to review the images back-to-back. In case of disagreement, the final decision would be made by a third expert. Crimson tongue, purple tongue, dark red tongue, enlarged tongue, spotted tongue, teeth-marked tongue, fissured tongue, yellow coating, less coating, thick coating, and greasy coating were the tongue variables. Crimson tongue, purple tongue and dark red tongue are the color of the tongue; enlarged tongue, spotted tongue, teeth marked tongue and fissured tongue are the state of the tongue; yellow coating, less coating, thick coating and greasy coating are the state of tongue coating.

Statistical analysis

SPSS (version 26.0), R (version 4.1.3), Python (version 3.7.3), and STATA (version 15.0) were utilized for statistical analysis. Normally distributed continuous variables were presented as means ± standard deviation. As appropriate, the chi-squared test or Fisher's exact test was used to compare differences between categorical variables. All reported statistical significance levels were two-sided, with statistical significance set at 0.05. This study used the least absolute shrinkage and selection operator (LASSO) to select the most valuable candidate variables, which were then used to build the ML models. T2DM was more prevalent than DPN, resulting in class imbalances. The synthetic minority over-sampling technique (SMOTE) algorithm was employed to address this problem. Using the area under the receiver operating characteristic curve (AUC), we compared the performance of the four constructed models. The model with the highest AUC was considered the optimal model. Finally, the variable importance ranking of the optimal model was then displayed, and a nomogram model was constructed to identify T2DM patients at risk of developing DPN.

Feature selection

The LASSO regression model was utilized to screen the final input features for ML models. This method minimizes the LASSO cost function and selects features with non-zero coefficients to produce a subset of variables for further analysis. Additionally, best subset selection was applied to all 29 variables. The SMOTE algorithm was utilized to address the class imbalance caused by the larger number of subjects with T2DM compared to DPN.

Data imbalance processing

The SMOTE algorithm synthesizes new samples by analyzing a small number of samples and adding them to the dataset, effectively resolving the problem of model overfitting associated with random oversampling [26]. The fundamental concept of the SMOTE algorithm is to synthesize new samples by analyzing a small number of samples and adding them to the dataset [27]. This algorithm addresses the class imbalance caused by model overfitting [23].

Machine learning algorithms

Four supervised machine learning algorithms, namely logistic regression (LR), random forest (RF), support vector classifier (SVC), and light gradient boosting machine (LGBM), were utilized to establish DPN prediction models. The model with the optimal performance was chosen for further analysis. LR is a classification algorithm that provides probabilities between 0 and 1 and establishes a relationship between features and outcome probabilities. In addition, it provides baseline accuracy scores relative to other non-parametric ML models [28]. LR provides baseline accuracy scores compared to other non-parametric ML models [29]. RF is an ensemble algorithm that combines multiple decision trees and can be used for regression and classification tasks [30]. SVC is used for finding hyperplanes in N-dimensional spaces and is particularly effective for distinguishing data points with a hyperplane that maximizes the edge distance [31]. Based on decision trees, LGBM is used for various ML tasks; it applies gradient-based one-side sampling (GOSS) to estimate information gain and reduce the number of data instances during training without compromising accuracy [32].

Performance measurement

Using a balanced dataset generated by the SMOTE algorithm, we conducted a comprehensive and quantitative evaluation of the performance of our machine learning models, The dataset was randomly divided into training and validation sets for binary classification tasks. We used a confusion matrix and five standard metrics, including accuracy, precision, recall, F-1 score, and receiver operating characteristic (ROC), which were calculated using Eqs. (1)– (4) to evaluate the recognition ability of the models.

$$Accuracy=\frac{TP+TN}{TP+TN+FP+FN}$$
(1)
$$Precision=\frac{TP}{TP+FN}$$
(2)
$$Recall=\frac{TP}{TP+FP}$$
(3)
$$F-1=\frac{2 \times Precision\times Recall }{Precision+Recall}$$
(4)

Feature ranking algorithms

The features were independently ranked through feature ranking algorithms without using ML algorithms. The distinct and stable features were selected based on their rank scores [33, 34]. Feature ranking is a filtering method that can be affected by the classification algorithm. However, it can also generate hypotheses to gain insight into the factors influencing the prediction [34, 35].

Results

Baseline characteristics

This study included 4723 eligible diabetic participants, including 4430 T2DM patients and 293 DPN patients. Three clinical variables, including physicochemical indexes, TCM symptoms, and tongue features, were analyzed using a standardized questionnaire with categorical variables between 2019 and 2020. The participants' baseline characteristics are listed in Table 1. The DPN group had more participants aged > 60 than the T2DM group. The most common BMI range was 24.6–28 (42.3%), and the proportion of males (57.3%) in the DPN group was greater than that of females (42.7%). Significantly more smokers were present in the DPN group (44.4%) than in the T2DM group (21.9%). Age, BMI, gender, smoking status, insomnia, sweating, loose teeth, dry skin, polyuria, purple tongue, dark red tongue, yellow coating, and greasy coating were significantly associated with DPN incidence (p < 0.05).

Table 1 Baseline characteristics of the participants

Features selected by LASSO regression

Using a balanced dataset generated by the SMOTE algorithm, the performance of the proposed ML models was evaluated thoroughly and quantitatively (Table 2). The dataset consisted of 8840 samples, evenly balanced at 1:1 for positive and negative cases, with 6202 samples used for training and 2658 samples for validation (7:3). All dataset attributes were found to be statistically significant (p < 0.05).

Table 2 Dataset description by SMOTE algorithm

As shown in Table 2 and Fig. 2, we used LASSO regression to reduce the number of features from 29 to 12 and identify the most significant features associated with DPN incidence. Among these 12 features, age, smoke, insomnia, sweating, loose teeth, dry skin, polyuria, purple tongue, dark red tongue, yellow coating, thick coating, and greasy coating were found to be significantly associated with DPN (Fig. 2 and Supplemental Fig. 1). Using the best subset selection method, we narrowed the selection of features further, resulting in eight features, including age, smoking, sweating, loose teeth, dry skin, purple tongue, dark red tongue, and greasy coating, as visualized in the Nomogram diagram (Supplemental Fig. 2). Considering the clinical significance of TCM, we obtained a prediction model containing 12 DPN-related features.

Fig. 2
figure 2

LASSO for feature selection. a The LASSO plot was produced based on the log (λ) sequence and according to the fivefold cross-validation that resulted in 12 nonzero coefficients. b Dotted vertical lines were drawn at the optimal values using fivefold cross-validation using the minimum criteria and 1-SE criteria

SMOTE algorithm validation

Using the SMOTE algorithm, we obtained a dataset containing 8840 samples that was balanced in a 1:1 ratio (Table 2), with 6202 training subjects and 2658 validation subjects (7:3). All attributes were statistically significant (p < 0.05).

Model performance comparison

As shown in Table 3, we compared the performance of four ML models, namely RF, LGBM, SVC, and LR. The RF model achieved the highest accuracy of all models with 0.767, followed by LGBM with 0.758, SVC with 0.714, and LR with 0.674. Similar trends were observed for other performance metrics, including precision, F-1 score, and AUC, except for recall, where RF consistently outperformed the other models. Regarding recall, LGBM demonstrated the highest predictive power with a value of 0.879, followed by RF with 0.874, SVC with 0.855, and LR with 0.709. The confusion matrix in Fig. 3 visually depicted the performance of the models, with darker colors indicating higher true negatives and true positives. Overall, the RF model showed the best performance, with an accuracy of 0.767, precision of 0.718, recall of 0.874, F-1 score of 0.789, and AUC of 0.77 (Fig. 4).

Table 3 Performance comparison of the four ML models
Fig. 3
figure 3

The confusion matrix of the four ML models

Fig. 4
figure 4

The receiver operating characteristics (ROC) curves of all algorithms. Logistic, logistic regression; SVC, supporting vector classifier; LGBM, light gradient boosting machine

Feature importance

Further analysis of feature importance using relative importance value or feature importance score revealed that age, sweating, dark red tongue, insomnia, and smoking were the top five most important features in the RF model. In contrast, the top five most essential features in the LGBM model were age, yellow coating, loose teeth, smoking, and insomnia (Fig. 5). In summary, the results of this study demonstrated that the RF and the LGBM models performed the best in classification performance, with age, sweating, dark red tongue, insomnia, and smoking identified as the top important features associated with DPN in both models.

Fig. 5
figure 5

The importance rankings. a The relative importance ranking of RF; the top ten were age, sweating, dark red tongue, insomnia, smoke, greasy coating, thick coating, yellow coating, dry skin, loose teeth, purple tongue, and polyuria. b The importance ranking of LGBM; the top ten were age, yellow coating, loose teeth, smoke, insomnia, dry skin, thick coating, purple tongue, greasy coating, sweating, polyuria, and dark red tongue

Discussion

In this cross-sectional study involving individuals with T2DM, we employed four different machine learning algorithms to predict the risk of developing DPN. Among these models, the RF algorithm utilizing 12 features demonstrated the highest accuracy in predicting DPN incidence. Its performance metrics were as follows: accuracy = 0.767, precision = 0.718, recall = 0.874, F-1 score = 0.789, and AUC = 0.768. These results suggest that our developed predictive model can effectively screen individuals at high risk for DPN and contribute towards preventing and managing diabetes complications. This finding aligns with previous studies [36, 37] and serves as a successful application of ML in predicting DPN risk within TCM research context.

Previous research on prediction models for DPN has primarily focused on laboratory indicators while neglecting TCM symptoms and tongue features [38]. Our study stands out as the first in China to specifically target TCM-related indicators and employ machine learning techniques for building predictive models based on TCM symptoms and tongue features when it comes to DPN prediction among individuals with diabetes mellitus type 2 (T2DM). With advancements in artificial intelligence technology, unsupervised and supervised machine learning approaches have gained popularity within the medical field especially for predicting diabetes complications [39,40,41].

The analysis of feature importance revealed the significant contributions of various model features. The study identified age and smoking as potential risk factors for DPN. Previous research conducted in Beijing, China found that individuals aged 40 years or older were at a higher risk for DPN among diabetic patients [42]. In a Chinese cross-sectional study of diabetic patients, the prevalence of DPN increased with age [43, 44]. Similarly, a cohort study in Bangladesh demonstrated that smoking was associated with microvascular complications in diabetic patients [36]. Another study suggested that impaired sweat secretion could increase the likelihood of foot ulcers among those with diabetes and DPN [45]. Patients with DPN and abnormal sweating function had a 15-fold increased risk of foot ulcers [46]. A visual indicator plaster method has been used to diagnose DPN by assessing foot skin dryness [47]. while Sudoscan technology can quantitatively analyze sweating function to assess neurological impairment of sweat glands [48]. Patients with diabetes often exhibit oral manifestations such as periodontal disease, tooth loss, dental caries, dry mouth, delayed wound healing, and taste dysfunction [49]. The bidirectional relationship between periodontitis and diabetes mellitus is well-established [50]. Thus, it is essential to promote proper dental care and tooth retention among adults with diabetes mellitus [51]. Insomnia has been linked to an increased risk of type 2 diabetes according to observational studies and Mendelian randomization studies [52,53,54]. This study confirmed an association between loose teeth and insomnia in older patients with DPN. Additionally, a previous epidemiological study conducted in Japan found a correlation between yellow tongue coating and a higher prevalence of diabetes mellitus [55]. According to traditional Chinese medicine (TCM), DPN is often accompanied by blood stasis [56]. Additionally, Morita A et al. objectively evaluated the relationship between blood stasis and patients with a dark purple tongue [57]. As demonstrated by previous studies, tongue performance is crucial for detecting blood stasis [58]. Specifically, patients with T2DM exhibit increased tongue manifestations of blood stasis associated with severe arterial stiffness [59]. Consistent with previous findings, we observed a preference for dark red tongue color and yellow coating in patients with DPN in our study. Our study is the first to develop four classical machine learning methods for predicting DPN using TCM-related features. Xia et al. compared three classical ML methods and found that the RF model best diagnosed metabolic syndrome [60]. Some Traditional Chinese Medicine (TCM) indicators, such as chest constriction, spontaneous sweating, wiry pulse, and tongue coating with a greasy appearance, have also demonstrated improved diagnostic accuracy for metabolic syndrome. The RF algorithm employs random samples and variables to generate decision trees, where the final prediction of its model is based on the category predicted by these decision trees [61]. In our study, we established an RF prediction model that exhibited favorable performance as well. This model could assist TCM practitioners in identifying individuals with type 2 diabetes at a heightened risk of developing diabetic peripheral neuropathy (DPN). Our study identified TCM-related characteristics that encompassed not only physicochemical indicators but also TCM symptoms and tongue features. These findings can effectively contribute to the development and implementation of TCM-based strategies aimed at preventing DPN in individuals diagnosed with type 2 diabetes.

Strengths and limitations

There are certain limitations to our study that should be acknowledged. Firstly, the generalizability of our findings to other cities or countries may be uncertain as the study population was recruited solely from a single center in Tianjin. Secondly, it is important to note that our study employed a cross-sectional design and further research is required to validate the reliability of the established prediction models. In future investigations, we aim to incorporate additional indicators in order to enhance the predictive accuracy of these models. Thirdly, due to insufficient data availability, significant risk factors such as low-density lipoprotein cholesterol (LDL-C) and glycosylated hemoglobin (HbA1c) were not included in our prediction model. Lastly, while our study focused on analyzing tongue features alone, we believe that incorporating data on specific tongue flora and integrating information from various dimensions will contribute towards developing more advanced prediction models for diabetic peripheral neuropathy (DPN) in future studies.

Data availability

The data used for the modelling in this study belongs to the National Key Research and Development Program of China and restrictions apply to the availability of these data.

References

  1. Donath MY, Shoelson SE. Type 2 diabetes as an inflammatory disease. Nat Rev Immunol. 2011;11:98–107. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nri2925.

    Article  CAS  PubMed  Google Scholar 

  2. Hosomi K, Saito M, Park J, Murakami H, Shibata N, Ando M, Nagatake T, Konishi K, Ohno H, Tanisawa K, et al. Oral administration of Blautia wexlerae ameliorates obesity and type 2 diabetes via metabolic remodeling of the gut microbiota. Nat Commun. 2022;13:4477. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41467-022-32015-7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Sinclair A, Saeedi P, Kaundal A, Karuranga S, Malanda B, Williams R. Diabetes and global ageing among 65–99-year-old adults: Findings from the International Diabetes Federation Diabetes Atlas, 9th edition. Diabetes Res Clin Pract. 2020; 162:108078. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.diabres.2020.108078

  4. Cole JB, Florez JC. Genetics of diabetes mellitus and diabetes complications. Nat Rev Nephrol. 2020;16:377–90. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41581-020-0278-5.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Diabetic neuropathy. Nat Rev Dis Primers. 2019; 5:42. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41572-019-0097-9

  6. Pop-Busui R, Boulton AJM, Feldman EL, Bril V, Freeman R, Malik RA, Sosenko JM, Ziegler D. Diabetic Neuropathy: A Position Statement by the American Diabetes Association. Diabetes Care. 2017;40:136–54. https://doiorg.publicaciones.saludcastillayleon.es/10.2337/dc16-2042.

    Article  CAS  PubMed  Google Scholar 

  7. Liu S, Zheng H, Zhu X, Mao F, Zhang S, Shi H, Li Y, Lu B. Neutrophil-to-lymphocyte ratio is associated with diabetic peripheral neuropathy in type 2 diabetes patients. Diabetes Res Clin Pract. 2017;130:90–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.diabres.2017.05.008.

    Article  PubMed  Google Scholar 

  8. Feldman EL, Nave K-A, Jensen TS, Bennett DLH. New Horizons in Diabetic Neuropathy: Mechanisms, Bioenergetics, and Pain. Neuron. 2017;93:1296–313. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuron.2017.02.005.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Callaghan BC, Cheng HT, Stables CL, Smith AL, Feldman EL. Diabetic neuropathy: clinical manifestations and current treatments. Lancet Neurol. 2012;11:521–34. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S1474-4422(12)70065-0.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Selvarajah D, Kar D, Khunti K, Davies MJ, Scott AR, Walker J, Tesfaye S. Diabetic peripheral neuropathy: advances in diagnosis and strategies for screening and early intervention. Lancet Diabetes Endocrinol. 2019;7:938–48. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S2213-8587(19)30081-6.

    Article  PubMed  Google Scholar 

  11. Dyck PJ, Albers JW, Andersen H, Arezzo JC, Biessels G-J, Bril V, Feldman EL, Litchy WJ, O’Brien PC, Russell JW, et al. Diabetic polyneuropathies: update on research definition, diagnostic criteria and estimation of severity. Diabetes Metab Res Rev. 2011;27:620–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/dmrr.1226.

    Article  PubMed  Google Scholar 

  12. TODAY Study Group. Risk Factors for Diabetic Peripheral Neuropathy in Adolescents and Young Adults With Type 2 Diabetes: Results From the TODAY Study. Diabetes Care. 2021; dc211074. https://doiorg.publicaciones.saludcastillayleon.es/10.2337/dc21-1074

  13. Jensen TS, Karlsson P, Gylfadottir SS, Andersen ST, Bennett DL, Tankisi H, Finnerup NB, Terkelsen AJ, Khan K, Themistocleous AC, et al. Painful and non-painful diabetic neuropathy, diagnostic challenges and implications for future management. Brain. 2021;144:1632–45. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/brain/awab079.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Bjerg L, Nicolaisen SK, Christensen DH, Nielsen JS, Andersen ST, Jørgensen ME, Jensen TS, Sandbæk A, Andersen H, Beck-Nielsen H, et al. Diabetic Polyneuropathy Early in Type 2 Diabetes Is Associated With Higher Incidence Rate of Cardiovascular Disease: Results From Two Danish Cohort Studies. Diabetes Care. 2021;44:1714–21. https://doiorg.publicaciones.saludcastillayleon.es/10.2337/dc21-0010.

    Article  PubMed  Google Scholar 

  15. England JD, Gronseth GS, Franklin G, Miller RG, Asbury AK, Carter GT, Cohen JA, Fisher MA, Howard JF, Kinsella LJ, et al. Distal symmetrical polyneuropathy: a definition for clinical research. A report of the American Academy of Neurology, the American Association of Electrodiagnostic Medicine, and the American Academy of Physical Medicine and Rehabilitation. Arch Phys Med Rehabil. 2005; 86:167–174. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.apmr.2004.09.011

  16. Carrington AL, Shaw JE, Van Schie CHM, Abbott CA, Vileikyte L, Boulton AJM. Can motor nerve conduction velocity predict foot problems in diabetic subjects over a 6-year outcome period? Diabetes Care. 2002;25:2010–5. https://doiorg.publicaciones.saludcastillayleon.es/10.2337/diacare.25.11.2010.

    Article  PubMed  Google Scholar 

  17. Cañizares-Carmenate Y, Díaz-Amador R, Mayra M, Tran T, Torrens F, Alberto J. Virtual screening of flavonoids from Jatropha gossypiifolia L. as potential drugs for diabetic complications. Tradit Med Res. 2022; 7:14. https://doiorg.publicaciones.saludcastillayleon.es/10.53388/TMR20220131261

  18. Dong J. The Relationship between Traditional Chinese Medicine and Modern Medicine. Evid Based Complement Alternat Med. 2013;2013: 153148. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2013/153148.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Balasubramaniyan S, Jeyakumar V, Nachimuthu DS. Panoramic tongue imaging and deep convolutional machine learning model for diabetes diagnosis in humans. Sci Rep. 2022;12:186. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-021-03879-4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Wang F, Liao S, Wen J, Wang H, Wang Y, Weng X, Wu R, Huang Y. Effects of pestle needle on nerve conduction velocity and inflammatory injury in patients with diabetic peripheral neuropathy. Tradit Med Res. 2022;7:17. https://doiorg.publicaciones.saludcastillayleon.es/10.53388/TMR20220119257.

    Article  Google Scholar 

  21. Tian Z, Fan Y, Sun X, Wang D, Guan Y, Zhang Y, Zhang Z, Guo J, Bu H, Wu Z, Wang H. Predictive value of TCM clinical index for diabetic peripheral neuropathy among the type 2 diabetes mellitus population: A new observation and insight. Heliyon. 2023;16: e17339. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.heliyon.2023.e17339.

    Article  Google Scholar 

  22. Teh K, Wilkinson ID, Heiberg-Gibbons F, Awadh M, Kelsall A, Pallai S, Sloan G, Tesfaye S, Selvarajah D. Somatosensory network functional connectivity differentiates clinical pain phenotypes in diabetic neuropathy. Diabetologia. 2021;64:1412–21. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00125-021-05416-4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Ji W, Xue M, Zhang Y, Yao H, Wang Y. A Machine Learning Based Framework to Identify and Classify Non-alcoholic Fatty Liver Disease in a Large-Scale Population. Front Public Health. 2022;10: 846118. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpubh.2022.846118.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Z J. Study on the characteristics of sub-health symptoms and TCM syndrome patterns distribution in 1 754 non-disease population. Chinese J Integrative Med. 2017; 37:934--938.

  25. Zhang J, Xu J, Hu X, Chen Q, Tu L, Huang J, Cui J. Diagnostic Method of Diabetes Based on Support Vector Machine and Tongue Images. Biomed Res Int. 2017;2017:7961494. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2017/7961494.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Sreejith S, Khanna Nehemiah H, Kannan A. Clinical data classification using an enhanced SMOTE and chaotic evolutionary feature selection. Comput Biol Med. 2020;126: 103991. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.compbiomed.2020.103991.

    Article  CAS  PubMed  Google Scholar 

  27. Zhou R, Yin W, Li W, Wang Y, Lu J, Li Z, Hu X. Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm. Comput Math Methods Med. 2022;2022:8498159. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2022/8498159.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to Machine Learning, Neural Networks, and Deep Learning. Transl Vis Sci Technol. 2020;9:14. https://doiorg.publicaciones.saludcastillayleon.es/10.1167/tvst.9.2.14.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Cox DR. The Regression Analysis of Binary Sequences. J Roy Stat Soc: Ser B (Methodol). 1958;20:215–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.2517-6161.1958.tb00292.x.

    Article  Google Scholar 

  30. Liu Q, Zhang M, He Y, Zhang L, Zou J, Yan Y, Guo Y. Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques. J Pers Med. 2022;12:905. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/jpm12060905.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Shen Y, Zhao E, Zhang W, Baccarelli AA, Gao F. Predicting pesticide dissipation half-life intervals in plants with machine learning models. J Hazard Mater. 2022;436: 129177. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jhazmat.2022.129177.

    Article  CAS  PubMed  Google Scholar 

  32. Jiao S, Zou Q. Identification of plant vacuole proteins by exploiting deep representation learning features. Comput Struct Biotechnol J. 2022;20:2921–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.csbj.2022.06.002.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Wang H, Khoshgoftaar TM, Gao K. A comparative study of filter-based feature ranking techniques. 2010 IEEE International Conference on Information Reuse & Integration. Las Vegas, NV, USA: IEEE (2010). p. 43–48 https://doiorg.publicaciones.saludcastillayleon.es/10.1109/IRI.2010.5558966

  34. Hussain L, Aziz W, Khan IR, Alkinani MH, Alowibdi JS. Machine learning based congestive heart failure detection using feature importance ranking of multimodal features. Math Biosci Eng. 2020;18:69–91. https://doiorg.publicaciones.saludcastillayleon.es/10.3934/mbe.2021004.

    Article  PubMed  Google Scholar 

  35. Taylor J, Tibshirani RJ. Statistical learning and selective inference. Proc Natl Acad Sci U S A. 2015;112:7629–34. https://doiorg.publicaciones.saludcastillayleon.es/10.1073/pnas.1507583112.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Machine Learning for Screening Microvascular Complications in Type 2 Diabetic Patients Using Demographic, Clinical, and Laboratory Profiles - PubMed. https://pubmed.ncbi.nlm.nih.gov/35207179/ [Accessed August 30, 2022]

  37. Dagliati A, Marini S, Sacchi L, Cogni G, Teliti M, Tibollo V, De Cata P, Chiovato L, Bellazzi R. Machine Learning Methods to Predict Diabetes Complications. J Diabetes Sci Technol. 2018;12:295–302. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/1932296817706375.

    Article  PubMed  Google Scholar 

  38. Wu B, Niu Z, Hu F. Study on Risk Factors of Peripheral Neuropathy in Type 2 Diabetes Mellitus and Establishment of Prediction Model. Diabetes Metab J. 2021;45:526–38. https://doiorg.publicaciones.saludcastillayleon.es/10.4093/dmj.2020.0100.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Nanda R, Nath A, Patel S, Mohapatra E. Machine learning algorithm to evaluate risk factors of diabetic foot ulcers and its severity. Med Biol Eng Comput. 2022;60:2349–57. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11517-022-02617-w.

    Article  PubMed  Google Scholar 

  40. Bej S, Sarkar J, Biswas S, Mitra P, Chakrabarti P, Wolkenhauer O. Identification and epidemiological characterization of Type-2 diabetes sub-population using an unsupervised machine learning approach. Nutr Diabetes. 2022;12:27. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41387-022-00206-2.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Allen A, Iqbal Z, Green-Saxena A, Hurtado M, Hoffman J, Mao Q, Das R. Prediction of diabetic kidney disease with machine learning algorithms, upon the initial diagnosis of type 2 diabetes mellitus. BMJ Open Diabetes Res Care. 2022;10: e002560. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmjdrc-2021-002560.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Liu J, Yuan X, Liu J, Yuan G, Sun Y, Zhang D, Qi X, Li H, Zhang J, Wen B, et al. Risk Factors for Diabetic Peripheral Neuropathy, Peripheral Artery Disease, and Foot Deformity Among the Population With Diabetes in Beijing, China: A Multicenter. Cross-Sectional Study Front Endocrinol (Lausanne). 2022;13: 824215. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fendo.2022.824215.

    Article  PubMed  Google Scholar 

  43. Wang W, Ji Q, Ran X, Li C, Kuang H, Yu X, Fang H, Yang J, Liu J, Xue Y, Feng B, Lei M, Zhu D. Prevalence and risk factors of diabetic peripheral neuropathy: A population-based cross-sectional study in China. Diabetes Metab Res Rev. 2023;39(8): e3702. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/dmrr.370.

    Article  CAS  PubMed  Google Scholar 

  44. Li C, Wang W, Ji Q, Ran X, Kuang H, Yu X, Fang H, Yang J, Liu J, Xue Y, Feng B, Lei M, Zhu D. Prevalence of painful diabetic peripheral neuropathy in type 2 diabetes mellitus and diabetic peripheral neuropathy: A nationwide cross-sectional study in mainland China. Diabetes Res Clin Pract. 2023;198: 110602. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.diabres.2023.110602.

    Article  CAS  PubMed  Google Scholar 

  45. Gin H, Baudoin R, Raffaitin CH, Rigalleau V, Gonzalez C. Non-invasive and quantitative assessment of sudomotor function for peripheral diabetic neuropathy evaluation. Diabetes Metab. 2011;37:527–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.diabet.2011.05.003.

    Article  CAS  PubMed  Google Scholar 

  46. Tentolouris N, Marinou K, Kokotis P, Karanti A, Diakoumopoulou E, Katsilambros N. Sudomotor dysfunction is associated with foot ulceration in diabetes. Diabet Med. 2009;26:302–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.1464-5491.2009.02677.x.

    Article  CAS  PubMed  Google Scholar 

  47. Ziegler D, Papanas N, Rathmann W, Heier M, Scheer M, Meisinger C, KORA Study Group. Evaluation of the Neuropad sudomotor function test as a screening tool for polyneuropathy in the elderly population with diabetes and pre-diabetes: the KORA F4 survey. Diabetes Metab Res Rev. 2012; 28:692–697. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/dmrr.2340

  48. Burgess J, Frank B, Marshall A, Khalil RS, Ponirakis G, Petropoulos IN, Cuthbertson DJ, Malik RA, Alam U. Early Detection of Diabetic Peripheral Neuropathy: A Focus on Small Nerve Fibres. Diagnostics (Basel). 2021;11:165. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/diagnostics11020165.

    Article  CAS  PubMed  Google Scholar 

  49. Nazir MA, AlGhamdi L, AlKadi M, AlBeajan N, AlRashoudi L, AlHussan M. The burden of Diabetes, Its Oral Complications and Their Prevention and Management. Open Access Maced J Med Sci. 2018;6:1545–53. https://doiorg.publicaciones.saludcastillayleon.es/10.3889/oamjms.2018.294.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Preshaw PM, Alba AL, Herrera D, Jepsen S, Konstantinidis A, Makrilakis K, Taylor R. Periodontitis and diabetes: a two-way relationship. Diabetologia. 2012;55:21–31. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00125-011-2342-y.

    Article  CAS  PubMed  Google Scholar 

  51. Luo H, Pan W, Sloan F, Feinglos M, Wu B. Forty-Year Trends in Tooth Loss Among American Adults With and Without Diabetes Mellitus: An Age-Period-Cohort Analysis. Prev Chronic Dis. 2015;12:E211. https://doiorg.publicaciones.saludcastillayleon.es/10.5888/pcd12.150309.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Vgontzas AN, Liao D, Pejovic S, Calhoun S, Karataraki M, Bixler EO. Insomnia with objective short sleep duration is associated with type 2 diabetes: A population-based study. Diabetes Care. 2009;32:1980–5. https://doiorg.publicaciones.saludcastillayleon.es/10.2337/dc09-0284.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Hein M, Lanquart J-P, Loas G, Hubain P, Linkowski P. Prevalence and risk factors of type 2 diabetes in insomnia sufferers: a study on 1311 individuals referred for sleep examinations. Sleep Med. 2018;46:37–45. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.sleep.2018.02.006.

    Article  PubMed  Google Scholar 

  54. Yuan S, Larsson SC. An atlas on risk factors for type 2 diabetes: a wide-angled Mendelian randomisation study. Diabetologia. 2020;63:2359–71. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00125-020-05253-x.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Tomooka K, Saito I, Furukawa S, Maruyama K, Eguchi E, Iso H, Tanigawa T. Yellow Tongue Coating is Associated With Diabetes Mellitus Among Japanese Non-smoking Men and Women: The Toon Health Study. J Epidemiol. 2018;28:287–91. https://doiorg.publicaciones.saludcastillayleon.es/10.2188/jea.JE20160169.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Li M, Su H, Xiang Y, Zhang J, Zhao J, Lin L. Features analysis on Traditional Chinese Medicine syndromes in patients with diabetic peripheral neuropathy. J Tradit Chin Med. 2013;33:637–41. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/s0254-6272(14)60034-0.

    Article  PubMed  Google Scholar 

  57. Morita A, Murakami A, Noguchi K, Watanabe Y, Nakaguchi T, Ochi S, Okudaira K, Hirasaki Y, Namiki T. Combination Image Analysis of Tongue Color and Sublingual Vein Improves the Diagnostic Accuracy of Oketsu (Blood Stasis) in Kampo Medicine. Front Med (Lausanne). 2021;8: 790542. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fmed.2021.790542.

    Article  PubMed  Google Scholar 

  58. Tongue diagnosis of traditional Chinese medicine for rheumatoid arthritis - PubMed. https://pubmed.ncbi.nlm.nih.gov/24311851/ [Accessed August 30, 2022]

  59. Hsu P-C, Huang Y-C, Chiang JY, Chang H-H, Liao P-Y, Lo L-C. The association between arterial stiffness and tongue manifestations of blood stasis in patients with type 2 diabetes. BMC Complement Altern Med. 2016;16:324. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12906-016-1308-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Xia S-J, Gao B-Z, Wang S-H, Guttery DS, Li C-D, Zhang Y-D. Modeling of diagnosis for metabolic syndrome by integrating symptoms into physiochemical indexes. Biomed Pharmacother. 2021;137: 111367. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.biopha.2021.111367.

    Article  CAS  PubMed  Google Scholar 

  61. Rigatti SJ. Random Forest J Insur Med. 2017;47:31–9. https://doiorg.publicaciones.saludcastillayleon.es/10.17849/insm-47-01-31-39.1.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

I would like to thank Dr. Xinhai Zhang for his careful guidance and suggestions in the process of writing the paper.

Funding

This project was supported by the National Key Research and Development Program of China (approval No.:2017YFC1703305),Natural Science Foundation of Hebei Province(approval No.:H2023209049),Scientific Research Project of Hebei Provincial Administration of Traditional Chinese Medicine (approval No.:2024355) and Key Research Projects of North China University of Science and Technology (approval No.:ZD-YG-202316, ZD-YG-202409).

Author information

Authors and Affiliations

Authors

Contributions

Conception, design, and manuscript reviewing: ZT, HW. Data collection and sorting: ZT, XS, DW, JZ and XL. Data analysis and manuscript drafting: ZT, YF,GL and HW. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to GuoHui Lu or Hongwu Wang.

Ethics declarations

Ethics approval and consent to participate

This study reporting adheres to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines and only anonymous data was used in the analysis. This study was approved by the institutional ethics committee of Tianjin University of TCM (No: TJUTCM-EC20190004), and all participants were required to sign an informed consent form before the study's initiation. We had obtained informed consent from all participants in this study.This study adhered to the tenets of the Declaration of Helsinki.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tian, Z., Zhang, J., Fan, Y. et al. Diabetic peripheral neuropathy detection of type 2 diabetes using machine learning from TCM features: a cross-sectional study. BMC Med Inform Decis Mak 25, 90 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12911-025-02932-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12911-025-02932-w

Keywords