Prediction of stroke events in patients with type 2 diabetes mellitus by interpretable machine learning based on perivascular adipose tissue features: a multicenter cohort study

Ting Zhao; Guihan Lin; Weiyue Chen; Chengli Jiang; Weiming Hu; Lei Xu; Yongjun Chen; Yang Jing; Jinhong Sun; Zhihan Yan; Shuiwei Xia; Chenying Lu; Minjiang Chen; Jiansong Ji; Weiqian Chen

doi:10.21037/qims-2025-1760

Original Article

Prediction of stroke events in patients with type 2 diabetes mellitus by interpretable machine learning based on perivascular adipose tissue features: a multicenter cohort study

Ting Zhao^1,2#, Guihan Lin^1,3#, Weiyue Chen^1,3 , Chengli Jiang^1,3, Weiming Hu^1,2,3, Lei Xu⁴, Yongjun Chen⁵, Yang Jing⁶, Jinhong Sun^1,2,3, Zhihan Yan⁴, Shuiwei Xia^1,3, Chenying Lu^1,3, Minjiang Chen^1,3 , Jiansong Ji^1,3 , Weiqian Chen^1,2,3

¹Zhejiang Key Laboratory of Imaging and Interventional Medicine, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China; ²Department of Vascular Surgery, Lishui Central Hospital, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China; ³Department of Radiology, Lishui Central Hospital, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China; ⁴Wenzhou Key Laboratory of Structural and Functional Imaging, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China; ⁵Department of Radiology, Lishui People’s Hospital, Lishui, China; ⁶Huiying Medical Technology Co., Ltd., Beijing, China

Contributions: (I) Conception and design: T Zhao, G Lin, Weiyue Chen; (II) Administrative support: C Lu, J Ji, M Chen, Weiqian Chen; (III) Provision of study materials or patients: Weiyue Chen, L Xu; (IV) Collection and assembly of data: T Zhao, G Lin, Y Jing, C Jiang; (V) Data analysis and interpretation: G Lin, Weiyue Chen, W Hu, S Xia; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

^#These authors contributed equally to this work.

Correspondence to: Jiansong Ji, MD, PhD. Zhejiang Key Laboratory of Imaging and Interventional Medicine, The Fifth Affiliated Hospital of Wenzhou Medical University, No. 289 Kuocang Road, Wanshang Street, Liandu District, Lishui 323000, China; Department of Radiology, Lishui Central Hospital, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China. Email: jjstcty@wmu.edu.cn; Weiqian Chen, MD, PhD. Zhejiang Key Laboratory of Imaging and Interventional Medicine, The Fifth Affiliated Hospital of Wenzhou Medical University, No. 289 Kuocang Road, Wanshang Street, Liandu District, Lishui 323000, China; Department of Vascular Surgery, Lishui Central Hospital, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China; Department of Radiology, Lishui Central Hospital, The Fifth Affiliated Hospital of Wenzhou Medical University, Lishui, China. Email: ls2119088@126.com.

Background: Stroke is one of the leading causes of mortality, and patients with type 2 diabetes mellitus (T2DM) have a higher incidence of stroke. However, research on the imaging characteristics of plaques and perivascular adipose tissue (PVAT) in this patient population remains limited. This study therefore aimed to develop and validate a machine learning-based combined model to predict acute stroke events in patients with T2DM and assess its utility in stratifying patients into different risk categories based on follow-up outcomes.

Methods: In this multicenter study, a total of 494 computed tomography angiography (CTA) datasets from patients with T2DM were retrospectively collected from The Fifth Affiliated Hospital of Wenzhou Medical University, The Second Affiliated Hospital of Wenzhou Medical University, and Lishui People’s Hospital and divided into four sets: training (n=193), internal testing (n=84), external validation 1 (n=105), and external validation 2 (n=102). Based on the magnetic resonance imaging findings, the patients were divided into a stroke group and a non-stroke group. PVAT features were extracted from CTA, and perivascular fat density (PFD) was determined. A combined model was developed by integrating radiomics scores with PFD and clinical factors via the extreme gradient boosting (XGBoost) algorithm. The model’s prediction process was illustrated with the SHapley Additive exPlanation (SHAP) method, and its prognostic value was evaluated with Kaplan-Meier analysis.

Results: In this study, 167 patients with T2DM (33.8%) who experienced ischemic stroke (IS) were classified into the stroke group, while 327 patients with T2DM (66.2%) were classified into the non-stroke group. Through application of variance thresholding, SelectKBest, and least absolute shrinkage and selection operator, seven radiomic features were ultimately selected from CTA images to construct the radiomics model. After univariate and multivariate logistic regression analysis, total cholesterol (P=0.033) and hypertension (P=0.028) were identified as independent risk factors for IS. The combined model demonstrated substantial accuracy and robustness, with an area under the receiver operating characteristic curve of 0.955, 0.847, 0.856, and 0.876 in the training, internal testing, external validation 1, and external validation 2 cohorts. SHAP analysis revealed that Exponential_glszm_SizeZoneNonUniformity and Wavelet-HLL_firstorder_Range were the most important features. Event-free survival (EFS) analysis demonstrated that the model could effectively determine patient prognosis. Results from univariate and multivariate Cox regression analyses identified the independent prognostic predictors of follow-up ischemic events to be stroke status [hazard ratio (HR) =3.916; 95% confidence interval (CI): 1.792–6.558; P<0.001] and predicted stroke status (HR =1.352; 95% CI: 1.317–4.777; P=0.030), indicating these factors are associated with the occurrence of ischemic cerebrovascular events during follow-up.

Conclusions: The combined XGBoost model incorporating PVAT features accurately predicted stroke events in patients with T2DM and provided risk stratification for patients.

Keywords: Type 2 diabetes mellitus (T2DM); ischemic stroke (IS); perivascular adipose tissue (PVAT); event-free survival (EFS); SHapley Additive exPlanation (SHAP)

Submitted Aug 14, 2025. Accepted for publication Dec 11, 2025. Published online Jan 21, 2026.

doi: 10.21037/qims-2025-1760

Introduction

Ischemic stroke (IS) is among the leading causes of death and disability worldwide (1). The annual incidence of IS is approximately 101.3 per 100,000 individuals, placing a significant economic burden on society and families while also causing considerable psychological stress (2). Approximately 20% of IS cases are caused by carotid atherosclerotic plaques (3). Type 2 diabetes mellitus (T2DM), a metabolic disorder primarily driven by insulin resistance, is a common condition strongly associated with cardiovascular and cerebrovascular complications (4-6). Studies have shown that compared to the general population, patients with T2DM are 2–5-fold more likely to experience an IS (7,8). Additionally, the mortality, disability, and recurrence rates after stroke are significantly higher in patients with T2DM than in those without it (9,10). Therefore, accurate assessment of the risk of carotid plaques in patients with T2DM is crucial for predicting the occurrence of IS and may facilitate the development of more effective management and prevention strategies.

The mechanisms linking carotid atherosclerosis to stroke in patients with T2DM are associated with various factors, including inflammation, age, obesity, smoking, dyslipidemia, and other stress markers (11,12). In recent years, an increasing abundance of research has identified inflammation as a key driver of atherosclerotic plaque progression and rupture (13,14). Inflammatory mediators within plaques can diffuse through the vascular wall into surrounding adipose tissue, promoting lipolysis and inhibiting the differentiation of preadipocytes, resulting in a higher water-to-lipid ratio in perivascular adipose tissue (PVAT) (15,16). Studies have shown that perivascular fat density (PFD) can be used to identify early subclinical carotid disease and vulnerable plaques, while the radiological parameters of PVAT can predict high-risk carotid plaques and differentiate patients at different stages of carotid atherosclerosis (17-19). However, in current clinical practice, the assessment of carotid atherosclerotic plaques is still largely limited to the measurement of the PFD around carotid plaques (20). This approach fails to fully capture the complex three-dimensional structure of adipose tissue and does not adequately reflect the finer spatial relationships between voxels (21,22). Therefore, identifying higher-dimensional PVAT features is crucial for assessing the risk of IS in patients with T2DM.

Radiomics-based machine learning (ML) is an emerging field that leverages imaging features to objectively and quantitatively identify lesion phenotypes (23). Initially developed for oncology research, this approach has recently been increasingly applied in studies examining cardiovascular and cerebrovascular events (24,25). ML holds significant potential for predicting adverse cardiovascular events, monitoring disease progression, and assessing prognosis (26,27). However, only a few studies have examined the application of PVAT in carotid plaques (28,29). Thus far, radiological models based on PVAT specifically designed for predicting IS risk in patients with T2DM are lacking.

Therefore, we conducted a multicenter study with the aim of developing and validating an ML-based model that integrates radiological features of PVAT with clinical and radiological data to construct an IS risk prediction model for patients with T2DM and evaluate its potential utility in stratifying patients’ risk levels during follow-up. This risk prediction model was designed to support effective personalized interventions and to reduce the risk of stroke onset through early intervention, ultimately alleviating the social and individual burden of stroke. We present this article in accordance with the TRIPOD+AI reporting checklist (available at https://qims.amegroups.com/article/view/10.21037/qims-2025-1760/rc).

Methods

Study population

This retrospective, multicenter study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments, and was approved by the Ethics Committee of The Fifth Affiliated Hospital of Wenzhou Medical University (No. 2025-136). The requirement for informed consent was waived due to the retrospective nature of the analysis. Computed tomography angiography (CTA) data related to carotid atherosclerosis were retrospectively collected for all patients with T2DM treated between January 2015 and January 2024 at The Fifth Affiliated Hospital of Wenzhou Medical University (Center 1), between January 2017 and January 2023 at The Second Affiliated Hospital of Wenzhou Medical University (Center 2), and between January 2016 and January 2024 at the Lishui People’s Hospital (Center 3). The inclusion criteria, exclusion criteria, and clinical data collected are detailed in Appendices 1,2. The patient selection process and the distribution of the training set and validation set are illustrated in Figure 1. The sample size estimation is described in Appendix 3.

Figure 1 Flowchart of patient selection. Center 1, The Fifth Affiliated Hospital of Wenzhou Medical University; Center 2, The Second Affiliated Hospital of Wenzhou Medical University; Center 3, Lishui People’s Hospital. CTA, computed tomography angiography; MRI, magnetic resonance imaging.

Two radiologists independently collected clinical and imaging data based on the inclusion and exclusion criteria and grouped patients according to their magnetic resonance imaging (MRI) findings. IS was defined according to imaging evidence of ischemia in the carotid artery territory and the presence of clinical symptoms lasting at least 24 hours. Patients with positive MRI findings (indicating acute or subacute IS) within 2 weeks before the head and neck CTA examination were assigned to the stroke group. Patients with negative brain MRI results were assigned to the non-stroke group. Ultimately, 494 patients were included, with 167 patients in the stroke group and 327 in the non-stroke group. Patients from Center 1 were randomly divided into an internal training set (n=193) and an internal validation set (n=84) in a 7:3 ratio, while those from Centers 2 and 3 were placed into external validation set 1 (n=105) and external validation set 2 (n=102), respectively.

Image acquisition, preprocessing, and PVAT segmentation

CTA examinations in Centers 1 and 2 were performed with a SOMATOM Force computed tomography (CT) scanner (Siemens Healthineers, Erlangen, Germany), while those in Center 3 were performed with an Aquilion One CT scanner (Toshiba Medical Systems, Otawara, Japan). Detailed information on the CTA acquisition parameters is provided in Table S1. The preprocessing of CTA images, delineation of the PVAT, and measurement of PFD are described in Appendices 4,5. The degree of stenosis was assessed via CTA according to the North American Symptomatic Carotid Endarterectomy Trial criteria (30).

Radiomics feature extraction and selection

A total of 1,688 radiomic features were extracted from the PVAT through the use of RadCloud version 7.1 (Huiying Medical Technology Co., Ltd., Beijing, China; http://radcloud.cn) and categorized into four groups (Appendix 6). Subsequently, all feature values were standardized before feature selection, and the intraclass correlation coefficient (ICC; Appendix 7) was used to evaluate the reproducibility of the features obtained from the PVAT segmentation. Next, in the internal training set, features were sequentially screened via variance thresholding, SelectKBest, and least absolute shrinkage and selection operator (LASSO) regression to identify radiomics features associated with stroke (Appendix 8). Finally, radiomics scores (Radscores) were calculated based on the features with nonzero coefficients in the LASSO model.

Model development, evaluation, and visualization

The clinical variables underwent univariate logistic regression (LR) analysis, and those with a P value of <0.05 were included in the multivariate LR analysis to identify independent clinical risk factors for developing the clinical model. For PFD, we directly developed an LR model due to the smaller number of input features. In order to identify the optimal ML model, six classifiers—k-nearest neighbors (KNN), support vector machine (SVM), LR, linear discriminant analysis (LDA), naive Bayes (NB), and extreme gradient boosting (XGBoost)—were employed in the training set by linearly combining features with nonzero coefficients, which were weighted by their respective coefficients. The optimal combinations of hyperparameters for each classifier in the internal training set are listed in Appendix 9.

The models’ performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy. The ML model that had the highest average AUC in the training set was selected as the optimal radiomics model, and the results were converted into a Radscore. Following this, the Radscore, clinical variables, and PFD were integrated to construct the combined model. Calibration curves were constructed to assess the agreement between observed outcomes and predicted risks, and decision curve analysis (DCA) was used to calculate the models’ net benefit. Additionally, the SHapley Additive exPlanation (SHAP) method was used to visually and analytically interpret the models’ prediction process (Appendix 10). The workflow of the radiomics methodology used in this study is illustrated in Figure 2.

Figure 2 Schematic representation of the radiomics analysis steps. AI, artificial intelligence; AUC, area under the receiver operating characteristic curve; CTA, computed tomography angiography; FAI, fat attenuation index; PFD, perivascular fat density; T2DM, type 2 diabetes mellitus; TC, total cholesterol; VOI, volume of interest.

Follow-up and survival analysis

After discharge, 494 patients with carotid atherosclerotic plaques were followed up through a review of medical records or telephone interviews with the patients or their relatives. The follow-up assessments preferentially used CTA as the imaging modality, with follow-ups conducted every 3–6 months. The median follow-up period was 25 months (range, 0.5–36 months). Follow-up outcomes were recorded as the time from discharge to the first occurrence of a new ischemic event (including transient ischemic attack or stroke). Kaplan-Meier curves were used to visualize differences in the follow-up event rates between groups. In the validation set, we performed Cox proportional hazards regression to assess the association of the model-predicted risk groups with follow-up events.

Statistical analysis

The data were statistically analyzed with R statistical software v4.3.1 (The R Foundation for Statistical Computing, Vienna, Austria) and Python v3.7.6 (Python Software Foundation, Wilmington, DE, USA). The normality of continuous variables was evaluated with the Kolmogorov-Smirnov test. Normally distributed continuous variables are reported as the mean ± standard deviation and were compared between groups via the independent samples t-test. Nonnormally distributed continuous variables are reported as the median and interquartile range and were compared with the Mann-Whitney test. Categorical variables are reported as frequencies and were compared between groups via the Chi-squared test. The DeLong test was used to compare the AUCs of the different ML models, and the corresponding P values were calculated. A P value of <0.05 was considered statistically significant. Univariate and multivariate analyses were conducted via the Cox proportional hazards regression model to identify predictors of stroke events during the follow-up period, with statistical significance set at P<0.05. Survival curves were plotted through the use of the Kaplan-Meier method and were compared with the log-rank test.

Results

Clinical model and PFD model construction

Based on the PFD surrounding the carotid plaque, a PFD model was established through LR, with its coefficients being used as weights. The patients’ clinical characteristics are summarized in Table 1 and did not differ significantly between the training and validation sets (Table S2). Univariate and multivariate LR analyses, hypertension, and total cholesterol were identified as independent risk factors and were subsequently used to develop the clinical model (Table 2).

Table 1

Comparison of clinical characteristics and laboratory findings between the stroke and non-stroke group

Characteristics	Training set (n=193)			Internal validation set (n=84)			External validation set 1 (n=105)			External validation set 2 (n=112)
Characteristics	Stroke (n=64)	Non-stroke (n=129)	P value	Stroke (n=31)	Non-stroke (n=53)	P value	Stroke (n=27)	Non-stroke (n=78)	P value	Stroke (n=45)	Non-stroke (n=67)	P value
Age (years)	71.50±8.91	70.36±8.81	0.477	71.42±7.39	70.55±9.34	0.777	71.74±7.97	69.59±9.50	0.401	70.00±11.67	68.39±9.19	0.371
BMI (kg/m²)	24.26±3.21	24.44±3.42	0.633	24.50±3.58	23.85±5.05	0.640	24.67±4.45	25.32±3.97	0.299	24.38±4.65	24.38±4.30	0.931
PFD (HU)	−50.74±19.53	−66.87±14.98	<0.001*	−56.48±21.96	−68.09±16.31	0.017*	−56.70±14.87	−64.87±13.01	0.009*	−58.61±19.66	−68.43±13.60	0.004*
Degree of stenosis (%)	69.53±14.30	65.84±10.29	0.393	66.90±11.96	62.96±6.36	0.040*	70.11±14.73	65.12±7.00	0.131	68.40±12.97	64.94±7.85	0.104
Gender			0.961			0.696			0.805			0.298
Female	31 (48.4)	62 (48.1)		16 (51.6)	25 (47.2)		17 (63.0)	47 (60.2)		24 (53.3)	29 (43.3)
Male	33 (51.6)	67 (51.9)		15 (48.4)	28 (52.8)		10 (37.0)	31 (39.8)		21 (46.7)	38 (56.7)
Comorbidities
Hypertension			0.004*			0.052			0.248			0.046*
Yes	29 (45.3)	32 (24.8)		14 (45.2)	13 (24.5)		12 (44.4)	25 (15.4)		18 (40.0)	15 (22.4)
No	35 (54.7)	97 (75.2)		17 (54.8)	40 (75.5)		15 (55.6)	53 (84.6)		27 (60.0)	52 (77.6)
Hyperlipidemia			0.041*			0.203			0.419			0.801
Yes	25 (35.9)	32 (24.8)		4 (12.9)	13 (24.5)		6 (22.2)	12 (14.1)		11 (24.4)	15 (22.4)
No	39 (64.1)	97 (75.2)		27 (87.1)	40 (75.5)		21 (77.8)	66 (85.9)		34 (75.6)	52 (77.6)
Smoking			0.552			0.670			0.233			0.710
Yes	12 (18.8)	29 (22.5)		8 (25.8)	16 (30.2)		9 (33.3)	17 (21.8)		17 (37.8)	23 (32.3)
No	52 (81.2)	100 (87.5)		23 (74.2)	37 (69.8)		19 (66.7)	61 (78.2)		28 (62.2)	44 (67.7)
Medication in use
Statin use	3 (4.7)	5 (3.9)	0.791	1 (3.2)	2 (3.8)	0.897	1(3.7)	3 (3.8)	0.974	2(4.4)	3 (4.5)	0.993
Antihypertension use	15 (23.4)	26 (20.2)	0.601	6 (19.4)	12 (22.6)	0.725	6 (22.2)	19 (24.4)	0.823	10 (22.2)	15 (22.4)	0.984
Aspirin use	8 (12.5)	14 (10.9)	0.735	4 (12.9)	6 (11.3)	0.930	3 (11.1)	7 (9.0)	0.746	6 (13.3)	7 (10.4)	0.642
Laboratory findings
WBC (×10¹²/L)	6.00 (4.80, 7.60)	6.20 (5.40, 7.65)	0.193	5.60 (4.90, 6.70)	6.20 (5.15, 7.60)	0.356	6.26 (5.49, 7.47)	6.11 (4.86, 7.83)	0.689	6.40 (5.40, 7.90)	6.00 (4.80, 7.50)	0.275
GLU (mmol/L)	7.77 (6.49, 8.85)	8.15 (7.10, 9.04)	0.131	8.00 (7.51, 9.05)	8.60 (6.84, 9.66)	0.389	7.87 (6.56, 9.50)	7.96 (7.11, 9.53)	0.391	7.81 (7.29, 8.97)	7.72 (7.36, 8.87)	0.753
TG (mmol/L)	1.51 (0.88, 2.44)	1.40 (1.02, 2.15)	0.937	1.27 (0.99, 2.06)	1.43 (0.98, 2.30)	0.633	1.55 (1.08, 2.02)	1.51 (1.06, 1.98)	0.781	1.11 (0.86, 2.23)	1.32 (0.93, 2.02)	0.585
TC (mmol/L)	4.78 (4.03, 5.40)	4.31 (3.63, 5.04)	0.017*	4.85 (4.20, 5.63)	4.46 (3.68, 5.11)	0.062	5.15 (4.36, 5.76)	4.41 (3.65, 5.53)	0.022*	4.68 (4.10, 5.29)	4.36 (4.03, 5.01)	0.084
HDL (mmol/L)	1.01 (0.91, 1.20)	1.01 (0.88, 1.21)	0.865	0.98 (0.86, 1.28)	0.99 (0.90, 1.20)	0.860	1.13 (0.87, 1.34)	1.00 (0.83, 1.39)	0.626	1.03 (0.86, 1.47)	1.05 (0.94, 1.33)	0.603
LDL (mmol/L)	2.05 (1.79, 2.89)	2.15 (1.66, 2.75)	0.675	2.46 (1.93, 3.12)	2.38 (1.94, 2.86)	0.501	2.20 (1.57, 3.51)	2.39 (1.60, 3.19)	0.750	2.22 (1.73, 2.92)	2.35 (1.87, 2.90)	0.559

Continuous values are expressed as mean ± standard deviation or median (interquartile range). Categorical values are expressed as n (%). *, data are means with a statistical difference. P value reflects the differences between stroke the and non-stroke groups in both the training and validation sets. BMI, body mass index; GLU, blood glucose; HDL, high-density lipoprotein; HU, Hounsfield units; LDL, low-density lipoprotein; PFD, perivascular fat density; TC, total cholesterol; TG, triglyceride; WBC, white blood cell count.

Table 2

Univariate and multivariate LR analyses of the clinical characteristics in the training set

Characteristics	Univariate LR		Multivariate LR
Characteristics	OR (95% CI)	P value	OR (95% CI)	P value
Age	1.015 (0.981, 1.050)	0.400
BMI	0.984 (0.900, 1.077)	0.730
PFD	1.059 (1.036, 1.082)	<0.001*	1.058 (1.034, 1.082)	<0.001*
Degree of stenosis	1.027 (1.001, 1.053)	0.044*
Gender	0.985 (0.541, 1.794)	0.961
Hypertension	2.512 (1.332, 4.734)	0.004*	2.305 (1.097, 4.844)	0.028*
Hyperlipidemia	1.943 (1.023, 3.691)	0.042*
Smoking	0.796 (0.375, 1.687)	0.551
Medication in use
Statin use	1.220 (0.282, 5.272)	0.790
Antihypertension use	1.213 (0.590, 2.493)	0.600
Aspirin use	1.173 (0.465, 2.961)	0.735
WBC	0.902 (0.762, 1.067)	0.228
GLU	0.991 (0.893, 1.099)	0.865
TG	1.000 (0.783, 1.276)	0.998
TC	1.469 (1.093, 1.975)	0.011*	1.455 (1.030, 2.057)	0.033*
HDL	0.809 (0.282, 2.318)	0.693
LDL	1.140 (0.791, 1.644)	0.482

*, data are means with a statistical difference. P value reflects the differences between the stroke and non-stroke groups. BMI, body mass index; CI, confidence interval; GLU, blood glucose; HDL, high-density lipoprotein; LDL, low-density lipoprotein; LR, logistic regression; OR, odds ratio; PFD, perivascular fat density; TC, total cholesterol; TG, triglyceride; WBC, white blood cell count.

Feature selection and radiomics model construction

The intra- and interobserver agreement for the radiomics features selected from the PVAT of the same lesion was satisfactory, with ICCs ranging from 0.876 to 0.982 and from 0.831 to 0.929, respectively. Therefore, all extracted radiomic features were retained for further analysis. In the training cohort, seven significantly correlated features were obtained through variance threshold, SelectKBest, and LASSO: three wavelet features, three first-order features, and one texture feature (Figure S1). Details regarding the PVAT features are provided in Table S3 and Figure S2. Pairwise correlations between the selected features were assessed with the Pearson correlation coefficient (r), and the results were visualized as a correlation matrix heatmap (Figure S3). Finally, all selected features were combined and used to construct the radiomics model.

Construction and evaluation of the ML classifiers

The predictive performance of the radiomics models developed with the six ML classifiers is summarized in Table 3 and Figure S4. The DeLong test was applied to the six ML classifiers (Figure S5). As shown in Figure S6, the comprehensive predictive metrics of the XGBoost classifier were superior to those of the other classifiers. Therefore, XGBoost was identified as the most suitable ML classifier for constructing the radiomics model, and the model outputs were used to calculate the Radscore. Figure S7 shows the distribution of Radscores in the stroke and non-stroke groups in each cohort based on the combined model. In all cohorts, the Radscores were significantly higher in the stroke group than in the non-stroke group (all P values <0.001).

Table 3

Diagnostic performance of the different fused ML models

Set	Model	AUC (95% CI)	F1 score (95% CI)	Sensitivity (95% CI)	Specificity (95% CI)	Accuracy (95% CI)
Training	KNN	0.832 (0.771, 0.882)	0.655 (0.479, 0.831)	0.500 (0.377, 0.623)	0.977 (0.895, 1.000)	0.819 (0.748, 0.890)
	SVM	0.883 (0.830, 0.925)	0.771 (0.605, 0.937)	0.734 (0.549, 0.919)	0.915 (0.827, 1.000)	0.855 (0.783, 0.927)
	LR	0.918 (0.870, 0.953)	0.803 (0.659, 0.947)	0.828 (0.666, 0.990)	0.884 (0.798, 0.970)	0.865 (0.794, 0.936)
	LDA	0.845 (0.787, 0.893)	0.703 (0.525, 0.881)	0.844 (0.730, 0.958)	0.721 (0.620, 0.822)	0.762 (0.680, 0.844)
	NB	0.713 (0.644, 0.776)	0.548 (0.372, 0.724)	0.484 (0.362, 0.606)	0.861 (0.769, 0.953)	0.736 (0.658, 0.814)
	XGB	0.927 (0.881, 0.959)	0.879 (0.738, 1.000)	0.906 (0.807, 1.000)	0.923 (0.842, 1.000)	0.917 (0.858, 0.976)
Internal validation	KNN	0.804 (0.703, 0.882)	0.456 (0.351, 0.561)	0.419 (0.241, 0.597)	0.755 (0.569, 0.941)	0.631 (0.524, 0.738)
	SVM	0.786 (0.683, 0.868)	0.709 (0.576, 0.842)	0.903 (0.790, 1.000)	0.623 (0.455, 0.791)	0.726 (0.621, 0.831)
	LR	0.755 (0.649, 0.843)	0.667 (0.554, 0.780)	0.871 (0.707, 1.000)	0.566 (0.389, 0.743)	0.679 (0.573, 0.785)
	LDA	0.727 (0.619, 0.819)	0.700 (0.582, 0.818)	0.939 (0.807, 1.000)	0.566 (0.389, 0.743)	0.704 (0.585, 0.823)
	NB	0.665 (0.553, 0.764)	0.632 (0.526, 0.738)	0.774 (0.610, 0.938)	0.604 (0.436, 0.772)	0.667 (0.556, 0.778)
	XGB	0.795 (0.693, 0.876)	0.677 (0.566, 0.788)	0.710 (0.586, 0.834)	0.774 (0.606, 0.942)	0.750 (0.644, 0.856)
External validation 1	KNN	0.788 (0.698, 0.862)	0.615 (0.519, 0.711)	0.593 (0.408, 0.778)	0.885 (0.785, 0.985)	0.810 (0.717, 0.903)
	SVM	0.795 (0.706, 0.868)	0.630 (0.535, 0.725)	0.630 (0.445, 0.815)	0.872 (0.785, 0.959)	0.810 (0.717, 0.903)
	LR	0.753 (0.659, 0.832)	0.602 (0.511, 0.693)	0.704 (0.519, 0.889)	0.782 (0.682, 0.882)	0.762 (0.670, 0.854)
	LDA	0.734 (0.639, 0.816)	0.543 (0.453, 0.633)	0.704 (0.519, 0.889)	0.692 (0.523, 0.861)	0.695 (0.603, 0.787)
	NB	0.690 (0.593, 0.777)	0.520 (0.429, 0.611)	0.963 (0.837, 1.000)	0.397 (0.263, 0.531)	0.543 (0.452, 0.634)
	XGB	0.826 (0.739, 0.893)	0.645 (0.552, 0.738)	0.815 (0.634, 0.996)	0.756 (0.588, 0.924)	0.771 (0.678, 0.864)
External validation 2	KNN	0.745 (0.654, 0.823)	0.698 (0.609, 0.787)	0.822 (0.632, 1.000)	0.642 (0.473, 0.811)	0.714 (0.622, 0.806)
	SVM	0.802 (0.716, 0.871)	0.769 (0.678, 0.860)	0.778 (0.588, 0.968)	0.836 (0.735, 0.937)	0.813 (0.721, 0.905)
	LR	0.848 (0.768, 0.909)	0.758 (0.667, 0.849)	0.867 (0.677, 1.000)	0.716 (0.558, 0.874)	0.777 (0.685, 0.869)
	LDA	0.783 (0.695, 0.855)	0.701 (0.610, 0.792)	0.756 (0.566, 0.946)	0.731 (0.569, 0.893)	0.741 (0.650, 0.832)
	NB	0.676 (0.581, 0.762)	0.633 (0.543, 0.723)	0.822 (0.632, 1.000)	0.478 (0.307, 0.649)	0.616 (0.527, 0.705)
	XGB	0.840 (0.759, 0.902)	0.759 (0.668, 0.850)	0.667 (0.477, 0.857)	0.940 (0.835, 1.045)	0.830 (0.738, 0.922)

AUC, area under the receiver operating characteristic curve; CI, confidence interval; KNN, k-nearest neighbors; LDA, linear discriminant analysis; LR, logistic regression; ML, machine learning; NB, naive Bayes; SVM, support vector machine; XGBoost, extreme gradient boosting.

Combined model evaluation

Table 4 presents the AUC, accuracy, sensitivity, and specificity of the different models for predicting IS in patients with T2DM. Figure 3A provides a comparison of the prediction performance of the combined model and the clinical, PFD, and radiomics models, showing that the combined model exhibited superior predictive performance. The combined model demonstrated superior accuracy and robustness across all cohorts (Figure 3B). Although the combined model performed slightly better than did the radiomics model, the difference was not significant (detailed results from the DeLong test are shown in Figure S8).

Table 4

Diagnostic performance of the clinical, PFD, radiomics, and combined models

Set	Model	AUC (95% CI)	F1 score (95% CI)	Sensitivity (95% CI)	Specificity (95% CI)	Accuracy (95% CI)
Training	Clinical	0.664 (0.593, 0.731)	0.578 (0.510, 0.646)	0.781 (0.634, 0.928)	0.543 (0.459, 0.627)	0.622 (0.550, 0.694)
	PFD	0.730 (0.661, 0.791)	0.611 (0.544, 0.678)	0.578 (0.431, 0.725)	0.845 (0.763, 0.927)	0.756 (0.684, 0.828)
	Radiomics	0.927 (0.881, 0.959)	0.879 (0.813, 0.945)	0.906 (0.686, 1.000)	0.923 (0.843, 1.000)	0.917 (0.853, 0.981)
	Combined	0.955 (0.916, 0.980)	0.876 (0.811, 0.941)	0.938 (0.775, 1.000)	0.899 (0.820, 0.978)	0.912 (0.847, 0.977)
Internal validation	Clinical	0.622 (0.510, 0.726)	0.449 (0.335, 0.563)	0.419 (0.251, 0.587)	0.355 (0.201, 0.509)	0.868 (0.774, 0.962)
	PFD	0.656 (0.544, 0.756)	0.583 (0.471, 0.695)	0.903 (0.605, 1.000)	0.677 (0.502, 0.852)	0.623 (0.519, 0.727)
	Radiomics	0.795 (0.693, 0.876)	0.678 (0.570, 0.786)	0.871 (0.593, 1.000)	0.710 (0.541, 0.879)	0.774 (0.670, 0.878)
	Combined	0.847 (0.752, 0.916)	0.750 (0.644, 0.856)	0.939 (0.602, 1.000)	0.677 (0.502, 0.852)	0.925 (0.826, 1.000)
External validation 1	Clinical	0.648 (0.549, 0.739)	0.468 (0.374, 0.562)	0.436 (0.255, 0.617)	0.852 (0.752, 0.952)	0.745 (0.652, 0.838)
	PFD	0.668 (0.569, 0.757)	0.475 (0.382, 0.568)	0.519 (0.339, 0.699)	0.769 (0.667, 0.871)	0.705 (0.613, 0.797)
	Radiomics	0.826 (0.739, 0.893)	0.647 (0.554, 0.740)	0.815 (0.621, 1.000)	0.756 (0.642, 0.870)	0.771 (0.673, 0.869)
	Combined	0.856 (0.774, 0.917)	0.737 (0.644, 0.830)	0.778 (0.598, 0.958)	0.885 (0.797, 0.973)	0.857 (0.765, 0.949)
External validation 2	Clinical	0.653 (0.557, 0.741)	0.617 (0.525, 0.709)	0.733 (0.549, 0.917)	0.552 (0.401, 0.703)	0.625 (0.531, 0.719)
	PFD	0.663 (0.567, 0.749)	0.591 (0.501, 0.681)	0.578 (0.391, 0.765)	0.746 (0.619, 0.873)	0.679 (0.586, 0.772)
	Radiomics	0.840 (0.759, 0.902)	0.760 (0.669, 0.851)	0.667 (0.481, 0.853)	0.940 (0.842, 1.000)	0.830 (0.733, 0.927)
	Combined	0.876 (0.800, 0.930)	0.785 (0.695, 0.875)	0.689 (0.501, 0.877)	0.955 (0.856, 1.000)	0.848 (0.758, 0.938)

AUC, area under the receiver operating characteristic curve; CI, confidence interval; PFD, perivascular fat density.

Figure 3 Performance evaluation of different models in the four sets. (A) AUC curves for the clinical, PFD, radiomics, and combined models. (B) Radar chart of the predictive performance of the clinical, PFD, radiomics, and combined models in the four cohorts. (C) Calibration curves for the clinical, PFD, radiomics, and combined models in the four cohorts. (D) DCA of the clinical, PFD, radiomics, and combined models in the four cohorts. AUC, area under the receiver operating characteristic curve; DCA, decision curve analysis; PFD, perivascular fat density.

There was a high degree of agreement between the predicted and actual outcomes across sets for the combined model according to the calibration curves, and there was good consistency between clinical observations and model predictions (Figure 3C). The DCA showed that the combined model has a significant advantage in clinical decision-making compared to the other models (Figure 3D).

SHAP analysis of the combined XGBoost model

The SHAP bar chart in Figure 4A illustrates the magnitude of each feature’s contribution to the model’s predictive performance. The SHAP summary plot in Figure 4B provides an intuitive visualization of each feature’s impact on the model’s predictions, with positive contributions shown in purple and negative contributions shown in yellow. For example, the purple dots with higher feature values for Exponential_glszm_SizeZoneNonUniformity indicate that the variation in the size zones of grayscale areas in the image is more uneven, suggesting that heterogeneous tissue characteristics may be associated with an increased risk of IS. In contrast, the yellow dots with lower feature values imply a reduced risk of IS. These low-value features decrease the forward prediction in the model, as reflected by the negative SHAP values on the X-axis, indicating a prediction of non-stroke.

Figure 4 SHAP visualization of the overall model. (A) A bar chart of the importance of all features in the model based on SHAP values. (B) A SHAP summary plot showing each feature’s positive (purple) and negative (yellow) contributions to the predicted probability. (C) A SHAP heatmap showing the direction and magnitude of the impact of each feature across all cases in the model. (D) A SHAP decision plot showing each important feature’s effect on the final predicted probability for a single case. PFD, perivascular fat density; SHAP, SHapley Additive exPlanation; TC, total cholesterol.

The next feature, Wavelet-HLL_firstorder_Range, represents the degree of intensity variation within the image. Higher range values typically indicate greater differences in pixel intensity, providing additional insights into the correlation with disease states. When the value of this feature increases, the SHAP value is positive, indicating that higher values of this feature contribute positively to the model’s prediction, potentially because greater variability suggests an unstable condition within the vascular structure.

Next, Wavelet-LLH_glszm_SizeZoneNonUniformity was found to be similar to the first feature, representing the nonuniformity of size zones. Higher values may indicate a more complex spatial arrangement of gray-level regions, which could be related to abnormal tissue characteristics. Lower negative SHAP values reinforce the notion that more uniform PVAT tissue features may protect against IS events.

The SHAP heatmap in Figure 4C further demonstrates the direction and magnitude of each feature’s influence across all cases within the model. The SHAP decision plot in Figure 4D depicts how each feature contributes to the final predicted probability for individual cases. Figure 5 presents two specific examples to better illustrate the interpretability of the model. These SHAP visualizations highlight the explainability of the model, emphasizing the importance of radiomics features in predicting symptomatic and asymptomatic carotid plaques while offering a transparent interpretation of the model’s decision-making process. These features collectively support the importance of texture and intensity variations in imaging as predictive markers for stroke risk, with their interactions providing valuable insights into underlying pathological mechanisms.

Figure 5 SHAP visualization of a single model. (A) Patient A is an example of a correctly predicted stroke case, and (B) patient B is an example of a correctly predicted non-stroke case. FAI, fat attenuation index; HU, Hounsfield units; PFD, perivascular fat density; SHAP, SHapley Additive exPlanation; TC, total cholesterol.

Follow-up and survival analysis

We further analyzed the patients’ follow-up data after discharge. During this period, the incidence of ischemic events in the non-stroke group was 10.8%, 13.2%, 11.5%, and 8.9% in the training, internal testing, external validation 1, and external validation 2 sets, respectively, while the incidence rates of ischemic events in the stroke group were 31.2%, 32.2%, 33.3%, and 28.8%, respectively. Event-free survival (EFS) differed significantly between the stroke and non-stroke groups in both the training and validation cohorts (Figure 6A-6D). The patients were divided into two groups based on the combined model: the predicted non-stroke group (XGBoost score <0.837) and the predicted stroke group (XGBoost score ≥0.837); this threshold was determined according to the maximization of the Youden index. Kaplan-Meier analysis indicated that the combined model effectively stratified patients into groups with significantly different follow-up event rates, showing close agreement with observed outcomes. The multivariate Cox regression analysis identified the independent predictors of follow-up ischemic events to be stroke status [hazard ratio (HR) =3.916; 95% confidence interval (CI): 1.792–6.558; P<0.001] and predicted stroke status (HR =1.352; 95% CI: 1.317–4.777; P=0.030) (Table S4).

Figure 6 The Kaplan-Meier survival curves based on patient grouping according to stroke occurrence and patient grouping predicted by the nomogram in the (A) training set, (B) internal validation set, (C) external validation set 1, and (D) external validation set 2. CI, confidence interval; HR, hazard ratio.

Discussion

This study developed and validated a model based on PVAT features derived from CTA images to predict acute stroke events in patients with T2DM. The integration of clinical and radiological features further enhanced the model. A comparative analysis was conducted on six commonly used ML algorithms, and their predictive performance was evaluated across multiple dimensions. Ultimately, XGBoost was selected as the optimal model, and a combined model incorporating XGBoost with independent risk factors was constructed. This model demonstrated excellent predictive performance, achieving an AUC of 0.847, 0.856, and 0.876 in the internal validation, external validation 1, and external validation 2 sets, respectively. The XGBoost-based combined model offered a robust and interpretable method for IS risk assessment and prognostic stratification in patients with T2DM, potentially aiding in personalized clinical decision-making and management.

This study integrated and extracted various types of radiomics features, and ultimately, seven key features were selected via LASSO regression. Among these, wavelet features were the most prominent, followed by the first-order features. This finding aligns with several studies that have identified the substantial prognostic value of wavelet features, emphasizing their critical role in constructing robust radiomics models (22,31). In addition, first-order features, which characterize the distribution of gray-level values in the region of interest, play a vital role in distinguishing and classifying lesions (32). These features are analogous to the mean voxel intensity of PVAT, reflecting changes in the water-lipid balance within PVAT (33). Therefore, these features can serve as effective imaging biomarkers for predicting symptomatic carotid plaques, highlighting the potential of radiomics in enhancing our understanding of the microstructural changes in PVAT and improving risk stratification in clinical practice.

Six ML models were developed based on the aforementioned radiomics features. Among them, the XGBoost classifier demonstrated outstanding performance and was selected as the best classifier. Its superiority may be related to its robust ability to handle nonlinear features and effectively deal with missing values. In addition, through the adoption of ensemble learning, it can construct multiple weak classifiers and combine them into a strong classifier, which significantly improves the model’s prediction accuracy and stability. Moreover, its built-in feature importance assessment function allows the model to excel in selecting the most relevant features, further enhancing predictive performance (34,35).

In addition, we established a PFD model, evaluated baseline clinical data, and identified hypertension and total cholesterol as independent predictors of IS in patients with T2DM. Previous research has established a strong association between hypertension and the occurrence of IS (36). Prolonged hypertension damages the vascular walls, accelerates the process of atherosclerosis, and makes blood vessels more vulnerable and prone to rupture, which can impair blood flow, potentially leading to IS (37,38). Similarly, studies such as those by Lewington et al. (39) have demonstrated that elevated total cholesterol levels increase the risk of stroke. Cholesterol deposits in the arterial walls form plaques, which can eventually obstruct blood flow and cause vascular blockages (40,41).

A clinical model was established based on these independent predictors. However, the PFD and clinical models performed suboptimally. In order to improve predictive accuracy, a combined model was developed that integrated the Radscore derived from the XGBoost classifier with independent clinical predictors and PFD. The XGBoost-based combined model demonstrated good predictive performance.

SHAP has become a widely used tool for visualizing each feature’s contribution to both overall and individual predictions (42). SHAP analysis in our study highlighted the importance and impact of each feature within the combined model. Case-specific analyses demonstrated the contribution of these features in certain scenarios. The final SHAP values can be used to calculate predictive probabilities, enabling more personalized and precise IS predictions.

In this study, a history of stroke was identified as an independent risk factor for EFS, which is consistent with previous research findings (43). Moreover, stroke status predicted by the XGBoost-based combined model was confirmed to be an independent risk factor for IS. The differences observed in the EFS curves stratified by XGBoost scores closely mirrored those based on the history of stroke, indicating that the XGBoost-based combined model can effectively guide the stratification of patients into different ischemic risk categories in T2DM. Therefore, this model could play a crucial role in assessing personalized recurrence risk and providing valuable insights for clinicians in developing appropriate and effective treatment strategies.

This study was the first to combine the radiological characteristics of PVAT with clinical risk factors to predict IS risk in patients with T2DM. Our proposed model, using PVAT features derived from CTA imaging, performs better than do previous ultrasound-based models (44) in identifying patients at risk of IS and can be used to support patient risk stratification based on predicted ischemic event likelihood. The XGBoost-based combined model, integrated with the SHAP method, can assist clinicians in evaluating the risk of individual IS symptoms. For patients with T2DM at risk of IS, this model offers guidance on whether more proactive preventive measures in patients at higher predicted risk are warranted. These patients may also require more frequent follow-up assessments. However, in real-world clinical settings, final treatment decisions must still be based on a comprehensive evaluation of each patient.

This study involved several limitations that should be acknowledged. First, given the retrospective design, it may be subject to selection bias. Moreover, only patients who could recover and be discharged were included, resulting in a relatively high proportion of negative cases, which may limit the clinical applicability of the developed model. Second, the data used were derived exclusively from Chinese patients, potentially restricting the generalizability of its findings to other geographic regions with different distributions of clinical characteristics. Third, our study focused on predicting the risk of initial IS in patients with T2Ds. The purpose of the follow-up event analysis was solely to evaluate the model’s ability to classify patients into different risk categories in a real-world timeline rather than to establish a prognostic or survival prediction model. Therefore, to assess the predictive ability of the model in terms of patient prognosis, future studies should be large-sample and prospective in design. Finally, the model incorporated the radiomics features of PVAT but not plaque-specific characteristics, which may reduce the model’s predictive performance.

Conclusions

This study developed and validated a novel predictive model based on PVAT radiomics features to predict acute stroke events in patients with T2DM. The purpose of this model is to identify symptomatic carotid plaques and stratify patients into distinct risk categories during follow-up. When applied in conjunction with SHAP, this model enhances personalized prediction by clarifying the contribution of individual features. Therefore, this model provides a foundation for multidimensional management and personalized risk assessment in patients with T2DM, potentially enabling the administration of preventive strategies at an earlier period among high-risk patients with T2DM.

Acknowledgments

None.

Footnote

Reporting Checklist: The authors have completed the TRIPOD+AI reporting checklist. Available at https://qims.amegroups.com/article/view/10.21037/qims-2025-1760/rc

Data Sharing Statement: Available at https://qims.amegroups.com/article/view/10.21037/qims-2025-1760/dss

Funding: This research was funded by the General Research Project of Zhejiang Provincial Education Department (No. Y202457302 to T.Z.) and the Medical Science and Technology Project of Zhejiang Province (No. 2025KY495 to W.H.).

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://qims.amegroups.com/article/view/10.21037/qims-2025-1760/coif). T.Z. reports a funding from the General Research Project of Zhejiang Provincial Education Department (No. Y202457302). W.H. reports a funding from the Medical Science and Technology Project of Zhejiang Province (No. 2025KY495). Y.J. is an employee of Huiying Medical Technology Co., Ltd. The other authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. This multicenter retrospective study was approved by the Ethics Committee of The Fifth Affiliated Hospital of Wenzhou Medical University (No. 2025-136), and the requirement of informed consent from patients was waived due to its retrospective design.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.

References

Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol 2021;20:795-820. [Crossref] [PubMed]
Saini V, Guada L, Yavagal DR. Global Epidemiology of Stroke and Access to Acute Ischemic Stroke Interventions. Neurology 2021;97:S6-S16. [Crossref] [PubMed]
Zhang S, Yu X, Gu H, Kang B, Guo N, Wang X. Identification of high-risk carotid plaque by using carotid perivascular fat density on computed tomography angiography. Eur J Radiol 2022;150:110269. [Crossref] [PubMed]
Glovaci D, Fan W, Wong ND. Epidemiology of Diabetes Mellitus and Cardiovascular Disease. Curr Cardiol Rep 2019;21:21. [Crossref] [PubMed]
Li W, Chen D, Tao Y, Lu Z, Wang D. Association between triglyceride-glucose index and carotid atherosclerosis detected by ultrasonography. Cardiovasc Diabetol 2022;21:137. [Crossref] [PubMed]
Sarwar N, Gao P, Seshasai SR, Gobin R, Kaptoge S, Di Angelantonio E, Ingelsson E, Lawlor DA, Selvin E, Stampfer M, Stehouwer CD, Lewington S, Pennells L, Thompson A, Sattar N, White IR, Ray KK, Danesh J. Diabetes mellitus, fasting blood glucose concentration, and risk of vascular disease: a collaborative meta-analysis of 102 prospective studies. Lancet 2010;375:2215-22. [Crossref] [PubMed]
O'Donnell MJ, Chin SL, Rangarajan S, Xavier D, Liu L, Zhang H, et al. Global and regional effects of potentially modifiable risk factors associated with acute stroke in 32 countries (INTERSTROKE): a case-control study. Lancet 2016;388:761-75. [Crossref] [PubMed]
Bragg F, Holmes MV, Iona A, Guo Y, Du H, Chen Y, Bian Z, Yang L, Herrington W, Bennett D, Turnbull I, Liu Y, Feng S, Chen J, Clarke R, Collins R, Peto R, Li L, Chen ZChina Kadoorie Biobank Collaborative Group. Association Between Diabetes and Cause-Specific Mortality in Rural and Urban Areas of China. JAMA 2017;317:280-9. [Crossref] [PubMed]
Stitziel NO, Kanter JE, Bornfeldt KE. Emerging Targets for Cardiovascular Disease Prevention in Diabetes. Trends Mol Med 2020;26:744-57. [Crossref] [PubMed]
Pan L, Zou H, Meng X, Li D, Li W, Chen X, Yang Y, Yu X. Predictive values of metabolic score for insulin resistance on risk of major adverse cardiovascular events and comparison with other insulin resistance indices among Chinese with and without diabetes mellitus: Results from the 4C cohort study. J Diabetes Investig 2023;14:961-72. [Crossref] [PubMed]
Ménégaut L, Laubriet A, Crespy V, Leleu D, Pilot T, Van Dongen K, de Barros JP, Gautier T, Petit JM, Thomas C, Nguyen M, Steinmetz E, Masson D. Inflammation and oxidative stress markers in type 2 diabetes patients with Advanced Carotid atherosclerosis. Cardiovasc Diabetol 2023;22:248. [Crossref] [PubMed]
Gyldenkerne C, Mortensen MB, Kahlert J, Thrane PG, Warnakula Olesen KK, Sørensen HT, Thomsen RW, Maeng M. 10-Year Cardiovascular Risk in Patients With Newly Diagnosed Type 2 Diabetes Mellitus. J Am Coll Cardiol 2023;82:1583-94. [Crossref] [PubMed]
Ridker PM, Everett BM, Thuren T, MacFadyen JG, Chang WH, Ballantyne C, et al. Antiinflammatory Therapy with Canakinumab for Atherosclerotic Disease. N Engl J Med 2017;377:1119-31. [Crossref] [PubMed]
Ridker PM, Libby P, MacFadyen JG, Thuren T, Ballantyne C, Fonseca F, Koenig W, Shimokawa H, Everett BM, Glynn RJ. Modulation of the interleukin-6 signalling pathway and incidence rates of atherosclerotic events and all-cause mortality: analyses from the Canakinumab Anti-Inflammatory Thrombosis Outcomes Study (CANTOS). Eur Heart J 2018;39:3499-507. [Crossref] [PubMed]
Antonopoulos AS, Sanna F, Sabharwal N, Thomas S, Oikonomou EK, Herdman L, et al. Detecting human coronary inflammation by imaging perivascular fat. Sci Transl Med 2017;9:eaal2658. [Crossref] [PubMed]
Oikonomou EK, Marwan M, Desai MY, Mancio J, Alashi A, Hutt Centeno E, et al. Non-invasive detection of coronary inflammation using computed tomography and prediction of residual cardiovascular risk (the CRISP CT study): a post-hoc analysis of prospective outcome data. Lancet 2018;392:929-39. [Crossref] [PubMed]
Zhang S, Gu H, Yu X, Kang B, Yuan X, Wang X. Association Between Carotid Artery Perivascular Fat Density and Intraplaque Hemorrhage. Front Cardiovasc Med 2021;8:735794. [Crossref] [PubMed]
Mao L, Chen L, Qu M, He X. Pericarotid Adipose Tissue is Associated with Circulatory Markers of Inflammation and Carotid Atherosclerosis. Angiology 2025;76:857-63. [Crossref] [PubMed]
Dai A, Yao W, Lei J, Yan L, Dang L, Zhao H, Gu J, Li J, Nie Y, Zheng M, Wang D, Wang Q. Relationship Between Pericarotid Fat Density and Pathology-Based Carotid Plaque Risk Characteristics. J Craniofac Surg 2024; Epub ahead of print. [Crossref]
Tang C, Wang QH, Cai J, Wang YM, Chen ZP, Fan LZ, Zheng L, Qiao T. Correlation analysis of carotid adipose tissue density and plaque stability in patients with carotid artery stenosis. Zhonghua Yi Xue Za Zhi 2024;104:3123-9. [Crossref] [PubMed]
Marcelin G, Silveira ALM, Martins LB, Ferreira AV, Clément K. Deciphering the cellular interplays underlying obesity-induced adipose tissue fibrosis. J Clin Invest 2019;129:4032-40. [Crossref] [PubMed]
Chen C, Tang W, Chen Y, Xu W, Yu N, Liu C, Li Z, Tang Z, Zhang X. Computed tomography angiography-based radiomics model to identify high-risk carotid plaques. Quant Imaging Med Surg 2023;13:6089-104. [Crossref] [PubMed]
Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016;278:563-77. [Crossref] [PubMed]
Kolossváry M, Karády J, Szilveszter B, Kitslaar P, Hoffmann U, Merkely B, Maurovich-Horvat P. Radiomic Features Are Superior to Conventional Quantitative Computed Tomographic Metrics to Identify Coronary Plaques With Napkin-Ring Sign. Circ Cardiovasc Imaging 2017;10:e006843. [Crossref] [PubMed]
Lin A, Kolossváry M, Yuvaraj J, Cadet S, McElhinney PA, Jiang C, Nerlekar N, Nicholls SJ, Slomka PJ, Maurovich-Horvat P, Wong DTL, Dey D. Myocardial Infarction Associates With a Distinct Pericoronary Adipose Tissue Radiomic Phenotype: A Prospective Case-Control Study. JACC Cardiovasc Imaging 2020;13:2371-83. [Crossref] [PubMed]
Dong X, Li N, Zhu C, Wang Y, Shi K, Pan H, Wang S, Shi Z, Geng Y, Wang W, Zhang T. Diagnosis of coronary artery disease in patients with type 2 diabetes mellitus based on computed tomography and pericoronary adipose tissue radiomics: a retrospective cross-sectional study. Cardiovasc Diabetol 2023;22:14. [Crossref] [PubMed]
Miao S, Yu F, Sheng R, Zhang X, Li Y, Qi Y, Lu S, Ji P, Fan J, Zhang X, Xu T, Wang Z, Liu Y, Yang G. Radiomics of pericoronary adipose tissue on computed tomography angiography predicts coronary heart disease in patients with type 2 diabetes mellitus. BMC Cardiovasc Disord 2024;24:300. [Crossref] [PubMed]
Yu S, Huo R, Qiao H, Ning Z, Xu H, Yang D, Shen R, Xu N, Han H, Chen S, Liu Y, Zhao X. Carotid artery perivascular adipose tissue on magnetic resonance imaging: a potential indicator for carotid vulnerable atherosclerotic plaque. Quant Imaging Med Surg 2023;13:7695-705. [Crossref] [PubMed]
Nie JY, Chen WX, Zhu Z, Zhang MY, Zheng YJ, Wu QD. Initial experience with radiomics of carotid perivascular adipose tissue in identifying symptomatic plaque. Front Neurol 2024;15:1340202. [Crossref] [PubMed]
Barnett HJM, Taylor DW, Haynes RB, Sackett DL, Peerless SJ, Ferguson GG, Fox AJ, Rankin RN, Hachinski VC, Wiebers DO, Eliasziw M. Beneficial effect of carotid endarterectomy in symptomatic patients with high-grade carotid stenosis. N Engl J Med 1991;325:445-53. [Crossref] [PubMed]
Oikonomou EK, Williams MC, Kotanidis CP, Desai MY, Marwan M, Antonopoulos AS, et al. A novel machine learning-derived radiotranscriptomic signature of perivascular fat improves cardiac risk prediction using coronary CT angiography. Eur Heart J 2019;40:3529-43. [Crossref] [PubMed]
Varriano G, Guerriero P, Santone A, Mercaldo F, Brunese L. Explainability of radiomics through formal methods. Comput Methods Programs Biomed 2022;220:106824. [Crossref] [PubMed]
Hu X, Chen J, Fu H, Chen Y, Fan D, Chen Y, Shen C. Association Between Carotid Artery Perivascular Fat Density and Embolic Stroke of Undetermined Source. Front Neurol 2021;12:765962. [Crossref] [PubMed]
Natekin A, Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot 2013;7:21. [Crossref] [PubMed]
Borstelmann SMMachine Learning Principles for Radiology Investigators. Acad Radiol 2020;27:13-25. [Crossref] [PubMed]
Said MA, Verweij N, van der Harst P. Associations of Combined Genetic and Lifestyle Risks With Incident Cardiovascular Disease and Diabetes in the UK Biobank Study. JAMA Cardiol 2018;3:693-702. [Crossref] [PubMed]
Flint AC, Conell C, Ren X, Banki NM, Chan SL, Rao VA, Melles RB, Bhatt DL. Effect of Systolic and Diastolic Blood Pressure on Cardiovascular Outcomes. N Engl J Med 2019;381:243-51. [Crossref] [PubMed]
Liu FD, Shen XL, Zhao R, Tao XX, Wang S, Zhou JJ, Zheng B, Zhang QT, Yao Q, Zhao Y, Zhang X, Wang XM, Liu HQ, Shu L, Liu JR. Pulse pressure as an independent predictor of stroke: a systematic review and a meta-analysis. Clin Res Cardiol 2016;105:677-86. [Crossref] [PubMed]
Lewington S, Whitlock G, Clarke R, Sherliker P, Emberson J, Halsey J, Qizilbash N, Peto R, Collins R. Blood cholesterol and vascular mortality by age, sex, and blood pressure: a meta-analysis of individual data from 61 prospective studies with 55,000 vascular deaths. Lancet 2007;370:1829-39. [Crossref] [PubMed]
Liu Z, Zhang L, Wang L, Li K, Fan F, Jia J, Li J, Zhang Y. The predictive value of cumulative atherogenic index of plasma (AIP) for cardiovascular outcomes: a prospective community-based cohort study. Cardiovasc Diabetol 2024;23:264. [Crossref] [PubMed]
Kim MK, Han K, Kim HS, Park YM, Kwon HS, Yoon KH, Lee SH. Cholesterol variability and the risk of mortality, myocardial infarction, and stroke: a nationwide population-based study. Eur Heart J 2017;38:3560-6. [Crossref] [PubMed]
Ma J, Bo Z, Zhao Z, Yang J, Yang Y, Li H, Yang Y, Wang J, Su Q, Wang J, Chen K, Yu Z, Wang Y, Chen G. Machine Learning to Predict the Response to Lenvatinib Combined with Transarterial Chemoembolization for Unresectable Hepatocellular Carcinoma. Cancers (Basel) 2023;15:625. [Crossref] [PubMed]
Li J, Han M, Chen Y, Wu B, Wu Y, Jia W, Liu J, Luo H, Yu P, Tu J, Kuang J, Yi Y. Prediction of recurrent ischaemic stroke using radiomics data and machine learning methods in patients with acute ischaemic stroke: protocol for a multicentre, large sample, prospective observational cohort study in China. BMJ Open 2023;13:e076406. [Crossref] [PubMed]
Liu Y, Kong Y, Yan Y, Hui P. Explore the value of carotid ultrasound radiomics nomogram in predicting ischemic stroke risk in patients with type 2 diabetes mellitus. Front Endocrinol (Lausanne) 2024;15:1357580. [Crossref] [PubMed]

Cite this article as: Zhao T, Lin G, Chen W, Jiang C, Hu W, Xu L, Chen Y, Jing Y, Sun J, Yan Z, Xia S, Lu C, Chen M, Ji J, Chen W. Prediction of stroke events in patients with type 2 diabetes mellitus by interpretable machine learning based on perivascular adipose tissue features: a multicenter cohort study. Quant Imaging Med Surg 2026;16(2):114. doi: 10.21037/qims-2025-1760

Prediction of stroke events in patients with type 2 diabetes mellitus by interpretable machine learning based on perivascular adipose tissue features: a multicenter cohort study

Introduction

Methods

Study population

Image acquisition, preprocessing, and PVAT segmentation

Radiomics feature extraction and selection

Model development, evaluation, and visualization

Follow-up and survival analysis

Statistical analysis

Results

Clinical model and PFD model construction

Table 1

Table 2

Feature selection and radiomics model construction

Construction and evaluation of the ML classifiers

Table 3

Combined model evaluation

Table 4

SHAP analysis of the combined XGBoost model

Follow-up and survival analysis

Discussion

Conclusions

Acknowledgments

Footnote

References

Article Options

Download Citation

Share