Prediction of lymphovascular invasion in invasive breast cancer via intratumoral and peritumoral multiparametric magnetic resonance imaging machine learning-based radiomics with Shapley additive explanations interpretability analysis
Introduction
Breast cancer is the predominant malignant tumor among women, exerting a significant burden on women’s health across the globe (1). Lymphovascular invasion (LVI), characterized by the presence of tumor cells within the lymphatic and vascular systems, is a critical biomarker for indicating breast cancer aggressiveness and poor prognosis (2,3). Research has consistently shown that LVI is strongly associated with increased tumor invasiveness, recurrence, involvement of axillary lymph nodes (ALNs), and distant metastasis (4,5). However, the current methods for assessing LVI are fraught with limitations. Traditional assessments primarily depend on postoperative histopathological examination, which fails to offer preoperative predictions of LVI status (6).
Magnetic resonance imaging (MRI) plays a critical role in diagnosing and evaluating breast cancer, particularly in predicting LVI (7). It provides critical information about the morphological features and function of the tumor, which is essential for the early detection and determination of the invasiveness of breast cancer (8,9). Previous research has established a strong correlation of MRI indicators such as peritumoral edema (PE) and the status of ALNs with the presence of LVI (10). However, limitations exist in the morphological assessment of conventional MRI. For instance, detecting minimal or subtle LVI manifestations can be challenging, especially in smaller tumors or those at early stages (11). As an innovative approach, radiomics can quantitatively analyze medical images and extract an array of image features (12,13). In recent years, radiomics has demonstrated considerable value in oncology—particularly in breast cancer for noninvasive prediction and classification tasks. Various studies have demonstrated that combining radiomics features with machine learning classifiers significantly improves diagnostic accuracy. For instance, Wu et al. (14) constructed a radiomics framework combining intratumoral, peritumoral, and habitat features extracted from dynamic contrast-enhanced MRI (DCE-MRI) and applied six different machine learning classifiers to predict ALN metastasis in patients with breast cancer. Similarly, Zhang et al. (15) built eight radiomics-based machine learning models using T2-weighted imaging (T2WI) and DCE-MRI to predict LVI in breast cancer.
Previous radiomics research into LVI in breast cancer has predominantly concentrated on the DCE-MRI (7,16,17). Multiparametric MRI, by integrating DCE-MRI with other sequences, offers a comprehensive evaluation of the biological dynamics of breast cancer tumors (6,12). Notably, peritumoral factors, including their impact on ALN status, have also demonstrated predictive value for neoadjuvant chemotherapy (18,19). However, there are relatively few studies that have combined intratumoral and peritumoral features in multiparametric MRI to predict LVI status (17). This highlights the need for further exploration to optimize LVI prediction and improve clinical decision-making.
Shapley additive explanation (SHAP) provides a robust framework for enhancing the interpretability of machine learning models, addressing the longstanding “black-box” problem associated with their clinical adoption (20,21). By offering both global and local explanations, SHAP enables the ranking of feature importance and the visualization of contributions through intuitive plots, facilitating a clearer interpretation of complex radiomics-based predictions. Recent studies have demonstrated the effectiveness of SHAP in improving the reliability and clinical applicability of predictive models, such as those for molecular subtype classification and ALN status assessment (18,22).
The objective of our study was thus to predict LVI status in patients with breast cancer by integrating intratumoral and peritumoral radiomics features derived from multiparametric MRI. Furthermore, SHAP was applied to the combined model incorporating both clinical and intra- and peritumoral radiomics features to generate global insights into feature importance and local explanations for individual predictions, thereby enabling a comprehensive interpretation of each feature’s contribution to LVI prediction. We present this article in accordance with the TRIPOD+AI reporting checklist (available at https://qims.amegroups.com/article/view/10.21037/qims-2024-2685/rc).
Methods
Patients and histopathology collection
The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments and approved by the Institutional Ethics Committee of Guangzhou First People’s Hospital (approval No. K-2023161-01) and The Third Affiliated Hospital of Sun Yat-Sen University (approval No. SL-RG2024-070-01). The requirement for informed consent was waived due to the retrospective nature of the analysis.
A total of 366 female patients with invasive breast cancer who underwent surgery from April 2019 to May 2023 in center 1 (Guangzhou First People’s Hospital) and from July 2019 to May 2020 in center 2 (The Third Affiliated Hospital of Sun Yat-Sen University) were included in the study. The inclusion criteria were as follows: (I) MRI examination performed prior to the biopsy procedure; (II) confirmed LVI status via postoperative pathology; (III) no prior treatment or surgery for breast cancer; and (IV) a time interval between MRI examination and surgery of <2 weeks. Meanwhile, the exclusion criteria were as follows: (I) obvious artifacts on magnetic resonance (MR) images; (II) incomplete clinical characteristics (such as absence of menstrual status and immunohistochemistry); and (III) neoadjuvant therapy before MRI. Patients from center 1 were randomly divided into training (n=208) and validation (n=70) sets in a 3:1 ratio, while all patients from center 2 were assigned to the test set (n=88) for evaluation of the model’s generalizability. Figure 1 provides a flowchart of patient enrollment.
Clinicopathological information was collected for the enrolled patients from electronic medical records and included age at initial diagnosis, menopausal status, LVI status, histologic type, estrogen receptor (ER) status, progesterone receptor (PR) status, hormonal receptor (HR) status, human epidermal growth factor receptor type 2 (HER2) status, and Ki-67 index. LVI was considered positive if cancer cells were present in lymphatic or small blood vessels external to the tumor. The threshold for ER and PR positivity was set at 1%, and that for the Ki-67 index was 20%. HR-positive was defined as ER-positive or PR-positive. HER2-negative tumors were defined as an immunohistochemical staining score of 0 or 1+, whereas HER2-positive tumors were identified by a score of 3+. Chromogenic in situ hybridization was used to confirm HER2 amplification in cases scored 2+.
MRI acquisition and evaluation
MRI scans were conducted with two 1.5-T MRI systems (center 1: uMR 560, United Imaging, Shanghai, China; center 2: Achieva, Philips Healthcare, Best, the Netherlands), employed through dedicated bilateral breast coils and with patients in the prone position. The MRI protocols included fat-suppressed T2WI, DCE, and diffusion-weighted imaging (DWI) with b-values of 0 and 800 s/mm2. For DCE-MRI, both centers administered a gadolinium-based contrast agent (Gd-DTPA, Magnevist; Bayer, Leverkusen, Germany) at a dosage of 0.2 mL/kg body weight and an infusion rate of 1.5 mL/s. Center 1 acquired six DCE-MRI phases (1 precontrast and 5 postcontrast), while center 2 acquired seven phases (1 precontrast and 6 postcontrast). The detailed acquisition parameters of the two centers are summarized in Table S1.
To delineate MRI morphological characteristics, two radiologists who were blinded to the clinicopathological data visually assessed the MR images following the Breast Imaging Reporting and Data System guidelines. They evaluated the following MRI features of tumors: lesion type, mass shape, margin, background parenchymal enhancement (BPE), PE, multifocal lesions, time-signal intensity curve (TIC) pattern, MRI-ALN status, and largest diameter (LD). PE was considered present when fat-suppressed T2 showed that the signal around the tumor was higher than the signal of the breast tissue and almost equal to the signal of water (23). MRI-ALN metastasis was considered present when lymph nodes exhibited a round or irregular shape, loss of fatty hilum, cortical thickening ≥3 mm, or a short-axis diameter of ≥10 mm (24-26). The contrast enhancement pattern was classified based on the TIC into three conventional types: persistent (type I), plateau (type II), and washout (type III).
MRI segmentation and radiomics feature extraction
The data flowchart for this study is presented in Figure 2. To enhance feature robustness across datasets, images underwent resampling to a voxel spacing of 1 mm × 1 mm × 1 mm, followed by z-score normalization and intensity discretization with a fixed bin width of 25. Regions of interest (ROIs) encompassing the entire tumor region were manually delineated slice by slice along the lesion boundary on transverse images from the peak phase of DCE-MRI, T2WI, and DWI for each patient by a junior radiologist using ITK-SNAP 4.0 software (http://www.itksnap.org). In cases of multicentric lesions, the largest tumor lesion was designated as the target lesion. A subset of 30 patients was randomly selected for a second delineation by the senior radiologist to assess the intraclass correlation coefficients (ICCs). After lesion delineation, ROIs of the peritumoral area (PA) 4 mm around the tumor were obtained automatically (27,28). The segmented regions were denoted as the tumor area (TA), PA, and tumor-plus-peritumoral area (TPA). Image preprocessing and feature extraction processes were executed via the open-source Pyradiomics software, which facilitated the extraction of various features from all ROIs, including first-order features, texture features, shape features, and wavelet features.
Feature selection
We developed a three-step procedure to select robust radiomic features. Initially, the ICC was calculated to assess the stability of the radiomic features, and features exhibiting an ICC less than 0.75 were excluded. Subsequently, Pearson correlation analysis was employed to remove redundant features under a threshold of r>0.90. Finally, the genetic algorithm (GA) was used to identify the optimal subset of features (details are presented in Appendix 1).
Construction and validation of the different models
The radiomics score (radscore) was defined as a linear combination of selected radiomic features weighted by their corresponding logistic regression coefficients after feature selection with a GA. The clinical model was constructed from the most significant clinical factors as identified through univariate and multivariate logistic regression analyses. The combined model was established through the incorporation of both the highest radiomics signature and clinical factors via multivariate logistic regression. Five models were developed for predicting LVI status based on logistic regression: the TA model, PA model, TPA model, clinical model, and combined model (Table S2).
Model interpretability with SHAP
The SHAP method was applied to rank feature importance and interpret the final habitat model via the computation of both global and individual Shapley values, thereby enhancing model transparency and supporting its clinical applicability. SHAP was used to interpret the final logistic regression model by quantifying the contribution of each feature. The logistic regression model was implemented via the logistic regression class from the scikit-learn library (penalty =“l2”, C =1.0, solver = “lbfgs”, maximum iterations =100). The SHAP analysis was conducted with the “shap” Python package (Python Software Foundation, Wilmington, DE, USA), and summary plots were generated to visualize global feature importance and individual explanations.
Sample size evaluation
Based on the following inputs and hypotheses, a sample size of at least 44 patients with breast cancer (13 LVI-positive patients and 31 LVI-negative patients) was deemed necessary for the study: alternative hypothesis for the area under the receiver operating characteristic (ROC) curve (AUC), 0.800; AUC for the null hypothesis, 0.500; power, 80%; two-sided significance threshold, 0.05; and allocation ratio of sample sizes in the negative and positive groups, 7:3. Therefore, under the assumption that the true AUC was >0.800, sample sizes of 208 in training, 70 in the validation, and 88 in the test set were considered adequate to detect an AUC difference of 0.500 with 80% power (29).
Statistical analysis
Statistical analyses were performed with SPSS version 26.0 (IBM Corp., Armonk, NY, USA) and R software version 4.4.1 (The R Foundation for Statistical Computing). Continuous variables were compared between the LVI-positive and LVI-negative groups via independent sample t-tests, while categorical variables were analyzed with the chi-squared test. Variables that reached statistical significance (P<0.05) in the univariate analysis were fitted into the multivariate logistic regression analysis to estimate the odds ratio (OR) and 95% confidence interval (CI) for risk factors of LVI via the enter method. ROC curves were constructed, with optimal cutoff values being derived from the maximum Youden index. Subsequently, the AUC was calculated to assess the performance of the models. The DeLong test was employed for statistical comparisons, and the calibration of the models was evaluated with calibration curves, which demonstrated the agreement between predicted probabilities and actual outcomes. To evaluate the clinical utility of the models, decision curve analysis (DCA) was applied, with the net benefits across a range of threshold probabilities being calculated. All tests were two-tailed, and a P value of less than 0.05 was considered statistically significant.
Results
Patient characteristics
This investigation analyzed data from 366 patients. From center 1, 208 patients were placed into a training set and 70 patients into a validation set; from center 2, 88 patients were placed into a test set (Table 1). The ICC for interobserver agreement for MRI characteristics ranged from 0.705 to 0.980 (Table S3). In the univariate logistic regression of the training set, the significantly different characteristics between LVI-negative and LVI-positive patients included LD, MRI-ALN status, and PE (all P values <0.05). According to multivariable logistic regression, the predictive variables for LVI were MRI-ALN status (OR =2.670; 95% CI: 1.384–5.153; P=0.003) and PE (OR =2.303; 95% CI: 1.246–4.257; P=0.008) (Table 2). The clinical model was constructed from MRI-ALN status and PE.
Table 1
| Characteristics | Training set (n=208) | Validation set (n=70) | Test set (n=88) |
|---|---|---|---|
| LVI status | |||
| Negative | 137 (65.8) | 48 (68.6) | 59 (67.0) |
| Positive | 71 (34.2) | 22 (31.4) | 29 (33.0) |
| Age (years) | 54.73±10.43 | 53.74±11.04 | 48.77±11.11 |
| Menstrual status | |||
| Premenopausal | 71 (34.1) | 27 (38.6) | 53 (60.2) |
| Postmenopausal | 137 (65.9) | 43 (61.4) | 35 (39.8) |
| ER status | |||
| Negative | 44 (21.2) | 16 (22.9) | 34 (38.6) |
| Positive | 164 (78.8) | 54 (77.1) | 54 (61.4) |
| PR status | |||
| Negative | 57 (27.4) | 22 (31.4) | 40 (45.5) |
| Positive | 151 (72.6) | 48 (68.6) | 48 (54.5) |
| HER2 status | |||
| Negative | 158 (76.0) | 50 (71.4) | 57 (64.8) |
| Positive | 50 (24.0) | 20 (28.6) | 31 (35.2) |
| Ki-67 status | |||
| Negative | 58 (27.9) | 17 (24.3) | 20 (22.7) |
| Positive | 150 (72.1) | 53 (75.7) | 68 (77.3) |
| Location | |||
| Left | 105 (50.5) | 29 (41.4) | 49 (55.7) |
| Right | 103 (49.5) | 41 (58.6) | 39 (44.3) |
| LD (mm) | 21.93±9.03 | 25.37±12.35 | 27.00±12.69 |
| TIC | |||
| Persistent | 17 (8.2) | 6 (8.6) | 6 (6.8) |
| Plateau | 170 (81.7) | 51 (72.8) | 70 (79.6) |
| Washout | 21 (10.1) | 13 (18.6) | 12 (13.6) |
| MRI-ALN status | |||
| Absent | 153 (73.6) | 47 (67.1) | 55 (62.5) |
| Present | 55 (26.4) | 23 (32.9) | 33 (37.5) |
| PE | |||
| Absent | 114 (54.8) | 36 (51.4) | 48 (54.5) |
| Present | 94 (45.2) | 34 (48.6) | 40 (45.5) |
| Multifocal or multicentric | |||
| Absent | 175 (84.1) | 59 (84.3) | 77 (87.5) |
| Present | 33 (15.9) | 11 (15.7) | 11 (12.5) |
| Shape | |||
| Oval/round | 48 (23.0) | 6 (8.5) | 11 (12.5) |
| Irregular | 160 (77.0) | 64 (91.5) | 77 (87.5) |
| Margin | |||
| Circumscribed | 36 (17.3) | 6 (8.6) | 11 (12.5) |
| Noncircumscribed | 172 (82.7) | 64 (91.4) | 77 (87.5) |
| Internal enhancement | |||
| Homogeneous | 11 (5.3) | 6 (8.6) | 9 (10.2) |
| Heterogeneous | 193 (92.8) | 64 (91.4) | 71 (80.7) |
| Rim | 4 (1.9) | 0 | 8 (9.1) |
| FGT | |||
| Heterogeneously dense or dense | 96 (46.2) | 38 (54.3) | 35 (39.8) |
| Predominantly fatty or scattered | 112 (53.8) | 32 (45.7) | 53 (60.2) |
Data are presented as mean ± standard deviation or n (%). ALN, axillary lymph node; ER, estrogen receptor; FGT, fibroglandular tissue; HER2, human epidermal growth factor receptor type 2; LD, largest diameter; LVI, lymphovascular invasion; MRI, magnetic resonance imaging; PE, peritumoral edema; PR, progesterone receptor; TIC, time-signal intensity curve.
Table 2
| Variants | Univariate | Multivariate | |||
|---|---|---|---|---|---|
| OR (95% CI) | P value | OR (95% CI) | P value | ||
| Age | 0.978 (0.951–1.005) | 0.115 | – | – | |
| Menstrual status | 0.930 (0.509–1.699) | 0.814 | – | – | |
| ER status | 1.103 (0.540–2.254) | 0.787 | – | – | |
| PR status | 1.142 (0.594–2.193) | 0.691 | – | – | |
| HER2 status | 1.104 (0.567–2.147) | 0.771 | – | – | |
| Ki-67 status | 1.882 (0.947–3.743) | 0.071 | – | – | |
| Location | 0.831 (0.468–1.476) | 0.528 | – | – | |
| LD (mm) | 1.033 (1.000–1.066) | 0.047 | 1.020 (0.986–1.055) | 0.246 | |
| TIC | 1.364 (0.694–2.681) | 0.368 | – | – | |
| MRI-ALN status | 2.653 (1.405–5.008) | 0.003 | 2.670 (1.384–5.153) | 0.003 | |
| Peritumoral edema | 2.367 (1.318–4.252) | 0.004 | 2.303 (1.246–4.257) | 0.008 | |
| Multifocal or multicentric | 1.771 (0.832–3.769) | 0.138 | – | – | |
| Shape | 1.531 (0.750–3.125) | 0.242 | – | – | |
| Margin | 2.349 (0.970–5.684) | 0.058 | – | – | |
| Internal enhancement | 0.620 (0.213–1.814) | 0.380 | – | – | |
| FGT | 0.787 (0.442–1.403) | 0.417 | – | – | |
ALN, axillary lymph node; CI, confidence interval; ER, estrogen receptor; FGT, fibroglandular tissue; HER2, human epidermal growth factor receptor type 2; LD, largest diameter; MRI, magnetic resonance imaging; OR, odds ratio; PE, peritumoral edema; PR, progesterone receptor; TIC, time-signal intensity curve.
Radiomics feature selection and model construction
From intratumoral regions, 1,130 radiomics features were extracted separately from the DCE, DWI, and T2WI sequences, with the same number of features extracted from the peritumoral region. Features with ICCs greater than 0.75 were initially selected, and then those with correlation coefficients exceeding 0.90 were removed, from each pair. GA was applied to further screen the radiomics features for final selection (Figure S1). The TPA model, composed of features from the intra- and peritumoral regions, yielded higher AUCs than did the TA or PA model in all datasets (Figure 3). The distributions of the TPA radscore for each patient in the training, validation, and test datasets are shown in Figure S2. Finally, the TPA radscore, MRI-ALN status, and PE were integrated to construct the combined model.
Comparison of the different models
According to the DeLong test, the combined model exhibited significantly better ability in predicting LVI status than did the clinical model in the training set (AUC: 0.888 vs. 0.665; P<0.001), validation set (AUC: 0.856 vs. 0.635; P=0.001), and test set (AUC: 0.853 vs. 0.686; P=0.001) (Table 3). Calibration curves showed good agreement between actual probabilities and the probabilities estimated by the models (Figure S3). The DCA indicated that the combined model could provide a higher net benefit than could the clinical model in predicting LVI status in all datasets (Figure 4).
Table 3
| Data set | Model | AUC (95% CI) | ACC | SEN | SPE | P value |
|---|---|---|---|---|---|---|
| Training set | TA | 0.812 (0.754–0.870) | 0.755 | 0.718 | 0.774 | 0.013 |
| PA | 0.830 (0.772–0.887) | 0.803 | 0.746 | 0.832 | 0.046 | |
| TPA | 0.878 (0.828–0.927) | 0.832 | 0.831 | 0.832 | 0.344 | |
| Clinical | 0.665 (0.591–0.739) | 0.591 | 0.775 | 0.496 | <0.001 | |
| Combined | 0.888 (0.841–0.934) | 0.822 | 0.845 | 0.810 | Ref. | |
| Validation set | TA | 0.752 (0.629–0.874) | 0.743 | 0.682 | 0.771 | 0.163 |
| PA | 0.757 (0.641–0.873) | 0.729 | 0.773 | 0.708 | 0.097 | |
| TPA | 0.825 (0.726–0.924) | 0.757 | 0.864 | 0.708 | 0.214 | |
| Clinical | 0.635 (0.497–0.774) | 0.671 | 0.500 | 0.750 | 0.001 | |
| Combined | 0.856 (0.769–0.943) | 0.757 | 0.909 | 0.688 | Ref. | |
| Test set | TA | 0.766 (0.664–0.867) | 0.659 | 0.897 | 0.542 | 0.192 |
| PA | 0.704 (0.583–0.824) | 0.693 | 0.759 | 0.661 | 0.051 | |
| TPA | 0.823 (0.726–0.921) | 0.795 | 0.862 | 0.763 | 0.203 | |
| Clinical | 0.686 (0.571–0.802) | 0.670 | 0.690 | 0.661 | 0.001 | |
| Combined | 0.853 (0.760–0.946) | 0.852 | 0.793 | 0.881 | Ref |
ACC, accuracy; AUC, area under the curve; CI, confidence interval; LVI, lymphovascular invasion; PA, peritumoral area; Ref., reference; SEN, sensitivity; SPE, specificity; TA, tumor area; TPA, tumor-plus-peritumoral area.
Model explanation
The SHAP method was employed to interpret the output of the combined model by quantifying each variable’s contribution to the prediction. As shown in the SHAP summary plots in Figure 5A,5B, the contributions of the features to the combined model were evaluated via the mean SHAP values and are displayed in descending order. In addition, local explanation was used to analyze how the model made predictions for individual cases.
Figure 6A,6B illustrate two patients with breast cancer with different LVI statuses postoperation. Furthermore, an interactive SHAP force plot of the training set is provided in Figure 6C (https://pretend58.github.io/LVI-SHAP-FORCE/LVI-shap.html). This plot enables dynamic exploration of individual feature contributions.
Discussion
The findings from this study indicate that the TPA constructed using multiparametric radiomics features from the combined intratumoral and peritumoral areas exhibited excellent performance across the training, validation, and test datasets. TPA features may have greater value in preoperative tumor evaluation and help inform the formulation of patient treatment strategies and clinical decisions. Furthermore, the combined model significantly outperformed the clinical model in predicting LVI.
Previous studies on LVI have primarily focused solely on evaluating LVI status using radiomics derived from DCE-MRI (16,30,31). However, multiparametric MRI radiomics offers a more comprehensive perspective on tumor biological behavior, enabling more precise assessment of LVI status (6,12). This approach enhances predictive capability by integrating a diversity sequence features. Features derived from multiparametric MRI leverage the strengths of each sequence, providing a richer and more detailed depiction of tumor characteristics (7,16). In addition, the development of LVI involves complex interactions among tumor cells and their surrounding microenvironment, including the degradation of the extracellular matrix, direct cell-to-cell contact, and the regulation of various cytokines and chemical signals (32,33). Previous MRI-based radiomics studies on breast cancer have suggested that PA with a 4-mm radial delineation distance from the tumor provides valuable information for diagnosing malignant lesions and predicting programmed cell death ligand-1 status (27,34). In line with this, our TPA model, which combines intratumoral and peritumoral radiomics features, outperformed the models based solely on TA or PA in predicting LVI status. Although no statistically significant difference was observed between the TPA model and the TA or PA models in a few of the datasets, the TPA model generally exhibited greater predictive ability, highlighting its potential advantage in comprehensive feature integration.
PE typically indicates lymphatic obstruction and alterations in local blood circulation, suggesting the potential for tumor invasion into or in close proximity to the lymphovascular system (35,36). Furthermore, the characteristics of ALNs are essential in assessing LVI (36). The diagnostic performance of the combined model, particularly when integrated with TPA radscore and clinical factors, consistently surpassed that of the clinical model across all datasets (P<0.05). Compared to previous studies (30,36,37), our study achieved an improvement in predicting LVI through use of intratumoral and peritumoral multiparametric MRI.
Given the challenge clinicians face in accepting prediction models that lack direct explainability and interpretability, the SHAP method was employed to elucidate the workings of this model (38). SHAP-derived force maps provided detailed, patient-specific assessments, enhancing transparency in model interpretation. Moreover, SHAP’s visualization capabilities improved the clarity and depth of feature impact analysis, including their interactive effects. This approach enhances understanding and facilitates the model’s practical utility for clinicians. The high SHAP contribution of the TPA radscore suggests that the combined intra- and peritumoral radiomic patterns may reflect tumor heterogeneity and microenvironmental interactions associated with LVI status.
This study involved several limitations which should be noted. First, we employed a retrospective design, and further larger-scale, multicenter prospective trials are needed to validate and reinforce these findings. Second, the use of a 4-mm peritumoral region in this study was based on findings reported in the majority of the related literature. However, we did not further examine or determine the optimal size of the peritumoral region for generating the ROI. Finally, the scope was limited to invasive breast cancer and did not include ductal carcinoma in situ (DCIS), which may restrict the generalizability of the results. Future studies should incorporate DCIS to enhance diagnostic accuracy.
Conclusions
The TPA model demonstrated superior performance as compared to the TA model and PA model. By integrating the TPA radscore with clinical factors, the combined model may serve as a biomarker for predicting LVI. The results suggest that the combined model could aid clinicians in identifying patients with LVI, thereby facilitating the development of personalized treatment plans.
Acknowledgments
None.
Footnote
Reporting Checklist: The authors have completed the TRIPOD+AI reporting checklist. Available at https://qims.amegroups.com/article/view/10.21037/qims-2024-2685/rc
Data Sharing Statement: Available at https://qims.amegroups.com/article/view/10.21037/qims-2024-2685/dss
Funding: This study was supported by (I)
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://qims.amegroups.com/article/view/10.21037/qims-2024-2685/coif). W.T. reports that this research was funded by the National Natural Science Foundation of China (No. 82302314), Basic and Applied Basic Research Foundation of Guangdong Province (No. 2022A1515110792) and Science and Technology Projects in Guangzhou (No. 2024A03J1030). Y.F. is currently an employee of Shukun Technology Co., Ltd. Q.K. reports that this research was funded by the Medical Scientific Research Foundation of Guangdong Province (No. A2023073). Y.G. reports that this research was funded by the National Natural Science Foundation of China (No. 81901711), Basic and Applied Basic Research Foundation of Guangdong Province (Nos. 2023A1515220097 and 2024A1515010653), Science and Technology Projects in Guangzhou (No. 2025A03J4163), Guangzhou First People’s Hospital Frontier Medical Technology Project (QY-C04), the Special Fund for the Construction of High-level Key Clinical Specialty (Medical Imaging) in Guangzhou, and Guangzhou Key Laboratory of Molecular Imaging and Clinical Translational Medicine. W.L. reports that this research was funded by the Science and Technology Projects in Guangzhou (No. 2025A03J4162). The other authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. This study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments and was approved by the Institutional Ethics Committee of Guangzhou First People’s Hospital (Approval No. K-2023161-01) and the Third Affiliated Hospital of Sun Yat-Sen University (Approval No. SL-RG2024-070-01). The requirement for informed consent was waived due to the retrospective nature of the analysis.
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, Jemal A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2024;74:229-63. [Crossref] [PubMed]
- Britto AV, Schenka AA, Moraes-Schenka NG, Alvarenga M, Shinzato JY, Vassallo J, Ward LS. Immunostaining with D2-40 improves evaluation of lymphovascular invasion, but may not predict sentinel lymph node status in early breast cancer. BMC Cancer 2009;9:109. [Crossref] [PubMed]
- Rakha EA, Martin S, Lee AH, Morgan D, Pharoah PD, Hodi Z, Macmillan D, Ellis IO. The prognostic significance of lymphovascular invasion in invasive breast carcinoma. Cancer 2012;118:3670-80. [Crossref] [PubMed]
- Curigliano G, Burstein HJ, Winer EP, Gnant M, Dubsky P, Loibl S, et al. De-escalating and escalating treatments for early-stage breast cancer: the St. Gallen International Expert Consensus Conference on the Primary Therapy of Early Breast Cancer 2017. Ann Oncol 2017;28:1700-12. [Crossref] [PubMed]
- Wang C, Wu Y, Shao J, Liu D, Li W. Clinicopathological variables influencing overall survival, recurrence and post-recurrence survival in resected stage I non-small-cell lung cancer. BMC Cancer 2020;20:150. [Crossref] [PubMed]
- Zhang J, Wang G, Ren J, Yang Z, Li D, Cui Y, Yang X. Multiparametric MRI-based radiomics nomogram for preoperative prediction of lymphovascular invasion and clinical outcomes in patients with breast invasive ductal carcinoma. Eur Radiol 2022;32:4079-89. [Crossref] [PubMed]
- Yang X, Fan X, Lin S, Zhou Y, Liu H, Wang X, Zuo Z, Zeng Y. Assessment of Lymphovascular Invasion in Breast Cancer Using a Combined MRI Morphological Features, Radiomics, and Deep Learning Approach Based on Dynamic Contrast-Enhanced MRI. J Magn Reson Imaging 2024;59:2238-49. [Crossref] [PubMed]
- Chen S, Sui Y, Ding S, Chen C, Liu C, Zhong Z, Liang Y, Kong Q, Tang W, Guo Y. A simple and convenient model combining multiparametric MRI and clinical features to predict tumour-infiltrating lymphocytes in breast cancer. Clin Radiol 2023;78:e1065-74. [Crossref] [PubMed]
- Chen S, Du Y, Hu Y, Ling R, Huang D, Xiang J, Liang Y, Wei X, Tang W, Guo Y. Preoperative MRI of breast squamous cell carcinoma: diagnostic value of distinguishing between two subtypes. Clin Radiol 2022;77:e321-8. [Crossref] [PubMed]
- Zhang C, Liang Z, Feng Y, Xiong Y, Manwa C, Zhou Q. Risk Factors for Lymphovascular Invasion in Invasive Ductal Carcinoma Based on Clinical and Preoperative Breast MRI Features: a Retrospective Study. Acad Radiol 2023;30:1620-7. [Crossref] [PubMed]
- Şendur HN, Şendur AB. MRI-Based Radiomics May Provide More In-depth Information Regarding Lymphovascular Invasion Status in Patients with Breast Cancer. Acad Radiol 2023;30:2710-1. [Crossref] [PubMed]
- Jiang Y, Zeng Y, Zuo Z, Yang X, Liu H, Zhou Y, Fan X. Leveraging multimodal MRI-based radiomics analysis with diverse machine learning models to evaluate lymphovascular invasion in clinically node-negative breast cancer. Heliyon 2024;10:e23916. [Crossref] [PubMed]
- Lv L, Zhang Z, Zhang D, Chen Q, Liu Y, Qiu Y, Fu W, Yin X, Chen X. Machine-learning radiomics to predict bone marrow metastasis of neuroblastoma using magnetic resonance imaging. Cancer Innov 2023;2:405-15. [Crossref] [PubMed]
- Wu PQ, Guo FL, Wang J, Gao Y, Feng ST, Chen SL, Ma J, Liu YB. Development and validation of a dynamic contrast-enhanced magnetic resonance imaging-based habitat and peritumoral radiomic model to predict axillary lymph node metastasis in patients with breast cancer: a retrospective study. Quant Imaging Med Surg 2024;14:8211-26. [Crossref] [PubMed]
- Zhang C, Zhong M, Liang Z, Zhou J, Wang K, Bu J. MRI-based radiomic and machine learning for prediction of lymphovascular invasion status in breast cancer. BMC Med Imaging 2024;24:322. [Crossref] [PubMed]
- Zheng H, Jian L, Li L, Liu W, Chen W. Delta-Radiomics Based on Dynamic Contrast-Enhanced MRI for Predicting Lymphovascular Invasion in Invasive Breast Cancer. Acad Radiol 2024;31:1762-72. [Crossref] [PubMed]
- Jiang W, Meng R, Cheng Y, Wang H, Han T, Qu N, Yu T, Hou Y, Xu S. Intra- and Peritumoral Based Radiomics for Assessment of Lymphovascular Invasion in Invasive Breast Cancer. J Magn Reson Imaging 2024;59:613-25. [Crossref] [PubMed]
- Liu Z, Hong M, Li X, Lin L, Tan X, Liu Y. Predicting axillary lymph node metastasis in breast cancer patients: A radiomics-based multicenter approach with interpretability analysis. Eur J Radiol 2024;176:111522. [Crossref] [PubMed]
- Zheng G, Peng J, Shu Z, Jin H, Han L, Yuan Z, Qin X, Hou J, He X, Gong X. Predicting pathological complete response to neoadjuvant chemotherapy in breast cancer patients: use of MRI radiomics data from three regions with multiple machine learning algorithms. J Cancer Res Clin Oncol 2024;150:147. [Crossref] [PubMed]
- Zhang R, Hong M, Cai H, Liang Y, Chen X, Liu Z, Wu M, Zhou C, Bao C, Wang H, Yang S, Hu Q. Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study. Quant Imaging Med Surg 2023;13:7828-41. [Crossref] [PubMed]
- Xue T, Zhu L, Tao Y, Ye X, Yu H. Development and validation of an interpretable delta radiomics-based model for predicting invasive ground-glass nodules in lung adenocarcinoma: a retrospective cohort study. Quant Imaging Med Surg 2024;14:4086-97. [Crossref] [PubMed]
- Huang Y, Wang X, Cao Y, Li M, Li L, Chen H, Tang S, Lan X, Jiang F, Zhang J. Multiparametric MRI model to predict molecular subtypes of breast cancer using Shapley additive explanations interpretability analysis. Diagn Interv Imaging 2024;105:191-205. [Crossref] [PubMed]
- Yu X, Xiang J, Zhang Q, Chen S, Tang W, Li X, Sui Y, Liu W, Kong Q, Guo Y. Triple-negative breast cancer: predictive model of early recurrence based on MRI features. Clin Radiol 2023;78:e798-807. [Crossref] [PubMed]
- Han L, Zhu Y, Liu Z, Yu T, He C, Jiang W, Kan Y, Dong D, Tian J, Luo Y. Radiomic nomogram for prediction of axillary lymph node metastasis in breast cancer. Eur Radiol 2019;29:3820-9. [Crossref] [PubMed]
- Zhao M, Wu Q, Guo L, Zhou L, Fu K. Magnetic resonance imaging features for predicting axillary lymph node metastasis in patients with breast cancer. Eur J Radiol 2020;129:109093. [Crossref] [PubMed]
- Xu Z, Ding Y, Zhao K, Han C, Shi Z, Cui Y, Liu C, Lin H, Pan X, Li P, Chen M, Wang H, Deng X, Liang C, Xie Y, Liu Z. MRI characteristics of breast edema for assessing axillary lymph node burden in early-stage breast cancer: a retrospective bicentric study. Eur Radiol 2022;32:8213-25. [Crossref] [PubMed]
- Wu Z, Lin Q, Wang H, Chen J, Wang G, Fu G, Li L, Bian T. Intratumoral and Peritumoral Radiomics Based on Preoperative MRI for Evaluation of Programmed Cell Death Ligand-1 Expression in Breast Cancer. J Magn Reson Imaging 2024;60:588-99. [Crossref] [PubMed]
- Xu H, Liu J, Chen Z, Wang C, Liu Y, Wang M, Zhou P, Luo H, Ren J. Intratumoral and peritumoral radiomics based on dynamic contrast-enhanced MRI for preoperative prediction of intraductal component in invasive breast cancer. Eur Radiol 2022;32:4845-56. [Crossref] [PubMed]
- Li X, Zhang N, Hu C, Lin Y, Li J, Li Z, et al. CT-based radiomics signature of visceral adipose tissue for prediction of disease progression in patients with Crohn's disease: A multicentre cohort study. EClinicalMedicine 2023;56:101805. [Crossref] [PubMed]
- Liu Z, Feng B, Li C, Chen Y, Chen Q, Li X, Guan J, Chen X, Cui E, Li R, Li Z, Long W. Preoperative prediction of lymphovascular invasion in invasive breast cancer with dynamic contrast-enhanced-MRI-based radiomics. J Magn Reson Imaging 2019;50:847-57. [Crossref] [PubMed]
- Lai T, Chen X, Yang Z, Huang R, Liao Y, Chen X, Dai Z. Quantitative parameters of dynamic contrast-enhanced magnetic resonance imaging to predict lymphovascular invasion and survival outcome in breast cancer. Cancer Imaging 2022;22:61. [Crossref] [PubMed]
- Kariri YA, Aleskandarany MA, Joseph C, Kurozumi S, Mohammed OJ, Toss MS, Green AR, Rakha EA. Molecular Complexity of Lymphovascular Invasion: The Role of Cell Migration in Breast Cancer as a Prototype. Pathobiology 2020;87:218-31. [Crossref] [PubMed]
- Ribelles N, Santonja A, Pajares B, Llácer C, Alba E. The seed and soil hypothesis revisited: current state of knowledge of inherited genes on prognosis in breast cancer. Cancer Treat Rev 2014;40:293-9. [Crossref] [PubMed]
- Zhou J, Zhang Y, Chang KT, Lee KE, Wang O, Li J, Lin Y, Pan Z, Chang P, Chow D, Wang M, Su MY. Diagnosis of Benign and Malignant Breast Lesions on DCE-MRI by Using Radiomics and Deep Learning With Consideration of Peritumor Tissue. J Magn Reson Imaging 2020;51:798-809. [Crossref] [PubMed]
- Li X, Luo K, Zhang N, Chen W, Li B, Lu Z, Chen Y, Wu K. Prediction of Lymphovascular invasion status in breast cancer based on magnetic resonance imaging radiomics features. Magn Reson Imaging 2024;109:91-5. [Crossref] [PubMed]
- Xu Z, Xie Y, Wu L, Chen M, Shi Z, Cui Y, Han C, Lin H, Liu Y, Li P, Chen X, Ding Y, Liu Z. Using Machine Learning Methods to Assess Lymphovascular Invasion and Survival in Breast Cancer: Performance of Combining Preoperative Clinical and MRI Characteristics. J Magn Reson Imaging 2023;58:1580-9. [Crossref] [PubMed]
- Kayadibi Y, Kocak B, Ucar N, Akan YN, Yildirim E, Bektas S. MRI Radiomics of Breast Cancer: Machine Learning-Based Prediction of Lymphovascular Invasion Status. Acad Radiol 2022;29:S126-34.
- Hu J, Xu J, Li M, Jiang Z, Mao J, Feng L, Miao K, Li H, Chen J, Bai Z, Li X, Lu G, Li Y. Identification and validation of an explainable prediction model of acute kidney injury with prognostic implications in critically ill children: a prospective multicenter cohort study. EClinicalMedicine 2024;68:102409. [Crossref] [PubMed]

