Performance of the Models Predicting In-Hospital Mortality in Patients with ST-Segment Elevation Myocardial Infarction with Predictors in Categorical and Continuous Forms

Shakhgeldyan K.I., Kuksin N.S., Domzhalov I.G., Geltser B.I.

Key words: prognostic models; data categorization; ST-segment elevation myocardial infarction; mortality; risk factors; method of Shapley additive explanation.

The aim of the study is to assess the performance of predictive models developed on the basis of predictors in the continuous and categorical forms to predict the probability of in-hospital mortality (IHM) in patients with ST-segment elevation myocardial infarction (STEMI) after percutaneous coronary intervention (PCI).

Materials and Methods. A single-center retrospective study has been conducted, within the framework of which data from 4674 medical records of patients with STEMI after PCI, treated at the Regional Vascular Center of Vladivostok (Russia), have been analyzed. Two groups of patients were identified: group 1 consisted of 318 (6.8%) individuals who died in the hospital, group 2 included 4356 (93.2%) patients with a favorable outcome of treatment. IHM prognostic models were developed using multivariate logistic regression (MLR), random forest (RF), and stochastic gradient boosting (SGB). 6-metric qualities were used to evaluate the accuracy of the models. Threshold values of IHM predictors were determined using a grid search to find the optimal cut-off points, calculating centroids, and Shapley additive explanations. The latter helped evaluate the degree to which the model predictors influence the endpoint.

Results. Based on the results of the multi-stage analysis of indicators of clinical and functional status of the STEMI patients, new predictors of IHM have been identified and validated, complementing the factors of the GRACE scoring system, their categorization has been carried out and prognostic models with continuous and categorical variables based on MLR, RF, and SGB have been developed. These models had a high (AUC — 0.88 to 0.90) and comparable predictive accuracy, but their predictors differed in various degrees of influence on the endpoint. The comparative analysis has shown that the Shapley additive explanation method has advantages in categorizing predictors compared to other methods and allows for detailing the structure of their relationships with IHM.

Conclusion. The use of modern data mining methods, including machine learning algorithms, categorization of predictors, and assessment of the degree of their effect on the endpoint, makes it possible to develop predictive models possessing high accuracy and the properties of explanation of the generated conclusions.

Introduction

Today, one of the effective methods of treating ST-segment elevation myocardial infarction (STEMI) is revascularization of myocardium using percutaneous coronary intervention [1]. However, in-hospital mortality (IHM) rate after emergency percutaneous coronary intervention (PCI) remains high from 4 to 7% making the need for predicting unfavorable events crucial [2].

The GRACE scoring system (Global Registry of Acute Coronary Events) is referred to the most required tools of riskometry, the improvement of which is the aim of a number of investigations of the last years [3–6]. In the majority of cases, the base factors of this system are complemented with new predictors to build prognostic models, and the correct selection of their threshold values remains important.

At present, methods of predictive analytics based on machine learning, which are increasingly being applied in different field of medicine, may be used to solve this task [7–10]. At the same time, implementation of the machine learning models into clinical practice is rather limited due to their “nontransparency”. This problem may be solved by means of categorization of continuous variables used in prognostic algorithms. Data categorization makes it possible to determine threshold values of the analyzed indicators, the deviations in which may be used for detection of the risk factors and clinical reasoning of predicted probability of unfavorable events [11, 12]. Besides, by combining the risk factors one can realize the possibility of characterizing a complex impact of various features on the response variable [13]. Moreover, some authors believe that indicators in the categorical form, dichotomized in particular, may lead to the loss of information, distortion of the analysis results [14–16], increased share of false-positive [17] and false-negative [18] conclusions. Despite the indicated drawbacks, recommendations of the STROBE (Strengthening the reporting of observational studies in epidemiology) confirm the appropriateness of using data categorization provided that the methods of its implementation are indicated [19].

The aim of the study isto assess the performance of predictive models built on the basis of predictors in the continuous and categorical forms to predict the probability of in-hospital mortality in patients with ST-segment elevation myocardial infarction after percutaneous coronary intervention.

Materials and Methods

Characteristics of patients. A single-center retrospective cohort study has been carried out, within the scope of which medical records of 4674 patients (3200 men and 1474 women) with STEMI at the age of 26–93 years (median — 63 years, 95% confidence interval (CI): 62–63), have been analyzed. Patients were treated in the Regional Vascular Center “Primorskiy Territory Clinical Hospital No.1” (Vladivostok, Russia) in the period from 2015 to 2021 [20]. All patients underwent emergency PCI. Patients were divided into two groups: group 1 comprised 318 patients (6.8%) died in the hospital; group 2 included 4356 patients (93.2%) with a favorable outcome. The study was conducted in compliance with the Declaration of Helsinki and approved by the local ethical committee of the Far Easten Federal University (Vladivostok, Russia), Protocol No.8 of June 8, 2023.

Patients with validated STEMI and PCI performed on the first day of the inpatient treatment met the criteria of inclusion into the study. Exclusion criteria were as follows: unstable angina, non-ST-elevation myocardial infarction, and absence of indications for PCI.

Clinical and functional status of the patients was evaluated on the first day of hospital treatment using 136 factors, the main of which are presented in Table 1. The data included 5 features from the GRACE scoring system: patients’ age, acute heart failure (AHF) according to Killip classification, heart rate (HR), systolic blood pressure (SBP), creatinine concentration in blood serum. Indicators of the laboratory tests have been also analyzed: content of erythrocytes (RBC), leukocytes (WBC), lymphocytes (LYM), neutrophils (NEUT), eosinophils (EOS); hemoglobin (Hb), thrombocytes (PLT), and thrombocrit (PCT); international normalized ratio (INR); thrombin time (TT); prothrombin index (PTI), activated partial thromboplastin time (APTT); levels of fibrinogen and glucose in blood serum. Postoperative echocardiographic examination included determination of the longitudinal and transverse dimensions of left and right atrium (LA1, LA2 and RA1, RA2), end systolic (ESD) and diastolic (EDD) dimension of the left ventricle (LV), left ventricular ejection fraction (LV EF) using the Teichholz formula, mean pulmonary artery pressure (mPAP). The following estimate indicators have been assessed: relative myocardium mass index of the left ventricle (LV RMMI), relative posterior wall thickness (RPWT) of the LV, body mass index (BMI).

Table 1. Clinical and functional description of patients with ST-segment elevation myocardial infarction

The final point of the investigation was presented by the IHM indicator in STEMI patients after PCI from all causes in the form of categorical binary feature (“presence” or “development”).

Methods of statistical analysis and machine learning. According to the Kolmogorov–Smirnov test, the data were not distributed normally, therefore, nonparametric statistical methods were applied. The indicators were presented as a median (Me) and interquartile intervals [Q1; Q3]; the Mann–Whitney test was used for intergroup comparison of continuous variables, while ÷² was applied for categorical ones. Odds ratio (OR) and their 95% CI was calculated using the Fisher’s exact test. Differences were considered statistically significant at p<0.01.

Models were developed using methods of multivariate logistic regression (MLR), random forest (RF), and stochastic gradient boosting (SGB). Their quality was estimated by six metrics: area under ROC-curve (AUC), sensitivity (Sen), specificity (Spec), F1-score, positive predictive value (PPV), negative predictive value (NPV).

To select the threshold values of potential predictors, methods of optimization on the grid with a pitch of Δ=(max–min)/100 were employed: minimization of p-value, Min(p), maximization of OR, Max(OR), and AUC, Max(AUC), method of centroids, and Shapley additive explanations (SHAP) [21, 22]. The latter was also used to evaluate the degree to which the model predictors influence the endpoint.

The dataset was divided into 2 samples: for training and cross-validation (80%), and for final testing (20%). The procedure of training and cross-validation was performed by stratification in k-Folders on 10 samples. The averaged quality metrics AUC, Sen, and Spec were used to choose the best model, select and validate predictors and select optimal hyper-parameters by grid search over acceptable values. For final testing, the best models of MLR, RF, and SGB with optimal parameters and hyper-parameters were trained on 80% of the dataset, and validated on the subgroup for final testing. To estimate the quality metrics by confidence regions, the procedure was repeated 500 times performing randomly the initial sampling using Monte Carlo method. The data analysis and model building were done in Python 3.9.16 with an open source code.

Study design included 4 stages. At the first stage, a pool of potential IHM predictors was formed using the tests of intergroup comparisons (see Table 1).

At the second stage, methods of machine learning were used to develop prognostic IHM models with predictors in the continuous form including 5 basic factors of the GRACE scale. To improve the prognostic accuracy, the model structure was supplemented in a step-wise manner with new predictors selected at the first stage of the investigation provided that statistically significant difference was at the level p<0.01. The prognostic significance of the predictor was considered validated, if the AUC value increased after its inclusion into the model. At this stage, the degree of influence of the best model predictors on the study endpoint was analyzed using the SHAP method.

At the third stage, continuous predictors were categorized by means of different techniques in order to find the threshold values, the deviations from which would allow us to refer them to the IHM risk factors.

At the fourth stage, prognostic IHM models were generated based on the categorical predictors and evaluated in terms of their impact on the endpoint.

Results

At the first stage, an intergroup analysis of clinical, demographic, and laboratory indicators has been carried out. The majority of them, including all predictors of the GRACE scale, had statistically significant differences (see Table 1). Elderly females short in height predominated among the dead patients. Besides, it was typical for patients of group 1 to have AHF Killip class III and IV; lower values of SBP, DBP, LV EF, LYM, EOS, PTI, RBC, Hb; higher levels of HR, mPAP, LV ESD, creatinine, NEUT, APTT, INR, urea, glucose; increased atrial dimensions and LV RMMI indicator. They more often suffered from anterior myocardial infarction, type 2 diabetes (DM2), atrial fibrillation (AF), and chronic kidney disease (CKD).

At the second stage, prognostic IHM models were generated, where new factors were used in the continuous form additionally to the 5 basis indicators of the GRACE scoring system. These factors were selected during iterative testing of the pool of potential predictors obtained at the first stage. Testing was performed by including alternately each potential predictor into the base model of the GRACE scale and leaving in the model the only one that gave maximal increase of the AUC metrics. In the next iterations, the procedure was repeated for the remaining potential predictors. In this way, 5 new prognostic factors were selected: LV EF, NEUT, EOS, PCT, and glucose. It should be also noted that the comparison of AUC values for the model previously built by us [23] using the same patient sample and the predictors of GRACE only and the model supplemented by new predictors, demonstrated a higher accuracy of the latter (AUC — 0.836 vs. 0.90).

The comparison of the predictive value of the models generated by MLR, SGB, and RF has shown that they possess a high prognostic capacity and have close values of quality metrics during cross-validation and final testing (AUC varied from 0.884 to 0.90). This shows the absence of their re-training and good generalization properties (Table 2).

Table 2. Assessment of accuracy of in-hospital mortality prognostic models for patients with ST-segment elevation myocardial infarction after percutaneous coronary intervention using predictors

in the continuous form, Me [Q1; Q3]

Assessing the impact of continuous predictors on the endpoint using the SHAP method has shown that LV EF and creatinine have the closest association with IHM. HR, NEUT, and glucose influenced this point to a lesser degree, while indicators such as EOS, SBP, patients’ age, PCT, and AHF Kilipp class had the smallest effect on the endpoint (Figure 1).

Figure 1. Assessment of importance of in-hospital mortality predictors in the continuous form in the model of multivariate logistic regression

At the third stage of the study, IHM predictors were categorized in the continuous form using grid search to find the optimal cut-off threshold, SHAP method, and centroid calculation. Application of the threshold values, the deviation from which is associated with the higher probability of IHM, allows for considering the categorized data as risk factors for adverse events. The risk factor is encoded as “1” with the postfix “+” if the predictor value exceeds the threshold, or with the postfix “–” if the value does not reach the threshold; in other cases, when the predictor value is in another range, the risk factor is encoded as “0”.

The results of the study have shown that the threshold values obtained by various methods often differ from each other. For example, the cut-off threshold for LV EF indicator according to SHAP was <45%, while after OR and AUC maximization, the cut-off point was fixed at the level of 31 and 51%, respectively (Table 3). At the same time, the threshold values determined by the SHAP algorithms were closest to the criterial boundaries verified by the Min(p) method. The threshold values obtained by the Max(OR) had an extreme cut-off threshold and allowed for identification of only a narrow group of individuals with a high IHM probability. It is worth mentioning that the SHAP method enables investigators not only to determine the threshold boundaries but to assess the intensity of the impact on the IHM indicators, whose values are in the “risk zone”. The following features are referred to the categorical factors selected by this method: age >70 years, HR >89 per minute, SBP <95 mm Hg, AHF Killip class >II, creatinine >166 μmol/L, LV EF <45%, NEUT >77%, EOS <0.2%, PCT >0.32%, glucose >8.9 mmol/L (Figure 2). With the use of a LV EF diagram as an example, it is seen that the IHM probability increases successively in the value range of 44–31% and grows sharply at the level of this indicator <30%. The elevation of glucose in the blood by more than 8.9 mmol/L increases the risk of IHM, but the probability of the fatal outcome grows significantly at its level exceeding 17 mmol/L.

Table 3. Categorization of continuous predictors of in-hospital mortality using various methods

Figure 2. The effect of continuous indicators and their threshold values on the endpoint evaluated by the SHAP method

A dashed line designates the cut-off threshold

At the fourth stage of the study, MLR-based prognostic IHM models with predictors in a categorical form have been developed (Table 4). The comparative analysis has shown that the majority of models possessed a high predictive capacity irrespective of the method of threshold value determination. The model, in which predictor categorization was done by the Max(OR), was an exception because it did not provide the acceptable prediction accuracy (AUC — 0.662). The evaluation of the model quality metrics with categorical and continuous predictors has demonstrated the absence of statistically significant differences. For example, 95% CI for AUC medians in the analyzed models was 0.882–0.887 and 0.841–0.959, respectively, at p=0.172, which indicated their comparable prognostic accuracy.

Table 4. Accuracy assessment of the models for predicting in-hospital mortality based on predictors in the categorical form, Me [Q1; Q3]

The analysis of categorical variable impact on IHM by the SHAP method has shown that the greatest effect on IHM was caused by the following risk factors: HR >89 per minute, creatinine >166 μmol/L, and neutrophil content >77% (SHAP values of 1.28; 1.27, and 1.20, respectively) (Figure 3). Weaker association with the endpoint was observed in the indicators of glucose >8.9 mmol/L, LV EF <45%, AHF (Killip class III and IV), SBP <95 mm Hg (SHAP — 1.10, 1.02, 0.98, 0.82, respectively), while the weakest impact on IHM was produced by the age of >71 years, eosinophil content <0.2%, and thrombocrit >0.32% (SHAP — 0.62, 0.58, and 0.14).

Figure 3. Assessment of importance of in-hospital mortality predictors in the categorical form in the multivariate logistic regression model

Discussion

In recent years, machine learning-based prognostic models are being developed, the structure of which is presented by the factors with a higher predictive potential than in the classic scales of riscometry. Perspective tools for their selection are the algorithms of explainable artificial intelligence (XAI), the elements of which include determination of the threshold values of the analyzed factors and assessment of their impact on the endpoint of the investigation. The XAI concept is based on the grounding and interpretation of various decisions obtained as a result of modeling, evaluation of their significance, and accuracy of the generated conclusions [24]. One of the barriers for implementation of these principles is multifactor and nonlinear nature of prognostic models, when a battery of diverse data associated with different causes of fatal events influence the endpoint. Non-transparency of interconnections of various factors with IHM may be partially overcome by their categorization, which allows one to detail the correlations of indicators of clinical and functional status of STEMI patients with the resulting variable. According to the literature data, the most available method of categorization is descriptive statistics with calculation of medians, quartiles, and quantiles [16, 25, 26]. At the same time, a large portion of critical remarks on categorization is connected exactly with this approach, which is caused primarily by the dependence of these threshold values on the specific sample, absence of interconnection with the clinical context, ignoring possible non-linear relations, and others. Another approach is based on the selection of the threshold value, known from practice as going beyond the norm [26]. An alternative method is searching for the optimal threshold values based on minimization or maximization of the target functions. In our opinion, categorization must be considered only within the framework of solving a specific clinical task, although the selected predictors may be of interest for realization of other prognostic investigations. Whatever the method was used to determine their threshold values, they may result in the loss of information, on the one hand, while bringing new knowledge, on the other. In the present study, the threshold values were determined using the methods of grid search to find the optimal cut-off points, calculation of centroids, and SHAP. Deviations from the threshold values increased their predictive potential and allowed us to refer these indicators to the IHM risk factors in STEMI patients after PCI. It has been established that SHAP method, which is considered as a XAI technology, is a promising tool of categorization due to a precise estimation of the cut-off thresholds and analysis of interrelations of predictors in the continuous and categorical forms with the investigation endpoint. Despite the comparable accuracy of the prognostic IHM models with continuous and categorical predictors, there were certain differences in the intensity of their influence on the endpoint. Thus, indicators of LV EF and creatinine demonstrated the greatest interconnection with fatal outcome among the continuous predictors, while the smallest one was shown by AHF Killip class and patients’ age. Among the categorical predictors, the most noticeable association of IHM was with HR >89 per minute, creatinine >166 μmol/L, and neutrophil level >77%, and the minimal association was with thrombocrit >32% and eosinophil content <0.2%. These differences may be explained by the endpoint of the investigation presented in the form of IHM from all causes, which does not allow for verification of the predictors connected with the concrete variant of unfavorable outcome (repeated myocardial infarction, fatal arrhythmias, bleeding, and others). The other cause of mismatched predictor importance may be connected with the fact that in our study, they were categorized with the selection of only one cut-off threshold. A single criterial boundary limits the possibilities for estimation of non-linear interconnections of IHM with the predictor values being in the risk zone. A sharp increase of IHM probability in LV EF less than 30% relative to the range of its values of 31–44% may serve as a convincing example of such situation (see Figure 2). In our case, the LV EF predictor <45% was not so important as indicators HR >89 per minute and creatinine >166 μmol/L in the prognostic IHM model with the categorical factors; besides, this predictor indicated that overcoming criterial boundary, selected in the process of dichotomization, is associated with the growing risk of an adverse outcome. In the recent years, in order to improve the accuracy of prediction, it is recommended to perform variable categorization using several cut-off thresholds, which specifies non-linear interconnections of predictors with the endpoint [19]. The results of our study demonstrate that despite some problems connected with dichotomization of continuous variables in prognostic models, it is appropriate to perform this procedure, since it broadens the possibilities for explanation and clinical interpretation of the generated conclusions. At the same time, it is quite evident that this approach needs further improvement by applying the technologies of multilevel categorization.

Study limitations are linked to its retrospective character, application of only dichotomization for categorization of the continuous variables, and model validation using the data from other medical settings.

Conclusion

In the present investigation, we selected and validated new predictors of in-hospital mortality in patients with ST-segment elevation myocardial infarction after percutaneous coronary intervention, categorized them, and developed prognostic models with continuous and categorical variables based on multivariate logistic regression, random forest, and stochastic gradient boosting. These models had a high and comparable predictive accuracy, but their predictors had different degree of impact on the endpoint. The comparative analysis has shown that Shapley additive explanation method has advantages in categorization of predictors and allows for detailing the structure of their interconnections with in-hospital mortality. To improve prognostic models with categorical predictors, it is appropriate to use multilevel cut-off thresholds upgrading the quality and explanation of the generated conclusions.

Study funding. The work was supported by the grant of the Russian Science Foundation No.23-21-00250, https://rscf.ru/project/23-21-00250/.

Conflicts of interest. The authors have no evident or potential conflicts of interest to declare.

References

Ibanez B., James S., Agewall S., Antunes M.J., Bucciarelli-Ducci C., Bueno H., Caforio A.L.P., Crea F., Goudevenos J.A., Halvorsen S., Hindricks G., Kastrati A., Lenzen M.J., Prescott E., Roffi M., Valgimigli M., Varenhorst C., Vranckx P., Widimský P.; ESC Scientific Document Group. 2017 ESC Guidelines for the management of acute myocardial infarction in patients presenting with ST-segment elevation: The Task Force for the management of acute myocardial infarction in patients presenting with ST-segment elevation of the European Society of Cardiology (ESC). Eur Heart J 2018; 39(2): 119–177, https://doi.org/10.1093/eurheartj/ehx393.
Pfuntner A., Wier L.M., Stocks C. Most frequent procedures performed in U.S. hospitals, 2011. In: Healthcare Cost and Utilization Project (HCUP) Statistical Briefs. Rockville (MD): Agency for Healthcare Research and Quality (US); 2014.
Wang J.J., Fan Y., Zhu Y., Zhang J.D., Zhang S.M., Wan Z.F., Su H.L., Jiang N. Biomarkers enhance the long-term predictive ability of the KAMIR risk score in Chinese patients with ST-elevation myocardial infarction. Chin Med J (Engl) 2019; 132(1): 30–41, https://doi.org/10.1097/cm9.0000000000000015.
Liu X.J., Wan Z.F., Zhao N., Zhang Y.P., Mi L., Wang X.H., Zhou D., Wu Y., Yuan Z.Y. Adjustment of the GRACE score by HemoglobinA1c enables a more accurate prediction of long-term major adverse cardiac events in acute coronary syndrome without diabetes undergoing percutaneous coronary intervention. Cardiovasc Diabetol 2015; 14: 110, https://doi.org/10.1186/s12933-015-0274-4.
Chen X., Shao M., Zhang T., Zhang W., Meng Y., Zhang H., Hai H., Li G. Prognostic value of the combination of GRACE risk score and mean platelet volume to lymphocyte count ratio in patients with ST-segment elevation myocardial infarction after percutaneous coronary intervention. Exp Ther Med 2020; 19(6): 3664–3674, https://doi.org/10.3892/etm.2020.8626.
Wenzl F.A., Kraler S., Ambler G., Weston C., Herzog S.A., Räber L., Muller O., Camici G.G., Roffi M., Rickli H., Fox K.A.A., de Belder M., Radovanovic D., Deanfield J., Lüscher T.F. Sex-specific evaluation and redevelopment of the GRACE score in non-ST-segment elevation acute coronary syndromes in populations from the UK and Switzerland: a multinational analysis with external cohort validation. Lancet 2022; 400(10354): 744–756, https://doi.org/10.1016/s0140-6736(22)01483-0.
Li R., Shen L., Ma W., Yan B., Chen W., Zhu J., Li L., Yuan J., Pan C. Use of machine learning models to predict in-hospital mortality in patients with acute coronary syndrome. Clin Cardiol 2023; 46(2): 184–194, https://doi.org/10.1002/clc.23957.
Zack C.J., Senecal C., Kinar Y., Metzger Y., Bar-Sinai Y., Widmer R.J., Lennon R., Singh M., Bell M.R., Lerman A., Gulati R. Leveraging machine learning techniques to forecast patient prognosis after percutaneous coronary intervention. JACC Cardiovasc Interv 2019; 12(14): 1304–1311, https://doi.org/10.1016/j.jcin.2019.02.035.
Du X., Wang H., Wang S., He Y., Zheng J., Zhang H., Hao Z., Chen Y., Xu Z., Lu Z. Machine learning model for predicting risk of in-hospital mortality after surgery in congenital heart disease patients. Rev Cardiovasc Med 2022; 23(11): 376, https://doi.org/10.31083/j.rcm2311376.
Zhao P., Liu C., Zhang C., Hou Y., Zhang X., Zhao J., Sun G., Zhou J. Using machine learning to predict the in-hospital mortality in women with ST-segment elevation myocardial infarction. Rev Cardiovasc Med 2023; 24(5): 126, https://doi.org/10.31083/j.rcm2405126.
MacCallum R.C., Zhang S., Preacher K.J., Rucker D.D. On the practice of dichotomization of quantitative variables. Psychol Meth 2002; 7(1): 19–40, https://doi.org/10.1037/1082-989x.7.1.19.
Gupta R., Day C.N., Tobin W.O., Crowson C.S. Understanding the effect of categorization of a continuous predictor with application to neuro-oncology. Neurooncol Pract 2021; 9(2): 87–90, https://doi.org/10.1093/nop/npab049.
Geltser B.I., Shakhgeldyan K.I., Rublev V.Yu., Domzhalov I.G., Tsivanyuk M.M., Shekunova O.I. Phenotyping of risk factors and prediction of inhospital mortality in patients with coronary artery disease after coronary artery bypass grafting based on explainable artificial intelligence methods. Rossijskij kardiologiceskij zurnal 2023; 28(4): 5302, https://doi.org/10.15829/1560-4071-2023-5302.
Dawson N.V., Weiss R. Dichotomizing continuous variables in statistical analysis: a practice to avoid. Med Decis Making 2012; 32(2): 225–226, https://doi.org/10.1177/0272989x12437605.
Salis Z., Gallego B., Sainsbury A. Researchers in rheumatology should avoid categorization of continuous predictor variables. BMC Med Res Methodol 2023; 23(1): 104, https://doi.org/10.1186/s12874-023-01926-4.
Altman D.G., Royston P. The cost of dichotomising continuous variables. BMJ 2006; 332(7549): 1080, https://doi.org/10.1136/bmj.332.7549.1080.
Austin P.C., Brunner L.J. Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses. Stat Med 2004; 23(7): 1159–1178, https://doi.org/10.1002/sim.1687.
Streiner D.L. Breaking up is hard to do: the heartbreak of dichotomizing continuous data. Can J Psychiatr 2002; 47(3): 262–266, https://doi.org/10.1177/070674370204700307.
von Elm E., Altman D.G., Egger M., Pocock S.J., Gøtzsche P.C., Vandenbroucke J.P.; STROBE Initiative. The Strengthening the Reporting of Observational Studies Epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet 2007; 370(9596): 1453–1457, https://doi.org/10.1016/s0140-6736(07)61602-x.
Gel’tser B.I., Shakhgel’dyan K.I., Domzhalov I.G., Kuksin N.S., Kokarev E.A., Pak R.L. Prognosticheskaya otsenka kliniko-funktsional’nogo statusa patsientov s infarktom miokarda s pod,,emom segmenta ST posle chreskozhnogo koronarnogo vmeshatel’stva. Svidetel’stvo o registratsii bazy dannykh 2023622740, 10.08.2023. Zayavka No.2023622516 ot 28.07.2023 [Prognostic assessment of the clinical and functional status of patients with ST-segment elevation myocardial infarction after percutaneous coronary intervention. Database registration certificate 2023622740, October 8, 2023. Application No.2023622516 dated August 28, 2023].
Valente F., Henriques J., Paredes S., Rocha T., de Carvalho P., Morais J. A new approach for interpretability and reliability in clinical risk prediction: acute coronary syndrome scenario. Artif Intell Med 2021; 117: 102113, https://doi.org/10.1016/j.artmed.2021.102113.
Lundberg S.M., Lee S.I. A unified approach to interpreting model predictions. In: Advances in neural information processing systems. arXiv; 2017, https://doi.org/10.48550/arxiv.1705.07874.
Geltser B.I., Shahgeldyan K.I., Domzhalov I.G., Kuksin N.S., Kokarev E.A., Kotelnikov V.N., Rublev V.Yu. Prediction of in-hospital mortality in patients with ST-segment elevation acute myocardial infarction after percutaneous coronary intervention. Rossijskij kardiologiceskij zurnal 2023; 28(6): 5414, https://doi.org/10.15829/1560-4071-2023-5414.
Molnar C. Interpretable machine learning. A guide for making black box models explainable. 2023. URL: https://christophm.github.io/interpretable-ml-book/.
Mabikwa O.V., Greenwood D.C., Baxter P.D., Fleming S.J. Assessing the reporting of categorised quantitative variables in observational epidemiological studies. BMC Health Serv Res 2017; 17(1): 201, https://doi.org/10.1186/s12913-017-2137-z.
Turner E.L., Dobson J.E., Pocock S.J. Categorisation of continuous risk factors in epidemiological publications: a survey of current practice. Epidemiol Perspect Innov 2010; 7: 9, https://doi.org/10.1186/1742-5573-7-9.