Machine Learning Application to Predict Bicycle Ergometer Test Results: a Prospective Cohort Study

Berezina E.V., Blinova K.A., Dmitrieva O.A., Mishina I.E.

Key words: machine learning; bicycle ergometry test; six-minute walk test; cardiac rehabilitation; prediction; gradient boosting.

The aim of the study was to develop an optimal technique to predict the results of a bicycle ergometry test (BET) based on the parameters recorded during a six-minute walk test (6MWT) using machine learning methods.

Materials and Methods. The study involved 56 patients who had experienced acute myocardial infarction and were undergoing the second stage of cardiac rehabilitation. The patients underwent a complete examination, including history taking, physical examination, and anthropometric assessment, as well as a symptom-limited BET and a 6MWT. During the 6MWT, we recorded the distance covered, heart rate, blood pressure, oxygen saturation, Borg rating of perceived exertion, the number of steps taken, and electrocardiographic data. Random forest, gradient boosting, k-nearest neighbors, and multiple linear regression algorithms were used to construct the machine learning models. Model performance was evaluated using the coefficient of determination, mean absolute error, mean squared error, and root mean squared error. SHAP analysis was applied to interpret the findings.

Results. The gradient boosting model provided the best prediction quality with a high determination coefficient (R² being around 0.99) and low error values for both target metrics: the distance walked in the 6MWT and the metabolic equivalent achieved during the BET. The significance analysis of features revealed the heart rate, age, and the body mass index to have the greatest impact on predicting the 6MWT distance, while for predicting the metabolic equivalent, the distance covered, the number of steps, and the body mass index were the most significant.

Conclusion. The developed gradient boosting-based machine learning model demonstrated high efficiency in predicting BET results from 6MWT data. The suggested method can serve as a valuable auxiliary tool for planning cardiac rehabilitation programs, particularly when BET is difficult or impossible to perform. SHAP analysis helped to clarify the contribution of each feature to the prediction, increasing confidence in the model results.

Introduction

Cardiovascular diseases (CVD) are a serious challenge for healthcare worldwide, being the leading cause of morbidity and mortality [1]. According to the World Health Organization, CVD are responsible for 19 million deaths a year, corresponding to nearly 32% of total global mortality [2]. Cardiac rehabilitation is an important element of the comprehensive management of CVD patients, providing an integrated approach to recovery and secondary prevention. Systematic reviews have shown a positive effect of cardiac rehabilitation on the survival of patients after acute coronary syndrome compared with control groups not participating in rehabilitation programs [3–5]. The CVD death rate was found to decrease by 26%, and the readmission rate by 18% [6]. The ultimate purpose of cardiac rehabilitation is patient recovery and full reintegration into the community without functional limitations, which together improve clinical status, quality of life, and survival prognosis [7].

Different exercise stress tests are used to assess exercise tolerance when prescribing training in cardiac rehabilitation. The bicycle ergometer test (BET) is used to determine the metabolic equivalent of task (MET) achieved during the stress test. However, the method is relatively costly and time-consuming [8]. Although BET can occasionally be used as part of a cardiac rehabilitation program, the 6-minute walk test (6MWT) is frequently applied for routine monitoring of patients’ physical capacity. The 6MWT is an economical, relatively rapid, and safe technique for assessing functional exercise tolerance in patients with cardiovascular pathologies. It is widely applied because it allows monitoring of the course of different CVD and evaluating the efficacy of therapeutic and rehabilitation procedures [9, 10]. It should be noted that BET and 6MWT are essentially different. BET aims at reaching maximal capacity within 8–12 min using a protocol with gradually increasing intensity [11], whereas the 6MWT is a constant-intensity test traditionally classified as submaximal, which makes it safer. In particular, a maximal test such as BET poses higher risks for patients with severe conditions than a submaximal test such as the 6MWT [12]. This is reflected in the different safety requirements for the two tests, including the need for qualified staff, a defibrillator, electrocardiographic monitoring, and first aid supplies. Thus, clinical practice frequently faces the problem of determining the optimal exercise intensity for training during cardiac rehabilitation when BET cannot be performed for various reasons.

In recent years, interest in advances in artificial intelligence (AI) has grown within the medical community, which in particular has stimulated the introduction of monitoring systems into rehabilitation practice [13]. One of the key areas of AI application is image processing and classification, which has already been implemented in many medical fields, including radiology, oncology, dermatology, and cardiology [14–17]. AI is being increasingly used in robot-assisted surgery, genomics, the prediction of clinical outcomes, and decision-making. Studies have repeatedly demonstrated that AI can match the proficiency of clinicians and even surpass it.

Machine learning methods are used for modeling complex interactions between sets of variables [14–17]. In particular, assessing the efficiency of cardiac rehabilitation requires the analysis of clinical, psychological, and anthropometric parameters, as well as characteristics related to risk factors and cardioprotective habits. AI technologies can integrate data obtained from different sources, including sensors that record physiological parameters such as blood pressure, heart rate (HR), electrocardiographic data on exertion, blood oxygen saturation (SpO₂), and others. Thus, there is a potential opportunity to predict BET results from parameters obtained by remote biosensors during a 6MWT.

Despite the growing interest in AI in medicine, only a few studies have addressed rehabilitation. Nearly twenty years ago, Zhu et al. [18], in a comparative study involving over 20,000 home care patients, demonstrated that even a simple k-nearest neighbors algorithm can predict rehabilitation potential more accurately than standard clinical evaluation protocols. In further research, Zhu et al. [19] reported that support vector machines and random forests far exceed the traditional data analysis techniques applied in rehabilitation.

Lin et al. [20] used machine learning to predict post-stroke rehabilitation results. An analysis of data from nearly 300 patients using logistic regression, support vector machine, and random forest showed that regression algorithms can estimate the Barthel index with a mean absolute error of around 10, while classifiers provide over 70% accuracy in dividing patients into three categories according to activity level.

The aim of the study was to develop an optimal technique to predict the results of a BET based on the parameters recorded during a 6MWT using machine learning methods. The study thereby aimed at providing an effective auxiliary tool for planning cardiac rehabilitation programs.

Materials and Methods

The present prospective cohort study analyzed the data of 56 patients with acute myocardial infarction: 45 male (80.4%) and 11 female (19.6%) patients. The study was carried out at the Cardiological Dispensary (Russia), where the patients were undergoing the second stage of cardiac rehabilitation as inpatients.

All patients underwent a comprehensive examination including history taking and recording of complaints, physical examination, and anthropometric assessment (height, body mass, body mass index (BMI)). Within 24 h before the start of physical rehabilitation, all patients underwent a symptom-limited BET and a 6MWT. BET was terminated when the patient refused to continue the test because of subjective exercise intolerance (marked fatigue, pain in the lower limbs, breathlessness, etc.). This condition was interpreted as reaching the limit of exercise tolerance. The 6MWT was carried out under US guidance using Accordix, a telemedicine system (Neurosoft, Ivanovo). Accordix is designed for remote monitoring and recording of physiological parameters, in particular ECG at rest and on exertion. The system capabilities include computer-aided real-time analysis of the recorded data, including during a 6MWT. At the initial stage, the following characteristics were assessed: the distance covered by the patient during the 6MWT; the power achieved during BET (in MET); and the presence of dynamic ECG changes during both tests. HR, arterial blood pressure, and SpO₂ were measured before the exercise tests. During the 6MWT, in addition to the distance covered, we evaluated the Borg rating of perceived exertion (6–20 scale) and the number of steps taken.

To achieve the study aim, several machine learning models were developed: random forest regression (RFR), gradient boosting, k-nearest neighbors (kNN), and multiple linear regression (MLR). The models were developed and the findings visualized in the Python programming language using Google Colab.
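The comparison of the four algorithms named above can be illustrated with a minimal sketch. It assumes the scikit-learn library, default or arbitrarily chosen hyperparameters, a hold-out split, and synthetic stand-in data; none of these details are reported in the study, so the block shows the general workflow rather than the authors' actual pipeline.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in data (NOT the study data): 56 rows, 4 hypothetical features.
rng = np.random.default_rng(0)
X = rng.normal(size=(56, 4))
y = 2.0 * X[:, 0] - X[:, 1] + rng.normal(scale=0.3, size=56)

# Assumption: a simple hold-out split; the study does not state its validation scheme.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Hyperparameters below are illustrative choices, not values from the study.
models = {
    "RFR": RandomForestRegressor(n_estimators=200, random_state=0),
    "Gradient boosting": GradientBoostingRegressor(random_state=0),
    "kNN": KNeighborsRegressor(n_neighbors=5),
    "MLR": LinearRegression(),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    mse = mean_squared_error(y_test, pred)
    results[name] = {
        "R2": float(r2_score(y_test, pred)),
        "MAE": float(mean_absolute_error(y_test, pred)),
        "MSE": float(mse),
        "RMSE": float(np.sqrt(mse)),
    }

for name, metrics in results.items():
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```

With so few observations, the ranking of models in such a sketch can change with the random split, which is why repeated or cross-validated evaluation is often preferred for small samples.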

The presence of highly correlated features is known to reduce model performance because of high variance and lower interpretability [21]. For this reason, the correlation coefficient threshold for the considered features was set at 0.7. When the correlation between two features exceeded the threshold, one of them was excluded. The correlation matrix of the initial sample features revealed two characteristics that were excluded from further analysis: chronic heart failure grade and coronary artery disease grade (Figure 1).
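The thresholding rule described above can be sketched as follows. The sketch assumes Pearson correlation and a rule that drops the later feature of each highly correlated pair; the study specifies neither the correlation type nor which feature of a pair was removed. The feature names and values are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def drop_correlated(features, threshold=0.7):
    """Return (kept, dropped) feature names.

    Assumption: within each pair whose |r| exceeds the threshold,
    the feature listed later is the one excluded.
    """
    dropped = []
    for first, second in combinations(list(features), 2):
        if first in dropped or second in dropped:
            continue
        if abs(pearson(features[first], features[second])) > threshold:
            dropped.append(second)
    kept = [name for name in features if name not in dropped]
    return kept, dropped


# Hypothetical toy data, not taken from the study.
features = {
    "feature_a": [300, 350, 400, 450, 500, 550],
    "feature_b": [450, 520, 610, 680, 740, 830],  # nearly proportional to feature_a
    "feature_c": [72, 64, 80, 66, 85, 70],
}

kept, dropped = drop_correlated(features, threshold=0.7)
print("kept:", kept)
print("dropped:", dropped)
```

In practice the choice of which feature to exclude from a correlated pair is usually guided by clinical relevance or data quality rather than by column order.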

Figure 1. Correlation matrix of the patients’ characteristics under study

Age — mean patient age (years); BMI — body mass index; HR — heart rate, prior to a six-minute walk test (6MWT) (bpm); SBP — systolic blood pressure, prior to 6MWT (mm Hg); DBP — diastolic blood pressure, prior to 6MWT (mm Hg); SpO₂ — oxygen saturation (%); Borg scale — Borg rating of perceived exertion (scores); distance — the distance covered (m); CHF grade — chronic heart failure grade; CAD grade — coronary artery disease grade; number of steps — the number of steps taken

Results

Two of the developed models (random forest regression and gradient boosting) were characterized by a sufficiently high coefficient of determination. For the k-nearest neighbors and multiple linear regression models, the coefficient was below 0.5 when predicting the 6MWT value. This means that the features included in the model explain less than half of the variance of the predicted variable, with the larger part attributable to factors not accounted for in the model. Regression models developed under such conditions are of low practical importance. The gradient boosting model showed the best results on all measures, demonstrating the highest coefficient of determination and the lowest errors (see the Table).
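For reference, the four quality measures used here follow their standard definitions, and the coefficient of determination is the share of the target's variance reproduced by the predictions. The short sketch below computes them on made-up numbers; the function name and the values are illustrative and are not taken from the study.

```python
from math import sqrt


def regression_metrics(y_true, y_pred):
    """Return R2, MAE, MSE and RMSE using their standard definitions."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(r) for r in residuals) / n
    mse = sum(r ** 2 for r in residuals) / n
    mean_true = sum(y_true) / n
    ss_res = sum(r ** 2 for r in residuals)            # unexplained variation
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total variation
    r2 = 1.0 - ss_res / ss_tot
    return {"R2": r2, "MAE": mae, "MSE": mse, "RMSE": sqrt(mse)}


# Made-up observed and predicted values, for illustration only.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 7.5, 9.0]

metrics = regression_metrics(y_true, y_pred)
print(metrics)
```

An R² below 0.5 in this formulation means that the residual sum of squares is more than half of the total sum of squares, which is the situation described above for the kNN and MLR models.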

Parameters of the predicted models

For a more detailed analysis of the factors influencing the predicted values in the gradient boosting model, we plotted a bar chart of the features sorted by mean absolute SHAP value (Figure 2 (a)). The absolute significance of some features was rather high. For predicting the 6MWT value, the most significant feature was HR (SHAP value 4.53); smaller contributions were made by the distance covered (SHAP value 2.63) and patient age (SHAP value 2.28), while BMI had a SHAP value of 1.47. For MET prediction (Figure 2 (b)), the largest contribution was made by the distance covered (SHAP value 0.46), followed by BMI (SHAP value 0.26) and the number of steps taken (SHAP value 0.24). HR, age, and systolic and diastolic blood pressure had the minimum effect on the predicted MET value.
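The ranking behind such a bar chart can be sketched as a simple aggregation: per-patient SHAP values are averaged in absolute value for each feature and the features are sorted in descending order. The sketch assumes the per-patient SHAP values have already been computed by an explainer; the numbers below are hypothetical and do not reproduce the study's values.

```python
def rank_by_mean_abs_shap(shap_values):
    """Sort features by the mean absolute SHAP value across patients.

    shap_values maps a feature name to its list of per-patient SHAP values.
    """
    mean_abs = {
        name: sum(abs(v) for v in values) / len(values)
        for name, values in shap_values.items()
    }
    return sorted(mean_abs.items(), key=lambda item: item[1], reverse=True)


# Hypothetical per-patient SHAP values for three patients (not study data).
shap_values = {
    "age": [2.0, -2.5, 1.5],
    "HR": [4.0, -5.0, 4.5],
    "distance": [-2.0, 3.0, 2.5],
}

ranking = rank_by_mean_abs_shap(shap_values)
for name, value in ranking:
    print(f"{name}: {value:.2f}")
```

Taking the absolute value before averaging matters because positive and negative contributions for different patients would otherwise cancel out and understate a feature's influence.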

Figure 2. Mean absolute SHAP value in the gradient boosting model for a 6-minute walk test (6MWT) (a) and the metabolic equivalent (b)

HR — heart rate, prior to 6MWT (bpm); distance — the distance covered (m); age — mean patient age (years); BMI — body mass index; DBP — diastolic blood pressure, prior to 6MWT (mm Hg); SpO₂ — oxygen saturation (%); number of steps — the number of steps taken; SBP — systolic blood pressure, prior to 6MWT (mm Hg)

Discussion

In contrast to the works by De Cannière et al. [22], Desai et al. [23], and Alshurafa et al. [24], who used support vector machines, random forests, decision trees, or neural networks to monitor patients, we used gradient boosting, which is a relatively new approach in this field. De Cannière et al. [22] achieved a mean absolute error of 42.8±36.8 with a support vector model when predicting performance capacity assessed by 6MWT distance, while our gradient boosting model demonstrated significantly lower error values for both target metrics (6MWT and MET).

SHAP analysis, used to interpret the gradient boosting model findings, made it possible to determine the most significant variables for predicting BET outcomes. The analysis also revealed differences in feature significance between the prediction of 6MWT distance and that of MET.

HR prior to the test, patient age, and BMI proved to be the most significant for predicting the 6MWT distance. A higher BMI is frequently associated with poorer physical fitness, decreased aerobic capacity, and an increased load on the musculoskeletal system, which can restrict the distance covered [25]. Initial HR reflects autonomic tone and the functional condition of the cardiovascular system, which affect physical performance [26]. With age, the body’s functional reserve decreases, which also has a negative effect on the distance covered [27].

In turn, the number of steps and BMI proved to be the most important for MET prediction. The number of steps taken is directly related to the level of physical activity and energy expenditure. BMI, as with the 6MWT distance, can restrict the achievement of higher MET levels. It is worth noting that parameters such as HR and systolic and diastolic blood pressure demonstrated lower significance. This may be related to their higher variability and their dependence on a number of factors not accounted for in the model.

The clinical significance of the developed gradient boosting model lies in its potential to optimize cardiac rehabilitation programs. When BET is difficult or impossible to perform (e.g., in patients with severe conditions or when the necessary equipment is unavailable), our model can serve as a valuable auxiliary tool for assessing functional performance and planning individual training programs. It will allow doctors to make more appropriate decisions on exercise intensity, exercise duration, and other aspects of cardiac rehabilitation, which can consequently improve therapy results and patient safety. The model can also be used to monitor patients’ progress in cardiac rehabilitation and to adjust the training program early. This can contribute to improving the quality of life and survival prognosis of CVD patients.

The chief limitation of the present study was the relatively small sample size, which can potentially restrict the generalizability of the findings.

Conclusion

The present prospective cohort study demonstrated the efficiency of current machine learning methods in predicting BET results from data obtained during a 6MWT. The application of random forest, gradient boosting, k-nearest neighbors, and multiple linear regression made it possible to identify the most informative features affecting the predicted 6MWT and MET values.

The gradient boosting model provided the best prediction quality, with a high coefficient of determination (R² of about 0.99) and low error values (MAE, MSE, RMSE) for both target metrics. This confirms the efficiency of the approach for predicting BET results from 6MWT findings without the need for costly and resource-intensive tests. The method is particularly relevant for patients with severe conditions, who can hardly undergo maximal exercise tests, which carry risk.

Study funding. The present study was carried out as a part of the state task for Ivanovo State Medical University “Remote cardiac monitoring in medical rehabilitation” (SLFW-2024-0022), INIS RTD No.124031100012-3.

Conflict of interest. The authors declare no conflict of interest.

References

Gaidai O., Cao Y., Loginov S. Global cardiovascular diseases death rate prediction. Curr Probl Cardiol 2023; 48(5): 101622, https://doi.org/10.1016/j.cpcardiol.2023.101622.
World Health Organization. Cardiovascular diseases (CVDs). 31 July 2025. URL: https://www.who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds).
Shokri K., Karimian A., Radfar A., Mohammadi A., Amerizadeh A., Karimi R., Sadeghi M. Effect of exercise-based cardiac rehabilitation in patients with acute coronary syndrome: a systematic review and meta-analysis. BMC Sports Sci Med Rehabil 2025; 17(1): 233, https://doi.org/10.1186/s13102-025-01270-8.
Meng Y., Zhuge W., Huang H., Zhang T., Ge X. The effects of early exercise on cardiac rehabilitation-related outcome in acute heart failure patients: a systematic review and meta-analysis. Int J Nurs Stud 2022; 130: 104237, https://doi.org/10.1016/j.ijnurstu.2022.104237.
Mahmood A., Ray R., Bin Salam S.S.T., Haque F., Akkaldevi J., Masmoum M.D., Hassan M.S., Essani B., Anjum T., Mirza M.S.S. The effectiveness of cardiac rehabilitation programs in improving cardiovascular outcomes: systematic review and meta-analysis. Cureus 2024; 16(10): e72450, https://doi.org/10.7759/cureus.72450.
Moghei M., Turk-Adawi K., Isaranuwatchai W., Sarrafzadegan N., Oh P., Chessex C., Grace S.L. Cardiac rehabilitation costs. Int J Cardiol 2017; 244: 322–328, https://doi.org/10.1016/j.ijcard.2017.06.030.
Thomas R.J., Beatty A.L., Beckie T.M., Brewer L.C., Brown T.M., Forman D.E., Franklin B.A., Keteyian S.J., Kitzman D.W., Regensteiner J.G., Sanderson B.K., Whooley M.A. Home-based cardiac rehabilitation: a scientific statement from the American Association of Cardiovascular and Pulmonary Rehabilitation, the American Heart Association, and the American College of Cardiology. Circulation 2019; 140(1): e69–e89, https://doi.org/10.1161/CIR.0000000000000663.
Kardioreabilitatsiya i vtorichnaya profilaktika. Pod red. Aronova D.M. [Cardiac rehabilitation and secondary prevention. Aronov D.M. (editor)]. Moscow: GEOTAR-Media; 2021, https://doi.org/10.33029/9704-6218-8-car-2021-1-464.
Coulshed A., Coulshed D., Pathan F. Systematic review of the use of the 6-minute walk test in measuring and improving prognosis in patients with ischemic heart disease. CJC Open 2023; 5(11): 816–825, https://doi.org/10.1016/j.cjco.2023.08.003.
Mikhailovskaya T.V., Nazarova O.A., Dovgalyuk Yu.V., Chistyakova Yu.V., Mishina I.E. Methodological issues of assessment of six-minute walk test in patients with coronary artery disease. Bulletin of Rehabilitation Medicine 2021; 20(3): 37–44, https://doi.org/10.38025/2078-1962-2021-20-3-37-44.
Mapelli M., Salvioni E., Paneroni M., Gugliandolo P., Bonomi A., Scalvini S., Raimondo R., Sciomer S., Mattavelli I., La Rovere M.T., Agostoni P. Brisk walking can be a maximal effort in heart failure patients: a comparison of cardiopulmonary exercise and 6 min walking test cardiorespiratory data. ESC Heart Fail 2022; 9(2): 812–821, https://doi.org/10.1002/ehf2.13781.
Cavero-Redondo I., Saz-Lara A., Bizzozero-Peroni B., Núñez-Martínez L., Díaz-Goñi V., Calero-Paniagua I., Martínez-García I., Pascual-Morena C. Accuracy of the 6-minute walk test for assessing functional capacity in patients with heart failure with preserved ejection fraction and other chronic cardiac pathologies: results of the ExIC-FEp trial and a meta-analysis. Sports Med Open 2024; 10(1): 74, https://doi.org/10.1186/s40798-024-00740-6.
Rasa A.R. Artificial intelligence and its revolutionary role in physical and mental rehabilitation: a review of recent advancements. Biomed Res Int 2024; 2024: 9554590, https://doi.org/10.1155/bmri/9554590.
Norgeot B., Glicksberg B.S., Butte A.J. A call for deep-learning healthcare. Nat Med 2019; 25(1): 14–15, https://doi.org/10.1038/s41591-018-0320-3.
Esteva A., Robicquet A., Ramsundar B., Kuleshov V., DePristo M., Chou K., Cui C., Corrado G., Thrun S., Dean J. A guide to deep learning in healthcare. Nat Med 2019; 25(1): 24–29, https://doi.org/10.1038/s41591-018-0316-z.
Chilamkurthy S., Ghosh R., Tanamala S., Biviji M., Campeau N.G., Venugopal V.K., Mahajan V., Rao P., Warier P. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet 2018; 392(10162): 2388–2396, https://doi.org/10.1016/S0140-6736(18)31645-3.
Ehteshami Bejnordi B., Veta M., Johannes van Diest P., van Ginneken B., Karssemeijer N., Litjens G., van der Laak JAWM; the CAMELYON16 Consortium; Hermsen M., Manson Q.F., Balkenhol M., Geessink O., Stathonikos N., van Dijk M.C., Bult P., Beca F., Beck A.H., Wang D., Khosla A., Gargeya R., Irshad H., Zhong A., Dou Q., Li Q., Chen H., Lin H.J., Heng P.A., Haß C., Bruni E., Wong Q., Halici U., Öner M.Ü., Cetin-Atalay R., Berseth M., Khvatkov V., Vylegzhanin A., Kraus O., Shaban M., Rajpoot N., Awan R., Sirinukunwattana K., Qaiser T., Tsang Y.W., Tellez D., Annuscheit J., Hufnagl P., Valkonen M., Kartasalo K., Latonen L., Ruusuvuori P., Liimatainen K., Albarqouni S., Mungal B., George A., Demirci S., Navab N., Watanabe S., Seno S., Takenaka Y., Matsuda H., Ahmady Phoulady H., Kovalev V., Kalinovsky A., Liauchuk V., Bueno G., Fernandez-Carrobles M.M., Serrano I., Deniz O., Racoceanu D., Venâncio R. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 2017; 318(22): 2199–2210, https://doi.org/10.1001/jama.2017.14585.
Zhu M., Chen W., Hirdes J.P., Stolee P. The k-nearest neighbor algorithm predicted rehabilitation potential better than current Clinical Assessment Protocol. J Clin Epidemiol 2007; 60(10): 1015–1021, https://doi.org/10.1016/j.jclinepi.2007.06.001.
Zhu M., Cheng L., Armstrong J.J., Poss J.W., Hirdes J.P., Stolee P. Using machine learning to plan rehabilitation for home care clients: beyond "black-box" predictions. In: Dua S., Acharya U., Dua P. (editors). Machine learning in healthcare informatics. Intelligent systems reference library, vol 56. Springer, Berlin, Heidelberg; 2014, https://doi.org/10.1007/978-3-642-40017-9_9.
Lin W.Y., Chen C.H., Tseng Y.J., Tsai Y.T., Chang C.Y., Wang H.Y., Chen C.K. Predicting post-stroke activities of daily living through a machine learning-based approach on initiating rehabilitation. Int J Med Inform 2018; 111: 159–164, https://doi.org/10.1016/j.ijmedinf.2018.01.002.
Sharma A., Lysenko A., Jia S., Boroevich K.A., Tsunoda T. Advances in AI and machine learning for predictive medicine. J Hum Genet 2024; 69(10): 487–497, https://doi.org/10.1038/s10038-024-01231-y.
De Cannière H., Corradi F., Smeets CJP, Schoutteten M., Varon C., Van Hoof C., Van Huffel S., Groenendaal W., Vandervoort P. Wearable monitoring and interpretable machine learning can objectively track progression in patients during cardiac rehabilitation. Sensors (Basel) 2020; 20(12): 3601, https://doi.org/10.3390/s20123601.
Desai F., Chowdhury D., Kaur R., Peeters M., Arya R.C., Wander G.S., Gill S.S., Buyya R. HealthCloud: a system for monitoring health status of heart patients using machine learning and cloud computing. Internet of Things 2022; 17: 100485, https://doi.org/10.1016/j.iot.2021.100485.
Alshurafa N., Sideris C., Pourhomayoun M., Kalantarian H., Sarrafzadeh M., Eastwood J.A. Remote health monitoring outcome success prediction using baseline and first month intervention data. IEEE J Biomed Health Inform 2017; 21(2): 507–514, https://doi.org/10.1109/JBHI.2016.2518673.
den Uijl I., van den Berg-Emons R.J.G., Sunamura M., Lenzen M.J., Stam H.J., Boersma E., Tenbült-van Limpt N.C.C.W., Kemps H.M.C., Geleijnse M.L., Ter Hoeve N. Effects of a dedicated cardiac rehabilitation program for patients with obesity on body weight, physical activity, sedentary behavior, and physical fitness: the OPTICARE XL randomized controlled trial. Phys Ther 2023; 103(9): pzad055, https://doi.org/10.1093/ptj/pzad055.
O'Connor F.K., Chen D., Sharma P., Adsett J., Hwang R., Roberts L., Bach A., Louis M., Morris N. Physiological responses to sit-to-stand and six-minute walk tests in heart failure: a randomised trial. Heart Lung Circ 2025; 34(8): 789–797, https://doi.org/10.1016/j.hlc.2025.03.002.
Büsching G., Schmid J.P. 6-minute walk test: exploring factors influencing perceived intensity in older patients undergoing cardiac rehabilitation-a qualitative study. Healthcare (Basel) 2025; 13(7): 735, https://doi.org/10.3390/healthcare13070735.