Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery?
Next Article in Journal
Gaps to Best Practices for Teleconsultations Performed by General Practitioners: A Descriptive Cross-Sectional Study
Previous Article in Journal
Determinants of Sleep Disorders and Occupational Burnout among Nurses: A Cross-Sectional Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery?

by
Teresa Angela Trunfio
1,*,
Anna Borrelli
2 and
Giovanni Improta
3,4
1
Department of Advanced Biomedical Sciences, University of Naples “Federico II”, 80131 Naples, Italy
2
“San Giovanni di Dio e Ruggi d’Aragona” University Hospital, 84121 Salerno, Italy
3
Department of Public Health, University of Naples “Federico II”, 80131 Naples, Italy
4
Interdepartmental Center for Research in Healthcare Management and Innovation in Healthcare (CIRMIS), University of Naples “Federico II”, 80131 Naples, Italy
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(10), 6219; https://doi.org/10.3390/ijerph19106219
Submission received: 28 March 2022 / Revised: 16 May 2022 / Accepted: 17 May 2022 / Published: 20 May 2022

Abstract

:
The proximal fracture of the femur and hip is the most common reason for hospitalization in orthopedic departments. In Italy, 115,989 hip-replacement surgeries were performed in 2019, showing the economic relevance of studying this type of procedure. This study analyzed the data relating to patients who underwent hip-replacement surgery in the years 2010–2020 at the “San Giovanni di Dio e Ruggi d’Aragona” University Hospital of Salerno. The multiple linear regression (MLR) model and regression and classification algorithms were implemented in order to predict the total length of stay (LOS). Lastly, using a statistical analysis, the impact of COVID-19 was evaluated. The results obtained from the regression analysis showed that the best model was MLR, with an R2 value of 0.616, compared with XGBoost, Gradient-Boosted Tree, and Random Forest, with R2 values of 0.552, 0.543, and 0.448, respectively. The t-test showed that the variables that most influenced the LOS, with the exception of pre-operative LOS, were gender, age, anemia, fracture/dislocation, and urinary disorders. Among the classification algorithms, the best result was obtained with Random Forest, with a sensitivity of the longest LOS of over 89%. In terms of the overall accuracy, Random Forest and Gradient-Boosted Tree achieved a value of 71.76% and an error of 28.24%, followed by Decision Tree, with an accuracy of 71.13% and an error of 28.87%, and, finally, Support Vector Machine, with an accuracy of 65.06% and an error of 34.94%. A significant difference in cardiovascular disease, fracture/dislocation, and post-operative LOS variables was shown by the chi-squared test and Mann–Whitney test in the comparison between 2019 (before COVID-19) and 2020 (in full pandemic emergency conditions).

1. Introduction

The proximal fracture of the femur and hip is the most common reason for hospitalization in orthopedic departments. Hip fractures put patients at risk of cardiovascular, pulmonary, thrombotic, infectious, and bleeding complications that can lead to death [1]. The only strategy to prevent immediate negative outcomes is to proceed in a timely manner with surgery. Despite the procedure, however, patients experience increased mortality, health complications, and reduced quality of life [2,3,4].
Although hip fractures account for less than 20% of all osteoporosis-associated fractures [5], considered second only to cardiovascular disease by the World Health Organization [6], they are often used as an indicator of the health of the population and to evaluate the economic impact of this condition. In fact, they account for the majority of morbidity-related and mortality-related health expenditure in men and women over the age of 50 [7,8]. Specifically, globally, 1.3 million fractures were reported in the year 1990, and this figure is estimated to reach 7–21 million by 2050, with an associated expenditure that will reach 9.8 billion USD in the United States and 650 million CAD in Canada [9,10]. These data are associated with the demographic trend, in recent years, of increasing life expectancy, which has changed the age profile of the population. For example, in Italy, the reference country for this study, an increase in life expectancy has been observed in recent years, reaching 79.7 years for men and 84.4 for women [11], with a consequent increase in chronic and degenerative diseases. In the country, about half of the population over 65 has degenerative pathologies of an arthritic nature, with a high impact on motor ability, thus making prosthetic interventions in the orthopedic field among the most frequently performed. In particular, an increase in hip-replacement surgeries (whose elective share amounts to about 2/3) the last five years was recorded, from 104,425 in 2015 to 115,989 in 2019 (+11.1%). In 2020, due to the COVID-19 pandemic, with containment measures such as lockdown and the blocking of elective surgery, there was a marked decrease in the number of cases (N = 96,822), which was quantifiable as 19,167 fewer hospitalizations (−16.5%) compared to the previous year, and a reduction compared to the previous trend figure, which reached 18% (a value corresponding to approximately 21 thousand fewer hospitalizations than expected) [12].
A health care process that involves an increasing number of patients and is transversal, especially when considering patients who are admitted for traumatic fractures [9], must involve effectiveness and efficiency controls to improve not only patient outcomes, but also to ensure the proper use of resources.
A widely used indicator in the literature is the length of stay (LOS). The LOS is an important performance indicator of hospital costs and management. An unnecessary increase in LOS, in addition to affecting resources, exposes patients to nosocomial infections and functional decline [13].
With this in mind, the following work intends to investigate the LOS of patients who underwent hip arthroplasty in the years 2010–2020 at “San Giovanni di Dio e Ruggi d’Aragona” University Hospital of Salerno (Italy). This study was born as an extension of a previous work [14], in which we analyzed a limited number of patients, included in this study, and a limited number of variables. The aim is to build a valid predictive model capable of determining the duration of bed occupancy, based on patients’ clinical and demographic variables, and understanding which are the main factors that influence the total LOS. Finally, the impact of COVID-19 on patients undergoing this procedure is analyzed.

Related Works

Several studies use advanced data processing in order to support doctors in the prevention, diagnosis, and treatment of diseases [15,16,17,18,19,20,21] or the management of hospital resources [22,23,24,25,26]. In the orthopedic field, many articles study the performance associated with the flow of patients who are admitted for fractures of the lower limbs. For example, Lefaivre et al. determined the effect of delayed surgery on discharge times, in-hospital death, the presence of major and minor medical complications, and the incidence of sores in hip fracture patients. Bracy et al. [27], on the other hand, showed how the institution of orthopedic–hospitalist comanagement (OHC) improves the efficiency of hip-fracture management, as measured by inpatient LOS and time to surgery [28]. Fisher et al. have shown how early mobilization helps reduce the total LOS [29]. With the aim of reducing the total LOS, Fast Tracks were born, a combination of clinical and organizational factors optimized to reduce convalescence and perioperative morbidity, including functional recovery with a consequent reduction in hospitalizations. Husted et al. highlighted the benefits of orthopedic Fast Track in Denmark [30].
Furthermore, in Italy, several studies were conducted to investigate the epidemiology of the problem [31,32] and the choice of prostheses [33], and to improve the process. Scala et al. analyzed how with a Lean Six Sigma approach, a reduction in the total LOS of 39% is achieved for patients admitted with fractures of the femur [34]. Latessa et al. instead used the same methodology to implement Fast Track, with a statistically significant reduction of 12.7% in the LOS [35]. Although there are studies at national and international level that use predictive algorithms for the study of the total LOS [36,37,38,39], there are no other studies in the literature that analyze hip fractures in a large number of patients, including multiple clinical variables and the impact of COVID-19. The hypothesis of this paper is that particular clinical conditions or patient demographics may have a significant impact on LOS and on which healthcare management needs to focus more, to achieve benefits including cost containment considerations. In addition, the COVID-19 pandemic, with all the protocols put in place, may have further affected the process under consideration.

2. Materials and Methods

This study analyzed the data relating to patients who underwent hip-replacement surgery in the years 2010–2020 at the “San Giovanni di Dio e Ruggi d’Aragona” University Hospital of Salerno (Italy). Specifically, all patients who had hip surgery as their primary procedure were selected, with the following ICD-9 codes:
  • 8151: total hip replacement,
  • 8152: partial hip replacement,
  • 8153: revision hip replacement
Using the hospital discharge forms, the following information was extracted for the 2515 patients included in the study:
  • Age,
  • Gender (Male/Female),
  • Date of admission, discharge, and principal procedure,
  • Main and secondary diagnoses,
Starting from this information, the following independent variables were obtained:
  • Gender,
  • Age,
  • Pre-Operative LOS,
  • Diabetes (yes/no),
  • Hypertension (yes/no),
  • Obesity (yes/no),
  • Anemia (yes/no),
  • Vitamin D deficiency (yes/no),
  • Tumor (yes/no),
  • Fracture/Dislocation (yes/no),
  • Brain disorders (yes/no),
  • Urinary disorders (yes/no),
  • Cardiovascular disease (yes/no),
  • Respiratory disease (yes/no),
  • Anticoagulant therapy (yes/no).
Our data, provided by the Hospital’s Health Department, are completely anonymous, and no personal information is linked or linkable to a specific person. The output is the total LOS in days obtained as the difference between the date of discharge and date of admission. All clinical variables were obtained by analyzing the main and secondary diagnoses reported in the discharge form. Therefore, without a detailed characterization of the clinical picture of each patient, the variables simply indicate the presence (1 Yes) or absence (0 No) of conditions related to that comorbidity. The variable Fracture/Luxation makes it possible to differentiate the proportion of elderly patients who underwent elective surgery from those who suffered a traumatic event.
Figure 1 shows the distribution of all the variables in the dataset.

2.1. Regression and Classification Models

The 15 variables defined above (i.e., gender, age, pre-operative LOS, diabetes, hypertension, obesity, anemia, vitamin D deficiency, tumor, fracture/dislocation, brain disorders, urinary disorders, cardiovascular disease, respiratory disease, and anticoagulant therapy) were used as inputs for the study of total LOS, i.e., the output. The first processing involved the implementation of the MLR model. To this end, IBM SPSS Statistics Version 26.0 software (IBM Corp., Armonk, NY, USA) was used. This software was also used to verify all the preliminary hypotheses on residuals, autocorrelation, the presence of outliers, and the multicollinearity. After this first processing, further regressive algorithms were used, i.e., Random Forest RF, Gradient-Boosted Tree GBT, XGBoost, and Linear Regression LR. RF is a supervised-learning algorithm in which multiple learning algorithms are combined to improve performance. Although it can produce an overfitting, the resulting model is accurate and powerful. GBT is a non-parametric statistical learning algorithm used for both classification and regression problems. As RF, the decision model produced is a set of simple forecasting models, typically decision trees, which are progressively added to each step to improve the result obtained by the previous Weak Learner. The Decision Tree (DT) is a tree-like decision model where the target value is predicted by simple decision rules identified from the data. DTs are simple to understand and require little data preparation, but its disadvantages include overfitting and the creation of biased trees if some classes dominate. XGBoost algorithm is a gradient-boosting algorithm, built through the progressive addition of decision trees in order to improve the performance of the previous tree. In addition, models are fitted using any arbitrary differentiable loss function and gradient descent optimization algorithm. This gives the technique its name, “gradient boosting”, since the loss gradient is minimized as the model is fit, in a similar manner to a neural network. LR is a model that assumes a linear relationship between output and input. Different techniques can be used to prepare or train the linear regression equation from data, the most common of which is called Ordinary Least Squares. Learning, in this case, means estimating the value to be attributed to the coefficients, starting from the available data. Next, the classification algorithms, i.e., Random Forest (RF), Decision Tree (DT), Gradient-Boosted Tree (GBT), and Support Vector Machine (SVM) were implemented. SVM algorithm finds a hyperplane in an N-dimensional space (N—the number of features) that has a maximum margin, i.e., the maximum distance between data points of both classes. To this end, a loss function is used. SVM is effective in high dimensional spaces but it does not directly provide probability estimates. The other algorithms are defined above. This second part was developed with Knime Analytics Platform. For all algorithms, the dataset was broken down into training set and test set, at 80% and 20%, respectively.

2.2. Statistical Analysis

To analyze the impact of COVID-19 on the sample under examination, two sub-groups were extracted:
  • Group 1: Patients discharged in 2019 and, therefore, before COVID-19.
  • Group 2: Patients discharged in 2020 in full pandemic.
Statistical tests were implemented to identify any differences in the two groups. Before proceeding with the selection of the statistical tests, the Kolmogorov–Smirnov test was performed which showed the non-normality of the two distributions. For this reason, the Mann–Whitney U (MW) and chi-squared test with a 95% confidence interval were used.

3. Results

Preliminary to the elaboration, the hypotheses underlying the implementation of the MLR model were verified. The Durbin–Watson test had an output of 1.934. The test always has a value ranging between 0 and 4. A value of 2.0 indicated that there was no autocorrelation detected in the sample. Continuing with the analysis of the residuals, from the graph showing “standardized expected value regression” on the x-axis against “standardized residual regression”, shown in Figure 2, a random distribution around zero was observed, which supported the hypothesis of homoscedasticity. The residuals therefore had a constant variance.
Concluding the residual analysis, the Quartile–Quartile plot (Q–Q plot) presented in Figure 3 was used to evaluate the distribution trend. If the two sets came from a population with the same distribution, the points were expected to fall approximately along this reference line. The greater the departure from this reference line, the greater the evidence for the conclusion that the two data sets came from populations with different distributions.
Although the curve did not exactly retrace the ideal line, the slight variation did not affect the good performance of the model.
Before implementing the model, the absence of multicollinearity was tested using the Pearson correlation and the tolerance and variance inflation factor (VIF), while the presence of outliers was determined through the calculation of Cook’s distance. Table 1 shows the results of the Pearson correlation.
The results of the Pearson correlation showed that the LOS had the highest correlation with the pre-operative LOS, included by definition in LOS, while for the other variables, the correlation was always lower than 0.7.
For the tolerance and VIF, the former always assumed a value greater than 0.2, while the latter was always less than 10, suggesting the absence of multicollinearity. Lastly, Cook’s distance was always less than 1.
Having verified the hypotheses, the MLR model was implemented.
Table 2 shows an R2 value just above the 0.5 threshold, showing that it was quite representative of the specific case study. Table 3 shows the details of the coefficients and the t-test applied to the variables with a significance of 95%.
The results of the t-test highlighted that gender, age, pre-operative LOS, anemia, fracture/dislocation, and urinary disorders were significantly correlated with the total LOS. Standardized coefficients help to compare the effect of each individual independent variable to the dependent variable. In this case, assuming the value 0 when comorbidities were absent, a patient with anemia conditioned the dependent variable more by having the highest beta coefficient associated with it, if the pre-operative LOS was excluded. In addition, according to the beta column, women (gender: 1 male/2 female) with advanced age, as this was a continuous variable, significantly influenced the dependent variable of the model.
In addition to the MLR model, further regression algorithms were tested. Table 4 shows the results obtained in terms of R2 and root mean squared error.
Among the algorithms, XGBoost and LR had the best performance, with an R2 value of 0.552, followed by GBT, with 0.543, and, finally, RF, with 0.448. However, even the best value of R2, obtained with XGBoost/LR, did not improve the performance of the MLR model. The results obtained with the best algorithms used are shown in graphic form in Figure 4 and Figure 5.
After the regression models, four different classification algorithms were tested. For implementation, the LOS was divided into three categories, as indicated below:
  • LOS ≤ 6 days.
  • 6 days < LOS ≤ 12 days.
  • LOS > 12 days.
Table 5 shows the results obtained.
With an accuracy of 71.76% and an error of 28.24%, RF and GBT had the best performance, followed by DT, with an accuracy of 71.13% and an error of 28.87%, and, finally, SVM, with an accuracy of 65.06% and an error of 34.94%. For all the algorithms, optimal results were not achieved in all three categories. The results, however, showed a high ability to predict longer LOS, which weigh heavily on healthcare costs. The details of the classification for the best algorithm are shown in Table 6.
To analyze the global feature importance, a Global Surrogate Random Forest was used. Global Surrogate Random Forest is a Random Forest model trained to approximate the predictions of already implemented RF models. Random Forest is trained on standard pre-processed input data with optimized parameters “tree depth”, “number of models,” and “minimum child node size”. The surrogate model was trained successfully. Specifically, focusing on class 3, that is, the one to which the longest stay corresponded, which was the one that was of greatest relevance to health management, the model returned an accuracy of 0.942, and the overall significance characteristic shown in Figure 6.
Among the variables that most affected the model from class 3, in accordance with the specific procedure analyzed, excluding the pre-operative LOS, were age, fracture/dislocation and vitamin D deficiency. Gender, anemia, and urinary disorders, which in the MLR model were significantly related to total hospitalization, in this case, had a non-significant impact and were included in the variable, other.
Lastly, the impact of COVID-19 on the model parameters was analyzed. Specifically, the pre-COVID-19 (year 2019) and during-COVID-19 (year 2020) data were compared using statistical analysis. The results are reported in Table 7.
The statistical tests highlighted a significant difference in cardiovascular disease, fracture/dislocation, and post-operative LOS.

4. Discussion

In this study, a set of variables was analyzed in order to be able to predict the LOS for hip-replacement surgery. The analysis was conducted at “San Giovanni di Dio e Ruggi d’Aragona” University Hospital of Salerno (Italy), analyzing the data recorded from 2010 to 2020.

4.1. Results of Regression and Classification Models

This work is an extension of a previous work, published by the same research group, in which MLR and ML algorithms were used to investigate the LOS only for the years 2019–2020 [14]. Using this previous article as a reference, the same tools were used in this study. The results obtained for the regression models showed that the best was MLR, with an R2 value of 0.616, which was slightly lower than the previous result, of 0.687. The model was therefore quite representative of the case study in which it was implemented. The statistical test instead showed that the variables that most influence the model, with the exception of the pre-operative LOS, which by, definition depends on it, were gender, age, anemia, fracture/dislocation, and urinary disorders. This result was in line with those previously reported in the literature. For example, Ricci et al. [40] and Latessa et al. [35] highlighted a different LOS according to gender, while Scala et al. [34] showed an influence of cardiovascular diseases. Husted et al. [41], on the other hand, showed that age, sex, comorbidity, and pre- and post-operative hemoglobin levels influence post-operative outcomes in general, including LOS and patient satisfaction, while Calgue et al. [42] showed that significant effects are also due to the type of fracture.
The classification models did not show significant results for the three categories envisaged by the work. With an accuracy of 71.76% and an error of 28.24%, RF and GBT had the best performance, which did not reach the accuracy of over 83% obtained by GBT in [14]. Although the model as a whole could not be validated, the confusion matrix showed the high capacity of the model in predicting cases with LOS greater than 14 days. This is strategically important for healthcare facilities, as these are the cases that have the greatest impact on resource consumption and healthcare costs.

4.2. COVID-19’s Impact

The impact of the SARS-CoV-2 pandemic on the sample was analyzed. Comparing the same variables for the year 2019 (pre-COVID-19) and the year 2020 (during COVID-19), the statistical tests highlighted a significant difference in terms of cardiovascular disease, fracture/dislocation, and post-operative LOS. In particular, there was an increase in the number of patients undergoing surgery with cardiovascular comorbidities or a diagnosis of fracture/dislocation. This, unlike the results reported in the literature [35,41,42], did not cause an increase in postoperative LOS, which actually decreased. This phenomenon can be explained both by the protocols put in place to contain the pandemic and limit the time spent in hospital and by the reduced number of beds, which were mostly dedicated to COVID patients.

4.3. Uniqueness of the Present Study, Clinical Implications, and Limitations

The strength of the work is that it considers a large number of data and variables that help to further characterize the sample, also including the changes caused by the pandemic. The ability to understand which variables have the greatest impact on the LOS can help healthcare managers to allocate resources or implement specific pathways, such as fast tracks [30], for privileged access to treatment and the elimination of inefficiencies.
However, this work is not without limitations. In particular, the effect that multiple procedures have on LOS is not considered, and the results cannot be generalized, since this is a single-center study. In addition, variables that could be used to analyze the socioeconomic status of the patients were not included, and the data source, hospital discharge records, did not allow the precise characterization of the degree of severity of the comorbidities studied.

5. Conclusions

In this study, the data of 2515 patients undergoing hip-replacement surgery at “San Giovanni di Dio e Ruggi d’Aragona” University Hospital of Salerno (Italy) in the years 2010–2020 were processed using regression and classification models. Both elaborations showed that the variables that most influenced the LOS were age and the presence of fracture/dislocation. These results, together with the good performance of the models, could be used by healthcare managers to create specific pathways, according to the age or the main diagnoses that lead to interventions. This can help both bed management, through LOS prediction and turnover planning, but also all other hospital resources. The analysis of the impact of COVID-19, therefore, could be an important pointer to capture the inadvertent positive effects of the pandemic from an organizational perspective, such as the establishment of specific protocols that led to the effective and efficient use of hospital facilities.
Future developments will include the implementation of additional data processing and classification techniques, focusing in more detail on patients’ pathways and how they have changed due to the pandemic. Furthermore, additional variables will be included in the models in addition to the specific characterization of those already provided.

Author Contributions

Conceptualization, G.I. and A.B.; methodology, G.I.; software, T.A.T.; validation, G.I. and A.B.; formal analysis, T.A.T.; investigation, T.A.T.; resources, T.A.T.; data curation, T.A.T.; writing—original draft preparation, T.A.T.; writing—review and editing, G.I. and A.B.; visualization, G.I. and A.B.; supervision, G.I. and A.B.; project administration, G.I. and A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

In compliance with the Declaration of Helsinki and with the Italian Legislative Decree 211/2003, Implementation of the 2001/20/CE directive, since no patients/children were involved in the study, a signed informed consent form and ethical approval are not mandatory for this study. Furthermore, in compliance with the regulations of the Italian National Institute of Health, our study is not reported among those needing assessment by the Ethical Committee of the Italian National Institute of Health.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and/or analyzed during the current study are not publicly available for privacy reasons, but they are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. LeBlanc, E.S.; Hillier, T.A.; Pedula, K.L.; Rizzo, J.H.; Cawthon, P.M.; Fink, H.A.; Cauley, J.A.; Bauer, D.C.; Black, D.M.; Cummings, S.R.; et al. Hip fracture and increased short-term but not long-term mortality in healthy older women. Arch. Intern. Med. 2011, 171, 1831–1837. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Barrett, J.A.; Baron, J.A.; Beach, M.L. Mortality and pulmonary embolism after fracture in the elderly. Osteoporos. Int. 2003, 14, 889–894. [Google Scholar] [CrossRef] [PubMed]
  3. Johnell, O.; Kanis, J.A.; Oden, A.; Sernbo, I.; Redlund-Johnell, I.; Petterson, C.; De Laet, C.; Jonsson, B. Mortality after osteoporotic fractures. Osteoporos. Int. 2004, 15, 38–42. [Google Scholar] [CrossRef] [PubMed]
  4. Empana, J.P.; Dargent-Molina, P.; Breart, G. Effect of hip fracture on mortality in elderly women: The EPIDOS Prospective Study. J. Am. Geriatr. Soc. 2004, 52, 685–690. [Google Scholar] [CrossRef]
  5. Ström, O.; Borgström, F.; Kanis, J.A.; Compston, J.; Cooper, C.; McCloskey, E.V.; Jönsson, B. Osteoporosis: Burden, health care provision and opportunities in the EU. A report prepared in collaboration with the International Osteoporosis Foundation (IOF) and the European Federation of Pharmaceutical Industry Associations (EFPIA). Arch. Osteoporos. 2011, 6, 59–155. [Google Scholar] [CrossRef]
  6. Kanis, J.A.; McCloskey, E.V.; Johansson, H.; Cooper, C.; Rizzoli, R.; Reginster, J.Y. European guidance for the diagnosis and management of osteoporosis in postmenopausal women. Osteoporos. Int. 2008, 19, 399–428. [Google Scholar] [CrossRef] [Green Version]
  7. Kanis, J.A.; Johnell, O. Requirements for DXA for the management of osteoporosis in Europe. Osteoporos. Int. 2005, 16, 229–238. [Google Scholar] [CrossRef]
  8. Kanis, J.A. Assessment of Osteoporosis at the Primary Health-Care Level; Technical Report; WHO Collaborating Centre, University of Sheffield: Sheffield, UK, 2008; Available online: http://www.shef.ac.uk/FRAX/index.htm (accessed on 28 February 2022).
  9. Parker, M.; Antony, J. Hip fracture. BMJ 2006, 333, 27–30. [Google Scholar] [CrossRef]
  10. Bhandari, M.; Swiontkowski, M. Management of acute hip fracture. N. Engl. J. Med. 2017, 377, 2053–2062. [Google Scholar] [CrossRef]
  11. istat.it [webpage on the Internet]. Annual Italian Statistics 2020; Istituto Nazionale di Statistica: Roma, Italy, 2020; Available online: http://dati.istat.it/ (accessed on 22 November 2021).
  12. Programma Nazionale Esiti—Edizione 2021. Report PNE. 2021. Available online: https://pne.agenas.it/ (accessed on 28 February 2022).
  13. Moore, L.; Stelfox, H.T.; Turgeon, A.F.; Nathens, A.B.; Lavoie, A.; Émond, M.; Bourgeois, G.; Neveu, X. Derivation and Validation of a Quality Indicator of Acute Care Length of Stay to Evaluate Trauma Care. Ann. Surg. 2014, 260, 1121–1127. [Google Scholar] [CrossRef]
  14. Ponsiglione, C.; Angela Trunfio, T.; Bruno, F.; Borrelli, A. Regression and Machine Learning analysis to predict the length of stay in patients undergoing hip replacement surgery. In Proceedings of the 2021 International Symposium on Biomedical Engineering and Computational Biology (BECB 2021), New York, NY, USA, 13–15 August 2021; pp. 1–5. [Google Scholar] [CrossRef]
  15. Scala, A.; Loperto, I.; Carrano, R.; Federico, S.; Triassi, M.; Improta, G. Assessment of proteinuria level in nephrology patients using a machine learning approach. In Proceedings of the 2021 5th International Conference on Medical and Health Informatics (ICMHI 2021), New York, NY, USA, 14–16 May 2021; pp. 13–16. [Google Scholar] [CrossRef]
  16. Cesarelli, M.; Romano, M.; Bifulco, P.; Improta, G.; D’Addio, G. An application of symbolic dynamics for FHRV assessment. Stud. Health Technol. Inform. 2012, 180, 123–127. [Google Scholar] [PubMed]
  17. Rosa, D.; Balato, G.; Ciaramella, G.; Soscia, E.; Improta, G.; Triassi, M. Long-term clinical results and MRI changes after autologous chondrocyte implantation in the knee of young and active middle aged patients. J. Orthop. Traumatol. 2016, 17, 55–62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Santini, S.; Pescapé, A.; Valente, A.S.; Abate, V.; Improta, G.; Triassi, M.; Ricchi, P.; Filosa, A. Using fuzzy logic for improving clinical daily-care of β-thalassemia patients. In Proceedings of the 2017 IEEE International Conference, Fuzzy Systems (FUZZ-IEEE), Naples, Italy, 9–12 July 2017; pp. 1–6. [Google Scholar]
  19. Raiola, E.; Triassi, M.; Improta, G.; Di Cicco, M.V.; Montella, E.; Ferraro, A.; Cerchione, R.; Centobelli, P. Implementation of lean practices to reduce healthcare associated infections. Int. J. Health Technol. Manag. 2020, 18, 51. [Google Scholar] [CrossRef]
  20. Ponsiglione, A.M.; Cesarelli, G.; Amato, F.; Romano, M. Optimization of an artificial neural network to study accelerations of foetal heart rhythm. In Proceedings of the 2021 IEEE 6th International Forum on Research and Technology for Society and Industry (RTSI), Naples, Italy, 6–9 September 2021; pp. 159–164. [Google Scholar] [CrossRef]
  21. Ponsiglione, A.M.; Romano, M.; Amato, F. A Finite-State Machine Approach to Study Patients Dropout from Medical Examinations. In Proceedings of the 2021 IEEE 6th International Forum on Research and Technology for Society and Industry (RTSI), Naples, Italy, 6–9 September2021; pp. 289–294. [Google Scholar] [CrossRef]
  22. Cesarelli, G.; Scala, A.; Vecchione, D.; Ponsiglione, A.M.; Guizzi, G. An Innovative Business Model for a Multi-echelon Supply Chain Inventory Management Pattern. In Journal of Physics: Conference Series, Proceedings of the 2020 International Symposium on Automation, Information and Computing (ISAIC 2020), Beijing, China, 2–4 December 2020; IOP Publishing: Bristol, UK, 2021; Volume 1828, p. 1828. [Google Scholar]
  23. Improta, G.; Scala, A.; Trunfio, T.A.; Guizzi, G. Application of Supply Chain Management at Drugs Flow in an Italian Hospital District. In Journal of Physics: Conference Series, Proceedings of the 2020 International Symposium on Automation, Information and Computing (ISAIC 2020), Beijing, China, 2–4 December 2020; IOP Publishing: Bristol, UK, 2021; Volume 1828, p. 012081. [Google Scholar] [CrossRef]
  24. Di Laura, D.; D’Angiolella, L.; Mantovani, L.; Squassabia, G.; Clemente, F.; Santalucia, I.; Improta, G.; Triassi, M. Efficiency measures of emergency departments: An Italian systematic literature review. BMJ Open Qual. 2021, 10, e001058. [Google Scholar] [CrossRef]
  25. Improta, G.; Luciano, M.A.; Vecchione, D.; Cesarelli, G.; Rossano, L.; Santalucia, I.; Triassi, M. Management of the Diabetic Patient in the Diagnostic Care Pathway. In IFMBE Proceedings, Proceedings of the 8th European Medical and Biological Engineering Conference, Portorož, Slovenia, 29 November 29–3 December 2020; Jarm, T., Cvetkoska, A., Mahnič-Kalamiza, S., Miklavcic, D., Eds.; Springer: Cham, Switzerland, 2020; Volume 80. [Google Scholar] [CrossRef]
  26. Improta, G.; Ponsiglione, A.M.; Parente, G.; Romano, M.; Cesarelli, G.; Rea, T.; Russo, M.; Triassi, M. Evaluation of Medical Training Courses Satisfaction: Qualitative Analysis and Analytic Hierarchy Process. In Proceedings of the European Medical and Biological Engineering Conference, Portorož, Slovenia, 29 November–3 December 2020; Springer: Cham, Switzerland, 2020; pp. 518–526. [Google Scholar]
  27. Lefaivre, K.A.; Macadam, S.A.; Davidson, D.J.; Gandhi, R.; Chan, H.; Broekhuyse, H.M. Length of stay, mortality, morbidity and delay to surgery in hip fractures. J. Bone Jt. Surgery. 2009, 91, 922–927. [Google Scholar] [CrossRef]
  28. Bracey, D.N.; Kiymaz, T.C.; Holst, D.C.; Hamid, K.S.; Plate, J.F.; Summers, E.C.; Emory, C.L.; Jinnah, R.H. An Orthopedic-Hospitalist Comanaged Hip Fracture Service Reduces Inpatient Length of Stay. Geriatr. Orthop. Surg. Rehabil. 2016, 7, 171–177. [Google Scholar] [CrossRef]
  29. Fisher, S.R. Early Ambulation and Length of Stay in Older Adults Hospitalized for Acute Illness. Arch. Intern. Med. 2010, 170, 1942–1943. [Google Scholar] [CrossRef] [Green Version]
  30. Husted, H.; Jensen, C.M.; Solgaard, S.; Kehlet, H. Reduced length of stay following hip and knee arthroplasty in Denmark 2000–2009: From research to implementation. Arch. Orthop. Trauma. Surg. 2011, 132, 101–104. [Google Scholar] [CrossRef]
  31. Piscitelli, P.; Gimigliano, F.; Gatto, S.; Marinelli, A.; Chitano, G.; Greco, M.; Di Paola, L.; Sbenaglia, E.; Benvenuto, M.; Muratore, M.; et al. Hip fractures in Italy: 2000–2005 extension study. Osteoporos. Int. 2009, 21, 1323–1330. [Google Scholar] [CrossRef]
  32. Piscitelli, P.; Iolascon, G.; Gimigliano, F.; Muratore, M.; Camboa, P.; Borgia, O.; Forcina, B.; Fitto, F.; Robaud, V.; Termini, G.; et al. Incidence and costs of hip fractures compared to acute myocardial infarction in the Italian population: A 4-year survey. Osteoporos. Int. 2006, 18, 211–219. [Google Scholar] [CrossRef] [Green Version]
  33. Latessa, I.; Ricciardi, C.; Jacob, D.; Jónssonr, H., Jr.; Gambacorta, M.; Improta, G.; Gargiulo, P. Health technology assessment through Six Sigma Methodology to assess cemented and uncemented protheses in total hip arthroplasty. Eur. J. Transl. Myol. 2021, 31, 9651. [Google Scholar] [CrossRef] [PubMed]
  34. Scala, A.; Ponsiglione, A.; Loperto, I.; Della Vecchia, A.; Borrelli, A.; Russo, G.; Triassi, M.; Improta, G. Lean Six Sigma Approach for Reducing Length of Hospital Stay for Patients with Femur Fracture in a University Hospital. Int. J. Environ. Res. Public Health 2021, 18, 2843. [Google Scholar] [CrossRef] [PubMed]
  35. Latessa, I.; Fiorillo, A.; Picone, I.; Balato, G.; Trunfio, T.A.; Scala, A.; Triassi, M. Implementing fast track surgery in hip and knee arthroplasty using the lean Six Sigma methodology. TQM J. 2021, 33, 131–147. [Google Scholar] [CrossRef]
  36. Ramkumar, P.N.; Navarro, S.M.; Haeberle, H.S.; Karnuta, J.M.; Mont, M.A.; Iannotti, J.P.; Patterson, B.M.; Krebs, V.E. Development and validation of a machine learning algorithm after primary total hip arthroplasty: Applications to length of stay and payment models. J. Arthroplast. 2019, 34, 632–637. [Google Scholar] [CrossRef] [PubMed]
  37. Johannesdottir, K.B.; Kehlet, H.; Petersen, P.B.; Aasvang, E.K.; Sørensen, H.B.D.; Jørgensen, C.C. Machine learning classifiers do not improve prediction of hospitalization > 2 days after fast-track hip and knee arthroplasty compared with a classical statistical risk model. Acta Orthop. 2022, 93, 117–123. [Google Scholar] [CrossRef] [PubMed]
  38. Menzies, I.B.; Mendelson, D.A.; Kates, S.L.; Friedman, S.M. The impact of comorbidity on perioperative outcomes of hip fractures in a geriatric fracture model. Geriatr. Orthop. Surg. Rehabil. 2012, 3, 129–134. [Google Scholar] [CrossRef] [Green Version]
  39. Colella, Y.; Scala, A.; De Lauri, C.; Bruno, F.; Cesarelli, G.; Ferrucci, G.; Borrelli, A. Studying variables affecting the length of stay in patients with lower limb fractures by means of Machine Learning. In Proceedings of the 2021 5th International Conference on Medical and Health Informatics, Kyoto, Japan, 14–16 May 2021. [Google Scholar]
  40. Ricci, W.M.; Brandt, A.; McAndrew, C.; Gardner, M.J. Factors Affecting Delay to Surgery and Length of Stay for Patients with Hip Fracture. J. Orthop. Trauma 2015, 29, e109–e114. [Google Scholar] [CrossRef] [Green Version]
  41. Husted, H.; Holm, G.; Jacobsen, S. Predictors of length of stay and patient satisfaction after hip and knee replacement surgery: Fast-track experience in 712 patients. Acta Orthop. 2008, 79, 168–173. [Google Scholar] [CrossRef]
  42. Clague, J.E.; Craddock, E.; Andrew, G.; Horan, M.A.; Pendleton, N. Predictors of outcome following hip fracture. Admission time predicts length of stay and in-hospital mortality. Injury 2002, 33, 1–6. [Google Scholar] [CrossRef]
Figure 1. Distribution of the features in the dataset.
Figure 1. Distribution of the features in the dataset.
Ijerph 19 06219 g001
Figure 2. Homoscedasticity of the data.
Figure 2. Homoscedasticity of the data.
Ijerph 19 06219 g002
Figure 3. Q–Q plot.
Figure 3. Q–Q plot.
Ijerph 19 06219 g003
Figure 4. Linear Regression.
Figure 4. Linear Regression.
Ijerph 19 06219 g004
Figure 5. XGBoost.
Figure 5. XGBoost.
Ijerph 19 06219 g005
Figure 6. Global importance Feature.
Figure 6. Global importance Feature.
Ijerph 19 06219 g006
Table 1. Pearson correlation.
Table 1. Pearson correlation.
LOSGenderAgePre-Operative LOSDiabetesHypertensionObesityAnemiaVitamin D DeficiencyTumorFracture/DislocationBrain DisordersUrinary DisordersCardiovascular DiseaseRespiratory DiseaseAnticoagulant Therapy
Pearson CorrelationLOS1.0000.0540.1370.772−0.027−0.104−0.0230.049−0.0540.0690.248−0.0090.0460.1090.0240.002
Gender0.0541.0000.182−0.010−0.0080.0800.0290.1040.040−0.0080.0550.011−0.035−0.016−0.085−0.029
Age0.1370.1821.000.0880.0600.189−0.0180.1260.115−0.0050.0950.1190.0770.2180.0540.064
Pre-operative LOS0.772−0.0100.0881.000−0.064−0.161−0.022−0.064−0.1010.0720.260−0.0190.0050.078−0.002−0.008
Diabetes−0.027−0.0080.060−0.0641.0000.202−0.0200.0900.0360.052−0.0240.0280.0330.0660.0790.068
Hypertension−0.1040.0800.189−0.1610.2021.0000.0620.1740.130−0.039−0.1420.0580.0620.1770.1120.066
Obesity−0.0230.029−0.018−0.022−0.0200.0621.0000.007−0.011−0.006−0.031−0.0190.0280.004−0.0140.031
Anemia0.0490.1040.126−0.0640.0900.1740.0071.0000.1540.033−0.0290.0900.0890.0630.0550.066
Vitamin D deficiency−0.0540.0400.115−0.1010.0360.130−0.0110.1541.0000.001−0.0520.1250.0050.0720.0830.024
Tumor0.069−0.008−0.0050.0720.052−0.039−0.0060.0330.0011.0000.0170.0040.0240.0420.105−0.018
Fracture/Dislocation0.2480.0550.0950.260−0.024−0.142−0.031−0.029−0.0520.0171.000−0.041−0.0190.202−0.042−0.050
Brain disorders−0.0090.0110.119−0.0190.0280.058−0.0190.0900.1250.004−0.0411.000−0.0180.0400.0380.014
Urinary disorders0.046−0.0350.0770.0050.00330.0620.0280.0890.0050.024−0.019−0.0181.0000.0670.0080.027
Cardiovascular disease0.109−0.0160.2180.0780.0660.1770.0040.0630.0720.0420.2020.0400.0671.0000.0400.183
Respiratory disease0.024−0.0850.054−0.0020.0790.112−0.0140.0550.0830.105−0.0420.0380.0080.0401.0000.025
Anticoagulant therapy0.002−0.0290.064−0.0080.0680.0660.0310.0660.024−0.018−0.0500.0140.0270.1830.0251.000
Sig. (1-tailed)LOS0.0030.0000.0000.0890.0000.1200.0070.0030.0000.0000.3260.0110.0000.1100.465
Gender0.0030.0000.3080.3410.0000.0710.0000.0230.3400.0030.2840.0400.2180.0000.071
Age0.0000.0000.0000.0010.0000.1900.0000.0000.4020.0000.0000.0000.0000.0040.001
Pre-operative LOS0.0000.3080.0000.0010.0000.1320.0010.0000.0000.0000.1770.3940.0000.4510.352
Diabetes0.0890.3410.0010.0010.0000.1600.0000.0360.0050.1170.0820.0480.0000.0000.000
Hypertension0.0000.0000.0000.0000.0000.0010.0000.0000.0260.0000.0020.0010.0000.0000.000
Obesity0.1200.0710.1900.1320.1600.0010.3540.2890.3730.0600.1690.0830.4210.2350.060
Anemia0.0070.0000.0000.0010.0000.0000.3540.0000.0500.0760.0000.0000.0010.0030.000
Vitamin D deficiency0.0030.0230.0000.0000.0360.0000.2890.0000.4820.0050.0000.3920.0000.0000.114
Tumor0.0000.3400.4020.0000.0050.0260.3730.0500.4820.1940.4200.1180.0170.0000.183
Fracture/dislocation0.0000.0030.0000.0000.1170.0000.0600.0760.0050.1940.0210.1660.0000.0170.006
Brain disorders0.3260.2840.0000.1770.0820.0020.1690.0000.0000.4200.0210.1890.0220.0280.236
Urinary disorders0.0110.0400.0000.3940.0480.0010.0830.0000.3920.1180.1660.1890.0000.3520.090
Cardiovascular disease0.0000.2180.0000.0000.0000.0000.4210.0010.0000.0170.0000.0220.0000.0220.000
Respiratory disease0.1100.0000.0040.4510.0000.0000.2350.0030.0000.0000.0170.0280.3520.0220.107
Anticoagulant therapy0.4650.0710.0010.3520.0000.0000.0600.0000.1140.1830.0060.2360.0900.0000.107
Table 2. Multiple linear regression model.
Table 2. Multiple linear regression model.
RR2Adjusted R2Std. Error of the Estimate
Model0.7850.6160.6133.726
Table 3. Coefficients of MLR model.
Table 3. Coefficients of MLR model.
Unstandardized CoefficientsStandardized Coefficientstp-Value
BStd. ErrorBeta
(Constant)4.4050.522-8.4420.000
Gender0.6090.1620.0483.7620.000
Age0.0200.0070.0402.9600.003
Pre-operative LOS1.0110.0170.76057.9080.000
Diabetes0.2210.2570.0110.8620.389
Hypertension−0.1660.178−0.013−0.9330.351
Obesity−0.6241.250−0.006−0.4990.618
Anemia1.1300.1730.0846.5370.000
Vitamin D deficiency0.1270.4300.0040.2950.768
Tumor0.3280.7050.0060.4650.642
Fracture/dislocation0.5930.1960.0403.0200.003
Brain disorders−0.1590.261−0.008−0.6100.542
Urinary disorders1.1150.4330.0322.5720.010
Cardiovascular disease0.3480.1760.0271.9830.048
Respiratory disease0.6320.3350.0241.8880.059
Anticoagulant therapy−0.1160.470−0.003−0.2480.804
Table 4. Results of regression algorithms.
Table 4. Results of regression algorithms.
LRRFGBTXGBoost
R20.5520.4480.5430.552
Root mean squared error3.8434.4973.8833.843
Table 5. Performance metrics of all selected algorithms.
Table 5. Performance metrics of all selected algorithms.
Performance MetricsClassDTGBTRFSVM
Accuracy (%)Overall71.1371.7671.7665.06
Error (%)Overall28.8728.2428.2434.94
Precision (%)165.3569.4955.0463.46
261.5860.9380.6861.29
389.1989.6675.1467.69
Sensitivity (%)164.3463.5776.3476.74
271.0274.4359.1732.39
376.3075.1489.6689.60
Specificity (%)187.3989.6884.9483.67
274.1772.1985.7188.08
394.7595.0887.0975.74
F-measure (%)164.8466.4063.9669.47
265.9667.0168.2742.38
382.2481.7681.7677.11
Table 6. Random Forest confusion matrix.
Table 6. Random Forest confusion matrix.
Real/Predicted123
171202
25714241
3114130
Table 7. Analysis of COVID-19 impact.
Table 7. Analysis of COVID-19 impact.
Variable2019
N = 272
2020
N = 185
p-Value
Age
Mean77.7678.220.800
Gender
Male88590.918
Female184126
Pre-operative LOS
Mean3.053.140.066
Post-operative LOS
Mean7.707.090.040
Diabetes
No2331550.582
Yes3930
Hypertension
No1591010.413
Yes11384
Anemia
No1681170.749
Yes10468
Obesity
No2681850.098
Yes40
Vitamin D deficiency
No2251540.884
Yes4731
Tumor
No2641800.880
Yes85
Fracture/dislocation
No2621420.000
Yes1043
Brain disorders
No2181550.325
Yes5430
Urinary disorders
No2611770.883
Yes118
Cardiovascular disease
No1921010.000
Yes8084
Anticoagulant therapy
No2571780.396
Yes157
Respiratory disease
No2431740.080
Yes2911
LOS
Mean10.7510.220.240
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Trunfio, T.A.; Borrelli, A.; Improta, G. Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery? Int. J. Environ. Res. Public Health 2022, 19, 6219. https://doi.org/10.3390/ijerph19106219

AMA Style

Trunfio TA, Borrelli A, Improta G. Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery? International Journal of Environmental Research and Public Health. 2022; 19(10):6219. https://doi.org/10.3390/ijerph19106219

Chicago/Turabian Style

Trunfio, Teresa Angela, Anna Borrelli, and Giovanni Improta. 2022. "Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery?" International Journal of Environmental Research and Public Health 19, no. 10: 6219. https://doi.org/10.3390/ijerph19106219

APA Style

Trunfio, T. A., Borrelli, A., & Improta, G. (2022). Is It Possible to Predict the Length of Stay of Patients Undergoing Hip-Replacement Surgery? International Journal of Environmental Research and Public Health, 19(10), 6219. https://doi.org/10.3390/ijerph19106219

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop