Abstract
Effective disease prediction based on electronic health records (EHR) is an important topic in health informatics. The current methods usually use common deep-learning models for disease prediction. However, it is difficult to fully learn the graphic encounter structure of EHR to improve prediction performance. Moreover, in prediction tasks, chronic kidney disease (CKD) has a poor prognosis due to excessive risk factors and complex comorbidities. Therefore, we propose a CKD onset prediction model called Graph Convolution Synthetic Transformer (GCST) based on EHR, using Fusion Attention Mechanism to solve these challenges. By modifying Transformer, GCST uses Factorized Dense Attention and Medical Local Attention to learn global and local attention, generating Synthetic Attention to learn the potential encounter structure and meaningful medical knowledge of EHR. In addition, we also propose a transfer learning strategy based on sample weighted correction to guide the prediction of GCST in specific low-resource EHR. We conduct sufficient experiments on three datasets to test the performance of GCST. Experiments show that GCST has significant improvement over state-of-the-art models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bastarache, L.: Using phecodes for research with the electronic health record: from PheWAS to PheRS. Ann. Rev. Biomed. Data Sci. 4, 1–19 (2021)
Che, Z., Kale, D., Li, W., Bahadori, M.T., Liu, Y.: Deep computational phenotyping. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 507–516 (2015)
Chen, Y., Ma, T., Yang, X., Wang, J., Song, B., Zeng, X.: Muffin: multi-scale feature fusion for drug-drug interaction prediction. Bioinformatics 37(17), 2651–2658 (2021)
Choi, E., Bahadori, M.T., Song, L., Stewart, W.F., Sun, J.: Gram: graph-based attention model for healthcare representation learning. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 787–795 (2017)
Choi, E., Bahadori, M.T., Sun, J., Kulas, J., Schuetz, A., Stewart, W.: Retain: an interpretable predictive model for healthcare using reverse time attention mechanism. In: Advances in Neural Information Processing Systems 29 (2016)
Choi, E., Xiao, C., Stewart, W., Sun, J.: Mime: multilevel medical embedding of electronic health records for predictive healthcare. In: Advances in Neural Information Processing Systems 31 (2018)
Choi, E., et al.: Learning the graphical structure of electronic health records with graph convolutional transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 606–613 (2020)
Cui, L., Biswal, S., Glass, L.M., Lever, G., Sun, J., Xiao, C.: Conan: complementary pattern augmentation for rare disease detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 614–621 (2020)
Ding, Y., Tang, J., Guo, F.: Identification of drug-side effect association via multiple information integration with centered kernel alignment. Neurocomputing 325, 211–224 (2019)
Ghassemi, M., Naumann, T., Schulam, P., Beam, A.L., Chen, I.Y., Ranganath, R.: A review of challenges and opportunities in machine learning for health. AMIA Summits Transl. Sci. Proc. 2020, 191 (2020)
Jagannatha, A.N., Yu, H.: Bidirectional RNN for medical event detection in electronic health records. In: Proceedings of the Conference. Association for Computational Linguistics. North American Chapter. Meeting, vol. 2016, p. 473. NIH Public Access (2016)
Kalantar-Zadeh, K., Jafar, T.H., Nitsch, D., Neuen, B.L., Perkovic, V.: Chronic kidney disease. Lancet 398(10302), 786–802 (2021)
Khope, S.R., Elias, S.: Critical correlation of predictors for an efficient risk prediction framework of ICU patient using correlation and transformation of MIMIC-III dataset. Data Sci. Eng. 7(1), 71–86 (2022). https://doi.org/10.1007/s41019-022-00176-6
Lee, C.Y., Chen, Y.P.P.: Prediction of drug adverse events using deep learning in pharmaceutical discovery. Brief. Bioinform. 22(2), 1884–1901 (2021)
Li, J., Wu, B., Sun, X., Wang, Y.: Causal hidden markov model for time series disease forecasting. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12105–12114 (2021)
Luo, J., Ye, M., Xiao, C., Ma, F.: HitaNet: hierarchical time-aware attention networks for risk prediction on electronic health records. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 647–656 (2020)
Ma, F., Chitta, R., Zhou, J., You, Q., Sun, T., Gao, J.: Dipole: diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1903–1911 (2017)
Ma, F., Gao, J., Suo, Q., You, Q., Zhou, J., Zhang, A.: Risk prediction on electronic health records with prior medical knowledge. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1910–1919 (2018)
Miotto, R., Li, L., Kidd, B.A., Dudley, J.T.: Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Sci. Rep. 6(1), 1–10 (2016)
Peng, X., Long, G., Shen, T., Wang, S., Jiang, J.: Sequential diagnosis prediction with transformer and ontological representation. In: 2021 IEEE International Conference on Data Mining (ICDM), pp. 489–498. IEEE (2021)
Pham, T., Tran, T., Phung, D., Venkatesh, S.: DeepCare: a deep dynamic memory model for predictive medicine. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J.Z., Wang, R. (eds.) Advances in Knowledge Discovery and Data Mining: 20th Pacific-Asia Conference, PAKDD 2016, Auckland, New Zealand, April 19-22, 2016, Proceedings, Part II, pp. 30–41. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31750-2_3
Ramchoun, H., Ghanou, Y., Ettaouil, M., Janati Idrissi, M.A.: Multilayer perceptron: architecture optimization and training (2016)
Seber, G.A., Lee, A.J.: Linear Regression Analysis, vol. 330. John Wiley & Sons (2003)
Shang, J., Ma, T., Xiao, C., Sun, J.: Pre-training of graph augmented transformers for medication recommendation. arXiv preprint arXiv:1906.00346 (2019)
Si, Y., et al.: Deep representation learning of patient data from electronic health records (EHR): a systematic review. J. Biomed. Inform. 115, 103671 (2021)
Zhu, D.: A survey of advanced information fusion system: from model-driven to knowledge-enabled. Data Sci. Eng. 8(2), 85–97 (2023). https://doi.org/10.1007/s41019-023-00209-8
Acknowledgment
This work is supported by the Natural Science Foundation of China (62172351), 14th Five-Year Plan Civil Aerospace Pre-research Project of China (D020101), and the Fund of Prospective Layout of Scientific Research for Nanjing University of Aeronautics and Astronautics.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhu, D. et al. (2023). Graph Convolution Synthetic Transformer for Chronic Kidney Disease Onset Prediction. In: Yang, X., et al. Advanced Data Mining and Applications. ADMA 2023. Lecture Notes in Computer Science(), vol 14178. Springer, Cham. https://doi.org/10.1007/978-3-031-46671-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-46671-7_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46670-0
Online ISBN: 978-3-031-46671-7
eBook Packages: Computer ScienceComputer Science (R0)