Abstract
Purpose:
Kidney stone disease (KSD) is a common urological disorder with an increasing incidence worldwide. The extensive knowledge about KSD is dispersed across multiple databases, challenging the visualization and representation of its hierarchy and connections. This paper aims at constructing a disease-specific knowledge graph for KSD to enhance the effective utilization of knowledge by medical professionals and promote clinical research and discovery.
Methods:
Text parsing and semantic analysis were conducted on literature related to KSD from PubMed, with concept annotation based on biomedical ontology being utilized to generate semantic data in RDF format. Moreover, public databases were integrated to construct a large-scale knowledge graph for KSD. Additionally, case studies were carried out to demonstrate the practical utility of the developed knowledge graph.
Results:
We proposed and implemented a Kidney Stone Disease Knowledge Graph (KSDKG), covering more than 90 million triples. This graph comprised semantic data extracted from 29,174 articles, integrating available data from UMLS, SNOMED CT, MeSH, DrugBank and Microbe-Disease Knowledge Graph. Through the application of three cases, we retrieved and discovered information on microbes, drugs and diseases associated with KSD. The results illustrated that the KSDKG can integrate diverse medical knowledge and provide new clinical insights for identifying the underlying mechanisms of KSD.
Conclusion:
The KSDKG efficiently utilizes knowledge graph to reveal hidden knowledge associations, facilitating semantic search and response. As a blueprint for developing disease-specific knowledge graphs, it offers valuable contributions to medical research.
Similar content being viewed by others
Data availability
The data associated with this study are not publicly available at this time, as they are currently reserved for additional analyses. However, further details can be provided upon reasonable request, if needed.
Notes
References
Hao X, Shao Z, Zhang N, Jiang M, Cao X, Li S, Guan Y, Wang C. Integrative genome-wide analyses identify novel loci associated with kidney stones and provide insights into its genetic architecture. Nat Commun. 2023;14(1):7498.
Gillams K, Juliebø-Jones P, Juliebø SØ, Somani BK. Gender differences in kidney stone disease (ksd): findings from a systematic review. Curr Urol Rep. 2021;22:1–8.
Goldfarb DS, Avery AR, Beara-Lasic L, Duncan GE, Goldberg J. A twin study of genetic influences on nephrolithiasis in women and men. Kidney Int Rep. 2019;4(4):535–40.
Chmiel JA, Stuivenberg GA, Al KF, Akouris PP, Razvi H, Burton JP, Bjazevic J. Vitamins as regulators of calcium-containing kidney stones-new perspectives on the role of the gut microbiome. Nat Rev Urol. 2023;20(10):615–37.
Crivelli JJ, Maalouf NM, Paiste HJ, Wood KD, Hughes AE, Oates GR, Assimos DG. Disparities in kidney stone disease: a scoping review. J Urol. 2021;206(3):517–25.
Li L, Liu M, Lai C, Ji W, Xu K, Zhou Y. Analysis of residual stones in patients and related influencing factors after percutaneous nephrolithotomy: a retrospective study. In: 2023 IEEE 11th international conference on healthcare informatics (ICHI). IEEE; 2023. p. 32–39.
Peerapen P, Thongboonkerd V. Kidney stone prevention. Adv Nutr. 2023;14(3):555–69.
Sassanarakkit S, Peerapen P, Thongboonkerd V. Stonemod: a database for kidney stone modulatory proteins with experimental evidence. Sci Rep. 2020;10(1):15109.
Liu M, Luo J, Li L, Pan X, Tan S, Ji W, Zhang H, Tang S, Liu J, Wu B, et al. Design and development of a disease-specific clinical database system to increase the availability of hospital data in china. Health Inf Sci Syst. 2023;11(1):11.
Hogan A, Blomqvist E, Cochez M, d’Amato C, Melo GD, Gutierrez C, Kirrane S, Gayo JEL, Navigli R, Neumaier S, et al. Knowledge graphs. ACM Comput Surv (Csur). 2021;54(4):1–37.
Peng C, Xia F, Naseriparsa M, Osborne F. Knowledge graphs: opportunities and challenges. Artif Intell Rev. 2023;56(11):13071–102.
Zhong L, Wu J, Li Q, Peng H, Wu X. A comprehensive survey on automatic knowledge graph construction. ACM Comput Surv. 2023;56(4):1–62.
Wang T, Zhang Y, Zhang Y, Lu H, Yu B, Peng S, Ma Y, Li D. A hybrid model based on deep convolutional network for medical named entity recognition. J Electr Comput Eng. 2023;2023(1):8969144.
Aldwairi M, Jarrah M, Mahasneh N, Al-khateeb B. Graph-based data management system for efficient information storage, retrieval and processing. Inf Process Manage. 2023;60(2): 103165.
Ji X, Ritter A, Yen P-Y. Using ontology-based semantic similarity to facilitate the article screening process for systematic reviews. J Biomed Inform. 2017;69:33–42.
Wu X, Duan J, Pan Y, Li M. Medical knowledge graph: data sources, construction, reasoning, and applications. Big Data Min Anal. 2023;6(2):201–17.
Chen A, Huang R, Wu E, Han R, Wen J, Li Q, Zhang Z, Shen B. The generation of a lung cancer health factor distribution using patient graphs constructed from electronic medical records: retrospective study. J Med Internet Res. 2022;24(11):40361.
Zhao X, Wang Y, Wen T. The construction of a tcm knowledge graph and application of potential knowledge discovery in diabetic kidney disease by integrating diagnosis and treatment guidelines and real-world clinical data. Front Pharmacol. 2023;14:1147677.
Liu F, Liu M, Li M, Xin Y, Gao D, Wu J, Zhu J. Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction. Quant Imaging Med Surg. 2023;13(6):3873.
An B. Construction and application of Chinese breast cancer knowledge graph based on multi-source heterogeneous data. Math Biosci Eng. 2023;20(4):6776–99.
Jin S, Liang H, Zhang W, Li H, et al. Knowledge graph for breast cancer prevention and treatment: literature-based data analysis study. JMIR Med Inform. 2024;12(1):52210.
Papadakis E, Baryannis G, Batsakis S, Adamou M, Huang Z, Antoniou G. Adhd-kg: a knowledge graph of attention deficit hyperactivity disorder. Health Inf Sci Syst. 2023;11(1):52.
Feng F, Tang F, Gao Y, Zhu D, Li T, Yang S, Yao Y, Huang Y, Liu J. Genomickb: a knowledge graph for the human genome. Nucl Acids Res. 2023;51(D1):950–6.
Chandak P, Huang K, Zitnik M. Building a knowledge graph to enable precision medicine. Sci Data. 2023;10(1):67.
Santos A, Colaço AR, Nielsen AB, Niu L, Strauss M, Geyer PE, Coscia F, Albrechtsen NJW, Mundt F, Jensen LJ, et al. A knowledge graph to interpret clinical proteomics data. Nat Biotechnol. 2022;40(5):692–702.
Byambasuren O, Yang Y, Sui Z, Dai D, Chang B, Li S, Zan H. Preliminary study on the construction of Chinese medical knowledge graph. J Chin Inf Process. 2019;33(10):1–9.
White J. Pubmed 2.0. Med Ref Serv Q. 2020;39(4):382–7.
Ali W, Saleem M, Yao B, Hogan A, Ngomo A-CN. A survey of RDF stores & SPARQL engines for querying knowledge graphs. VLDB J. 2022;31:1–26.
Ait-Mokhtar S, Bruijn B, Hagege C, Rupi P. Intermediary-stage ie components. Technical report, D3. 5. Technical report, EURECA Project; 2014.
Khiari A. Identification of variants of compound terms. PhD thesis, Master Thesis. Technical Report. Université Paul Sabatier, Toulouse; 2015.
Fu C, Zhong R, Jiang X, He T, Jiang X. An integrated knowledge graph for microbe-disease associations. In: Health information science: 9th international conference, HIS 2020, Amsterdam, The Netherlands, October 20–23, 2020, proceedings 9. Springer; 2020. p. 79–90.
Paul S, Mitra A, Koner C. A review on graph database and its representation. In: 2019 international conference on recent advances in energy-efficient computing and communication (ICRAECC). IEEE; 2019. pp. 1–5.
Güting RH. Graphdb: modeling and querying graphs in databases. In: VLDB, vol. 94, Citeseer; 1994. pp. 12–15.
Lan G, Liu T, Wang X, Pan X, Huang Z. A semantic web technology index. Sci Rep. 2022;12(1):3672.
Aleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, Ebert D, Feuermann M, Gaudet P, Harris NL, et al. The gene ontology knowledgebase in 2023. Genetics. 2023;224(1):031.
Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. Kegg: new perspectives on genomes, pathways, diseases and drugs. Nucl Acids Res. 2017;45(D1):353–61.
Tzelves L, Türk C, Skolarikos A. European association of urology urolithiasis guidelines: where are we going? Eur Urol Focus. 2021;7(1):34–8.
Pan S, Luo L, Wang Y, Chen C, Wang J, Wu X. Unifying large language models and knowledge graphs: a roadmap. IEEE Trans Knowl Data Eng. 2024. https://doi.org/10.1109/TKDE.2024.3352100.
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, Scales N, Tanwani A, Cole-Lewis H, Pfohl S, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–80.
Acknowledgements
We sincerely express our gratitude to Professor Kewei Xu and Dr. Cong Lai from the Department of Urology at Sun Yat-sen Memorial Hospital, Sun Yat-sen University, for their assistance and professional guidance during the design and development of our works.
Funding
This work was supported by the Key Research and Development Program of China (2022YFC3601600), the Guangzhou Science and Technology Plan (202201011545), the National Natural Science Foundation of China (61876194), the Science and Technology Innovation Special Project of Guangdong Province (202011020004), and Fundamental Research Funds for the Central Universities, Sun Yat-Sen University (24xkjc025).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no Conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Man, J., Shi, Y., Hu, Z. et al. KSDKG: construction and application of knowledge graph for kidney stone disease based on biomedical literature and public databases. Health Inf Sci Syst 12, 54 (2024). https://doi.org/10.1007/s13755-024-00309-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13755-024-00309-3