Abstract
Accurate identification of drug-target interactions (DTIs) is essential for drug development. It not only helps the researchers to understand the mechanism of drug action, but also contributes to the innovative drug discovery and repositioning. However, due to the limitation the high cost and long time, the traditional experimental methods are difficult to be widely applied for DTIs prediction. In this study, we propose an in silico method for predicting drug-target interactions by Node2vec node embedding in molecular associations network (MAN). Specifically, the MAN is constructed by integrating the associations among drug, protein, disease, lncRNA and miRNA. Then, the node2vec embedding method is employed to obtain a behavior feature vector of each node in the network. The traditional attribute feature vector comes from the drug molecular fingerprint and protein sequences. Finally, a random forest (RF) classifier is performed on these features to predict potential drug-target pairs. The experimental results show that the behavior feature could obtain 87.37% accuracy, which is obviously better than the traditional attribute feature. This work is not only more robust and reliable for predicting DTIs, but also provides an alternative way for other biomolecules associations prediction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rask-Andersen, M., Almén, M.S., Schiöth, H.B.: Trends in the exploitation of novel drug targets. Nat. Rev. Drug Discov. 10, 579–590 (2011)
Li, Y.H., et al.: Clinical trials, progression-speed differentiating features and swiftness rule of the innovative targets of first-in-class drugs. Brief. Bioinform. 21, 649–662 (2020)
Liu, Y., Wu, M., Miao, C., Zhao, P., Li, X.-L.: Neighborhood regularized logistic matrix factorization for drug-target interaction prediction. PLoS Comput. Biol. 12, e1004760 (2016)
Tang, J., et al.: Drug target commons: a community effort to build a consensus knowledge base for drug-target interactions. Cell Chem. Biol. 25, 224–229 (2018). e222
Wang, L., You, Z.-H., Li, L.-P., Yan, X., Zhang, W.: Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions. Sci. Rep. 10, 1–11 (2020)
Ji, B.-Y., You, Z.-H., Cheng, L., Zhou, J.-R., Alghazzawi, D., Li, L.-P.: Predicting miRNA-disease association from heterogeneous information network with GraRep embedding model. Sci. Rep. 10, 1–12 (2020)
You, Z.-H., et al.: PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction. PLoS Comput. Biol. 13, e1005455 (2017)
Chen, X., Yan, C.C., Zhang, X., You, Z.-H.: Long non-coding RNAs and complex diseases: from experimental results to computational models. Brief. Bioinform. 18, 558–576 (2017)
Wang, M.-N., You, Z.-H., Wang, L., Li, L.-P., Zheng, K.: LDGRNMF: LncRNA-disease associations prediction based on graph regularized non-negative matrix factorization. Neurocomputing (2020)
Wang, M.-N., You, Z.-H., Li, L.-P., Wong, L., Chen, Z.-H., Gan, C.-Z.: GNMFLMI: graph regularized nonnegative matrix factorization for predicting LncRNA-MiRNA interactions. IEEE Access 8, 37578–37588 (2020)
Wong, L., Huang, Y.A., You, Z.H., Chen, Z.H., Cao, M.Y.: LNRLMI: linear neighbour representation for predicting lncRNA-miRNA interactions. J. Cell Mol. Med. 24, 79–87 (2020)
Guo, Z.-H., You, Z.-H., Wang, Y.-B., Huang, D.-S., Yi, H.-C., Chen, Z.-H.: Bioentity2vec: attribute-and behavior-driven representation for predicting multi-type relationships between bioentities. GigaScience 9, giaa032 (2020)
Guo, Z.-H., et al.: MeSHHeading2vec: a new method for representing MeSH headings as vectors based on graph embedding algorithm. Brief. Bioinf. (2020)
You, Z.-H., Zhou, M., Luo, X., Li, S.: Highly efficient framework for predicting interactions between proteins. IEEE Trans. Cybern. 47, 731–743 (2016)
Wang, Y., You, Z.-H., Yang, S., Li, X., Jiang, T.-H., Zhou, X.: A high efficient biological language model for predicting protein–protein interactions. Cells 8, 122 (2019)
Chen, Z.-H., You, Z.-H., Li, L.-P., Wang, Y.-B., Wong, L., Yi, H.-C.: Prediction of self-interacting proteins from protein sequence information based on random projection model and fast Fourier transform. Int. J. Mol. Sci. 20, 930 (2019)
Chen, Z.-H., You, Z.-H., Guo, Z.-H., Yi, H.-C., Luo, G.-X., Wang, Y.-B.: Prediction of drug-target interactions from multi-molecular network based on deep walk embedding model. Front. Bioeng. Biotechnol. 8, 338 (2020)
Wang, Y.-B., You, Z.-H., Yang, S., Yi, H.-C., Chen, Z.-H., Zheng, K.: A deep learning-based method for drug-target interaction prediction based on long short-term memory neural network. BMC Med. Inform. Decis. Mak. 20, 1–9 (2020)
Wang, L., et al.: Identification of potential drug–targets by combining evolutionary information extracted from frequency profiles and molecular topological structures. Chem. Biol. Drug Design (2019)
Huang, Y.-A., You, Z.-H., Chen, X.: A systematic prediction of drug-target interactions using molecular fingerprints and protein sequences. Curr. Protein Pept. Sci. 19, 468–478 (2018)
Li, Z., et al.: In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences. Sci. Rep. 7, 1–13 (2017)
Luo, Y., et al.: A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information. Nat. Commun. 8, 1–13 (2017)
Van Laarhoven, T., Nabuurs, S.B., Marchiori, E.: Gaussian interaction profile kernels for predicting drug–target interaction. Bioinformatics 27, 3036–3043 (2011)
Chen, X., Liu, M.-X., Yan, G.-Y.: Drug–target interaction prediction by random walk on the heterogeneous network. Mol. BioSyst. 8, 1970–1978 (2012)
Ezzat, A., Wu, M., Li, X.-L., Kwoh, C.-K.: Computational prediction of drug–target interactions using chemogenomic approaches: an empirical survey. Brief. Bioinform. 20, 1337–1357 (2019)
Guo, Z.-H., Yi, H.-C., You, Z.-H.: Construction and comprehensive analysis of a molecular association network via lncRNA–miRNA–disease–drug–protein graph. Cells 8, 866 (2019)
Wishart, D.S., et al.: DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018)
Davis, A.P., et al.: The comparative toxicogenomics database: update 2019. Nucleic Acids Res. 47, D948–D954 (2019)
Piñero, J., et al.: DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. gkw943 (2016)
Cheng, L., et al.: LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse. Nucleic Acids Res. 47, D140–D144 (2019)
Chen, G., et al.: LncRNADisease: a database for long-non-coding RNA-associated diseases. Nucleic Acids Res. 41, D983–D986 (2012)
Miao, Y.-R., Liu, W., Zhang, Q., Guo, A.-Y.: lncRNASNP2: an updated database of functional SNPs and mutations in human and mouse lncRNAs. Nucleic Acids Res. 46, D276–D280 (2018)
Chou, C.-H., et al.: miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 46, D296–D302 (2018)
Huang, Z., et al.: HMDD v3.0: a database for experimentally supported human microRNA–disease associations. Nucleic acids research 47, 1013–1017 (2019)
Szklarczyk, D., et al.: The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic Acids Res. gkw937 (2016)
Grover, A., Leskovec, J.: node2vec: Scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864 (2016)
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710 (2014)
Kurant, M., Markopoulou, A., Thiran, P.: On the bias of BFS (breadth first search). In: 2010 22nd International Teletraffic Congress (lTC 22), pp. 1–8. IEEE (2010)
Cui, L., et al.: A novel artificial bee colony algorithm with depth-first search framework and elite-guided search equation. Inf. Sci. 367, 1012–1044 (2016)
Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2, 18–22 (2002)
Wong, L., You, Z.-H., Guo, Z.-H., Yi, H.-C., Chen, Z.-H., Cao, M.-Y.: MIPDH: a novel computational model for predicting microRNA–mRNA interactions by DeepWalk on a heterogeneous network. ACS Omega 5, 17022–17032 (2020)
Chen, Z.-H., Li, L.-P., He, Z., Zhou, J.-R., Li, Y., Wong, L.: An improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation. Front. Genet. 10, 90 (2019)
Wang, Y., You, Z., Li, L., Chen, Z.: A survey of current trends in computational predictions of protein-protein interactions. Front. Comput. Sci. 14, 144901 (2020)
Acknowledgments
This work is supported by the Xinjiang Natural Science Foundation under Grant 2017D01A78. This work is also supported in part by the National Natural Science Foundation of China, under Grants 61902342.
Author information
Authors and Affiliations
Contributions
ZH Chen and ZH You conceived the algorithm, carried out analyses, prepared the data sets, carried out experiments, and wrote the manuscript; ZH Guo and HC Yi designed, performed; GX Luo and YB W analyzed experiments and checked the manuscript; All authors read and approved the final manuscript.
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
The authors declare no conflict of interest.
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, ZH., You, ZH., Guo, ZH., Yi, HC., Luo, GX., Wang, YB. (2020). Predicting Drug-Target Interactions by Node2vec Node Embedding in Molecular Associations Network. In: Huang, DS., Jo, KH. (eds) Intelligent Computing Theories and Application. ICIC 2020. Lecture Notes in Computer Science(), vol 12464. Springer, Cham. https://doi.org/10.1007/978-3-030-60802-6_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-60802-6_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60801-9
Online ISBN: 978-3-030-60802-6
eBook Packages: Computer ScienceComputer Science (R0)