{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T16:32:32Z","timestamp":1740155552174,"version":"3.37.3"},"reference-count":79,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001407","name":"Department of Biotechnology , Ministry of Science and Technology","doi-asserted-by":"publisher","award":["GAP-0282 (HiCHiCoB Centre)"],"id":[{"id":"10.13039\/501100001407","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2022,12]]},"abstract":"Abstract<\/jats:title>Virtual screening (VS) aids in prioritizing unknown bio-interactions between compounds and protein targets for empirical drug discovery. In standard VS exercise, roughly 10% of top-ranked molecules exhibit activity when examined in biochemical assays, which accounts for many false positive hits, making it an arduous task. Attempts for conquering false-hit rates were developed through either ligand-based or structure-based VS separately; however, nonetheless performed remarkably well. Here, we present an advanced VS framework\u2014automated hit identification and optimization tool (A-HIOT)\u2014comprises chemical space-driven stacked ensemble for identification and protein space-driven deep learning architectures for optimization of an array of specific hits for fixed protein receptors. A-HIOT implements numerous open-source algorithms intending to integrate chemical and protein space leading to a high-quality prediction. The optimized hits are the selective molecules which we retrieve after extreme refinement implying chemical space and protein space modules of A-HIOT. Using CXC chemokine receptor 4, we demonstrated the superior performance of A-HIOT for hit molecule identification and optimization with tenfold cross-validation accuracies of 94.8% and 81.9%, respectively. In comparison with other machine learning algorithms, A-HIOT achieved higher accuracies of 96.2% for hit identification and 89.9% for hit optimization on independent benchmark datasets for CXCR4 and 86.8% for hit identification and 90.2% for hit optimization on independent test dataset for androgen receptor (AR), thus, shows its generalizability and robustness. In conclusion, advantageous features impeded in A-HIOT is making a reliable approach for bridging the long-standing gap between ligand-based and structure-based VS in finding the optimized hits for the desired receptor. The complete resource (framework) code is available at https:\/\/gitlab.com\/neeraj-24\/A-HIOT<\/jats:ext-link>.<\/jats:p>\n Graphical Abstract<\/jats:bold><\/jats:p>","DOI":"10.1186\/s13321-022-00630-7","type":"journal-article","created":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T11:05:03Z","timestamp":1658487903000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Machine intelligence-driven framework for optimized hit selection in virtual screening"],"prefix":"10.1186","volume":"14","author":[{"given":"Neeraj","family":"Kumar","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2175-9799","authenticated-orcid":false,"given":"Vishal","family":"Acharya","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,7,22]]},"reference":[{"key":"630_CR1","doi-asserted-by":"publisher","DOI":"10.3389\/fchem.2020.00343","author":"EHB Maia","year":"2020","unstructured":"Maia EHB, Assis LC, de Oliveira TA et al (2020) Structure-based virtual screening: from classical to artificial intelligence. Front Chem. https:\/\/doi.org\/10.3389\/fchem.2020.00343","journal-title":"Front Chem"},{"key":"630_CR2","doi-asserted-by":"publisher","first-page":"314","DOI":"10.1038\/534314a","volume":"534","author":"N Nosengo","year":"2016","unstructured":"Nosengo N (2016) Can you teach old drugs new tricks? Nature 534:314\u2013316. https:\/\/doi.org\/10.1038\/534314a","journal-title":"Nature"},{"key":"630_CR3","doi-asserted-by":"publisher","first-page":"1V","DOI":"10.1186\/s12859-020-3379-z","volume":"21","author":"K Pliakos","year":"2020","unstructured":"Pliakos K, Vens C (2020) Drug-target interaction prediction with tree-ensemble learning and output space reconstruction. BMC Bioinformatics 21:1V. https:\/\/doi.org\/10.1186\/s12859-020-3379-z","journal-title":"BMC Bioinformatics"},{"key":"630_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-019-13807-w","volume":"11","author":"O M\u00e9ndez-Lucio","year":"2020","unstructured":"M\u00e9ndez-Lucio O, Baillif B, Clevert DA et al (2020) De novo generation of hit-like molecules from gene expression signatures using artificial intelligence. Nat Commun 11:1\u201310. https:\/\/doi.org\/10.1038\/s41467-019-13807-w","journal-title":"Nat Commun"},{"key":"630_CR5","doi-asserted-by":"publisher","first-page":"2705","DOI":"10.1021\/acs.jcim.5b00444","volume":"55","author":"H Iwata","year":"2015","unstructured":"Iwata H, Sawada R, Mizutani S et al (2015) Large-scale prediction of beneficial drug combinations using drug efficacy and target profiles. J Chem Inf Model 55:2705\u20132716. https:\/\/doi.org\/10.1021\/acs.jcim.5b00444","journal-title":"J Chem Inf Model"},{"key":"630_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/J.ABB.2020.108730","volume":"698","author":"CN Cavasotto","year":"2021","unstructured":"Cavasotto CN, Di Filippo JI (2021) Artificial intelligence in the early stages of drug discovery. Arch Biochem Biophys 698:108730. https:\/\/doi.org\/10.1016\/J.ABB.2020.108730","journal-title":"Arch Biochem Biophys"},{"key":"630_CR7","doi-asserted-by":"publisher","first-page":"947","DOI":"10.1517\/17460440903190961","volume":"4","author":"SS Phatak","year":"2009","unstructured":"Phatak SS, Stephan CC, Cavasotto CN (2009) High-throughput and in silico screenings in drug discovery. Expert Opin Drug Discov 4:947\u2013959. https:\/\/doi.org\/10.1517\/17460440903190961","journal-title":"Expert Opin Drug Discov"},{"key":"630_CR8","doi-asserted-by":"publisher","first-page":"494","DOI":"10.1016\/j.cbpa.2007.08.033","volume":"11","author":"C McInnes","year":"2007","unstructured":"McInnes C (2007) Virtual screening strategies in drug discovery. Curr Opin Chem Biol 11:494\u2013502","journal-title":"Curr Opin Chem Biol"},{"key":"630_CR9","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1016\/S1359-6446(04)02939-3","volume":"9","author":"T Lengauer","year":"2004","unstructured":"Lengauer T, Lemmen C, Rarey M, Zimmermann M (2004) Novel technologies for virtual screening. Drug Discov Today 9:27\u201334. https:\/\/doi.org\/10.1016\/S1359-6446(04)02939-3","journal-title":"Drug Discov Today"},{"key":"630_CR10","doi-asserted-by":"publisher","first-page":"1375","DOI":"10.3390\/ijms20061375","volume":"20","author":"A Gimeno","year":"2019","unstructured":"Gimeno A, Ojeda-Montes MJ, Tom\u00e1s-Hern\u00e1ndez S et al (2019) The light and dark sides of virtual screening: what is there to know? Int J Mol Sci 20:1375. https:\/\/doi.org\/10.3390\/ijms20061375","journal-title":"Int J Mol Sci"},{"key":"630_CR11","doi-asserted-by":"publisher","first-page":"18477","DOI":"10.1073\/PNAS.2000585117\/SUPPL_FILE\/PNAS.2000585117.SAPP.PDF","volume":"117","author":"YO Adeshina","year":"2020","unstructured":"Adeshina YO, Deeds EJ, Karanicolas J (2020) Machine learning classification can reduce false positives in structure-based virtual screening. Proc Natl Acad Sci U S A 117:18477\u201318488. https:\/\/doi.org\/10.1073\/PNAS.2000585117\/SUPPL_FILE\/PNAS.2000585117.SAPP.PDF","journal-title":"Proc Natl Acad Sci U S A"},{"key":"630_CR12","doi-asserted-by":"publisher","first-page":"344","DOI":"10.2174\/138620709788167944","volume":"12","author":"X Ma","year":"2009","unstructured":"Ma X, Jia J, Zhu F et al (2009) Comparative analysis of machine learning methods in ligand-based virtual screening of large compound libraries. Comb Chem High Throughput Screen 12:344\u2013357. https:\/\/doi.org\/10.2174\/138620709788167944","journal-title":"Comb Chem High Throughput Screen"},{"key":"630_CR13","doi-asserted-by":"publisher","first-page":"1923","DOI":"10.2174\/1568026614666140929124445","volume":"14","author":"E Lionta","year":"2014","unstructured":"Lionta E, Spyrou G, Vassilatis DK, Cournia Z (2014) Structure-based virtual screening for drug discovery: principles, applications and recent advances. Curr Top Med Chem 14:1923. https:\/\/doi.org\/10.2174\/1568026614666140929124445","journal-title":"Curr Top Med Chem"},{"key":"630_CR14","doi-asserted-by":"publisher","first-page":"5152","DOI":"10.3390\/MOLECULES20035152","volume":"20","author":"Y Awuni","year":"2015","unstructured":"Awuni Y, Mu Y (2015) Reduction of false positives in structure-based virtual screening when receptor plasticity is considered. Molecules 20:5152. https:\/\/doi.org\/10.3390\/MOLECULES20035152","journal-title":"Molecules"},{"key":"630_CR15","doi-asserted-by":"publisher","first-page":"1878","DOI":"10.1093\/bib\/bby061","volume":"20","author":"AS Rifaioglu","year":"2019","unstructured":"Rifaioglu AS, Atas H, Martin MJ et al (2019) Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases. Brief Bioinform 20:1878\u20131912. https:\/\/doi.org\/10.1093\/bib\/bby061","journal-title":"Brief Bioinform"},{"key":"630_CR16","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.1016\/J.DRUDIS.2019.02.013","volume":"24","author":"T Hoffmann","year":"2019","unstructured":"Hoffmann T, Gastreich M (2019) The next level in chemical space navigation: going far beyond enumerable compound libraries. Drug Discov Today 24:1148\u20131156. https:\/\/doi.org\/10.1016\/J.DRUDIS.2019.02.013","journal-title":"Drug Discov Today"},{"key":"630_CR17","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1039\/C0MD00020E","volume":"1","author":"JL Reymond","year":"2010","unstructured":"Reymond JL, Van Deursen R, Blum LC, Ruddigkeit L (2010) Chemical space as a source for new drugs. Medchemcomm 1:30\u201338. https:\/\/doi.org\/10.1039\/C0MD00020E","journal-title":"Medchemcomm"},{"key":"630_CR18","doi-asserted-by":"publisher","DOI":"10.1080\/17460441.2020.1730324","author":"M Vogt","year":"2020","unstructured":"Vogt M (2020) How do we optimize chemical space navigation? Expert Opin Drug Discov. https:\/\/doi.org\/10.1080\/17460441.2020.1730324","journal-title":"Expert Opin Drug Discov"},{"key":"630_CR19","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y Lecun","year":"2015","unstructured":"Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436\u2013444","journal-title":"Nature"},{"key":"630_CR20","doi-asserted-by":"publisher","unstructured":"Bahi M, Batouche M (2018) Deep learning for ligand-based virtual screening in drug discovery. In: Proceedings PAIS 2018 international conference on pattern analysis and intelligent systemshttps:\/\/doi.org\/10.1109\/PAIS.2018.8598488","DOI":"10.1109\/PAIS.2018.8598488"},{"key":"630_CR21","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1007\/bf00153759","volume":"6","author":"DW Aha","year":"1991","unstructured":"Aha DW, Kibler D, Albert MK (1991) Instance-based learning algorithms. Mach Learn 6:37\u201366. https:\/\/doi.org\/10.1007\/bf00153759","journal-title":"Mach Learn"},{"key":"630_CR22","doi-asserted-by":"publisher","first-page":"10520","DOI":"10.1021\/acs.chemrev.8b00728","volume":"119","author":"X Yang","year":"2019","unstructured":"Yang X, Wang Y, Byrne R et al (2019) Concepts of artificial intelligence for computer-assisted drug discovery. Chem Rev 119:10520\u201310594","journal-title":"Chem Rev"},{"key":"630_CR23","doi-asserted-by":"publisher","first-page":"573","DOI":"10.1038\/s42256-020-00236-4","volume":"2","author":"J Jim\u00e9nez-Luna","year":"2020","unstructured":"Jim\u00e9nez-Luna J, Grisoni F, Schneider G (2020) Drug discovery with explainable artificial intelligence. Nat Mach Intell 2:573\u2013584. https:\/\/doi.org\/10.1038\/s42256-020-00236-4","journal-title":"Nat Mach Intell"},{"key":"630_CR24","doi-asserted-by":"publisher","first-page":"1489","DOI":"10.1016\/j.drudis.2017.05.008","volume":"22","author":"A S\u00e1nchez-Rodr\u00edguez","year":"2017","unstructured":"S\u00e1nchez-Rodr\u00edguez A, P\u00e9rez-Castillo Y, Sch\u00fcrer SC et al (2017) From flamingo dance to (desirable) drug discovery: a nature-inspired approach. Drug Discov Today 22:1489\u20131502. https:\/\/doi.org\/10.1016\/j.drudis.2017.05.008","journal-title":"Drug Discov Today"},{"key":"630_CR25","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1109\/MCAS.2006.1688199","volume":"6","author":"R Polikar","year":"2006","unstructured":"Polikar R (2006) Ensemble based systems in decision making. IEEE Circuits Syst Mag 6:21\u201344. https:\/\/doi.org\/10.1109\/MCAS.2006.1688199","journal-title":"IEEE Circuits Syst Mag"},{"key":"630_CR26","doi-asserted-by":"publisher","first-page":"903","DOI":"10.1021\/CI0203702","volume":"42","author":"DK Agrafiotis","year":"2002","unstructured":"Agrafiotis DK, Cede\u00f1o W, Lobanov VS (2002) On the use of neural network ensembles in QSAR and QSPR. J Chem Inf Comput Sci 42:903\u2013911. https:\/\/doi.org\/10.1021\/CI0203702","journal-title":"J Chem Inf Comput Sci"},{"key":"630_CR27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/S12859-019-3135-4\/FIGURES\/4","volume":"20","author":"S Kwon","year":"2019","unstructured":"Kwon S, Bae H, Jo J, Yoon S (2019) Comprehensive ensemble in QSAR prediction for drug discovery. BMC Bioinformatics 20:1\u201312. https:\/\/doi.org\/10.1186\/S12859-019-3135-4\/FIGURES\/4","journal-title":"BMC Bioinformatics"},{"issue":"7","key":"630_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-017-02114-3","volume":"71","author":"I Ponzoni","year":"2017","unstructured":"Ponzoni I, Sebasti\u00e1n-P\u00e9rez V, Requena-Triguero C et al (2017) Hybridizing feature selection and feature learning approaches in QSAR modeling for drug discovery. Sci Reports 71(7):1\u201319. https:\/\/doi.org\/10.1038\/s41598-017-02114-3","journal-title":"Sci Reports"},{"key":"630_CR29","doi-asserted-by":"publisher","DOI":"10.1038\/S41598-017-02365-0","author":"L Zhang","year":"2017","unstructured":"Zhang L, Ai H, Chen W et al (2017) CarcinoPred-EL: novel models for predicting the carcinogenicity of chemicals using molecular fingerprints and ensemble learning methods. Sci Rep. https:\/\/doi.org\/10.1038\/S41598-017-02365-0","journal-title":"Sci Rep"},{"key":"630_CR30","doi-asserted-by":"publisher","first-page":"5277","DOI":"10.3390\/MOLECULES25225277","volume":"25","author":"L Patel","year":"2020","unstructured":"Patel L, Shukla T, Huang X et al (2020) Machine learning methods in drug discovery. Mol 25:5277. https:\/\/doi.org\/10.3390\/MOLECULES25225277","journal-title":"Mol"},{"key":"630_CR31","doi-asserted-by":"publisher","DOI":"10.1063\/5.0057487","volume":"2365","author":"R Davronov","year":"2021","unstructured":"Davronov R, Adilova F (2021) A comparative analysis of the ensemble methods for drug design. AIP Conf Proc 2365:030001. https:\/\/doi.org\/10.1063\/5.0057487","journal-title":"AIP Conf Proc"},{"key":"630_CR32","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1016\/j.ymeth.2014.08.005","volume":"71","author":"A Cereto-Massagu\u00e9","year":"2015","unstructured":"Cereto-Massagu\u00e9 A, Ojeda MJ, Valls C et al (2015) Molecular fingerprint similarity search in virtual screening. Methods 71:58\u201363. https:\/\/doi.org\/10.1016\/j.ymeth.2014.08.005","journal-title":"Methods"},{"key":"630_CR33","doi-asserted-by":"publisher","first-page":"315","DOI":"10.3389\/fchem.2018.00315","volume":"6","author":"A Kumar","year":"2018","unstructured":"Kumar A, Zhang KYJ (2018) Advances in the development of shape similarity methods and their application in drug discovery. Front Chem 6:315","journal-title":"Front Chem"},{"key":"630_CR34","doi-asserted-by":"publisher","first-page":"8461","DOI":"10.1021\/jm101020z","volume":"53","author":"P Ripphausen","year":"2010","unstructured":"Ripphausen P, Nisius B, Peltason L, Bajorath J (2010) Quo vadis, virtual screening? A comprehensive survey of prospective applications. J Med Chem 53:8461\u20138467. https:\/\/doi.org\/10.1021\/jm101020z","journal-title":"J Med Chem"},{"key":"630_CR35","doi-asserted-by":"publisher","first-page":"4103","DOI":"10.1021\/acs.jmedchem.5b02008","volume":"59","author":"JJ Irwin","year":"2016","unstructured":"Irwin JJ, Shoichet BK (2016) Docking screens for novel ligands conferring new biology. J Med Chem 59:4103\u20134120","journal-title":"J Med Chem"},{"key":"630_CR36","doi-asserted-by":"publisher","first-page":"2356","DOI":"10.1021\/ci900212v","volume":"49","author":"NK Salam","year":"2009","unstructured":"Salam NK, Nuti R, Sherman W (2009) Novel method for generating structure-based pharmacophores using energetic analysis. J Chem Inf Model 49:2356\u20132368. https:\/\/doi.org\/10.1021\/ci900212v","journal-title":"J Chem Inf Model"},{"key":"630_CR37","unstructured":"Ramsundar B (2016) deepchem.io. https:\/\/github.com\/deepchem\/deepchem"},{"key":"630_CR38","doi-asserted-by":"publisher","first-page":"1955","DOI":"10.1021\/ACS.JCIM.9B01053","volume":"60","author":"AJ Minnich","year":"2020","unstructured":"Minnich AJ, McLoughlin K, Tse M et al (2020) AMPL: a data-driven modeling pipeline for drug discovery. J Chem Inf Model 60:1955\u20131968. https:\/\/doi.org\/10.1021\/ACS.JCIM.9B01053","journal-title":"J Chem Inf Model"},{"key":"630_CR39","doi-asserted-by":"publisher","first-page":"3845","DOI":"10.1021\/ACS.JCIM.1C00653","volume":"61","author":"G Amendola","year":"2021","unstructured":"Amendola G, Cosconati S (2021) PyRMD: a new fully automated ai-powered ligand-based virtual screening tool. J Chem Inf Model 61:3845. https:\/\/doi.org\/10.1021\/ACS.JCIM.1C00653","journal-title":"J Chem Inf Model"},{"key":"630_CR40","doi-asserted-by":"publisher","DOI":"10.3390\/IJMS22094435","author":"TB Kimber","year":"2021","unstructured":"Kimber TB, Chen Y, Volkamer A (2021) Deep learning in virtual screening: recent applications and developments. Int J Mol Sci. https:\/\/doi.org\/10.3390\/IJMS22094435","journal-title":"Int J Mol Sci"},{"key":"630_CR41","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1016\/J.JTBI.2012.11.005","volume":"318","author":"C Yu","year":"2013","unstructured":"Yu C, Deng M, Cheng SY et al (2013) Protein space: a natural method for realizing the nature of protein universe. J Theor Biol 318:197\u2013204. https:\/\/doi.org\/10.1016\/J.JTBI.2012.11.005","journal-title":"J Theor Biol"},{"key":"630_CR42","doi-asserted-by":"publisher","first-page":"2495","DOI":"10.1021\/ACS.JCIM.6B00355","volume":"56","author":"JC Pereira","year":"2016","unstructured":"Pereira JC, Caffarena ER, dos Santos CN (2016) Boosting docking-based virtual screening with deep learning. J Chem Inf Model 56:2495\u20132506. https:\/\/doi.org\/10.1021\/ACS.JCIM.6B00355","journal-title":"J Chem Inf Model"},{"key":"630_CR43","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1021\/ACSCENTSCI.0C00229","volume":"6","author":"F Gentile","year":"2020","unstructured":"Gentile F, Agrawal V, Hsing M et al (2020) Deep docking: a deep learning platform for augmentation of structure based drug discovery. ACS Cent Sci 6:939\u2013949. https:\/\/doi.org\/10.1021\/ACSCENTSCI.0C00229","journal-title":"ACS Cent Sci"},{"key":"630_CR44","doi-asserted-by":"publisher","first-page":"3329","DOI":"10.1093\/BIOINFORMATICS\/BTZ111","volume":"35","author":"M Karimi","year":"2019","unstructured":"Karimi M, Wu D, Wang Z, Shen Y (2019) DeepAffinity: interpretable deep learning of compound\u2013protein affinity through unified recurrent and convolutional neural networks. Bioinformatics 35:3329\u20133338. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTZ111","journal-title":"Bioinformatics"},{"key":"630_CR45","doi-asserted-by":"publisher","DOI":"10.3390\/MOLECULES25204723","author":"J V\u00e1zquez","year":"2020","unstructured":"V\u00e1zquez J, L\u00f3pez M, Gibert E et al (2020) Merging ligand-based and structure-based methods in drug discovery: an overview of combined virtual screening approaches. Molecules. https:\/\/doi.org\/10.3390\/MOLECULES25204723","journal-title":"Molecules"},{"key":"630_CR46","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1023\/A:1022859003006","volume":"51","author":"LI Kuncheva","year":"2003","unstructured":"Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51:181\u2013207. https:\/\/doi.org\/10.1023\/A:1022859003006","journal-title":"Mach Learn"},{"key":"630_CR47","doi-asserted-by":"publisher","first-page":"42826","DOI":"10.1074\/jbc.M106582200","volume":"276","author":"N Zhou","year":"2001","unstructured":"Zhou N, Luo Z, Luo J et al (2001) Structural and functional characterization of human CXCR4 as a chemokine receptor and HIV-1 co-receptor by mutagenesis and molecular modeling studies. J Biol Chem 276:42826\u201342833. https:\/\/doi.org\/10.1074\/jbc.M106582200","journal-title":"J Biol Chem"},{"key":"630_CR48","doi-asserted-by":"publisher","first-page":"1391","DOI":"10.1084\/jem.20030267","volume":"198","author":"T Schioppa","year":"2003","unstructured":"Schioppa T, Uranchimeg B, Saccani A et al (2003) Regulation of the chemokine receptor CXCR4 by hypoxia. J Exp Med 198:1391\u20131402. https:\/\/doi.org\/10.1084\/jem.20030267","journal-title":"J Exp Med"},{"key":"630_CR49","doi-asserted-by":"publisher","first-page":"1664","DOI":"10.3389\/fimmu.2020.01664","volume":"11","author":"ME Bianchi","year":"2020","unstructured":"Bianchi ME, Mezzapelle R (2020) The chemokine receptor CXCR4 in cell proliferation and tissue regeneration. Front Immunol 11:1664\u20132322","journal-title":"Front Immunol"},{"key":"630_CR50","doi-asserted-by":"publisher","first-page":"185","DOI":"10.3390\/cells8020185","volume":"8","author":"N Kawaguchi","year":"2019","unstructured":"Kawaguchi N, Zhang T-T, Nakanishi T (2019) Involvement of CXCR4 in normal and abnormal development. Cells 8:185. https:\/\/doi.org\/10.3390\/cells8020185","journal-title":"Cells"},{"key":"630_CR51","doi-asserted-by":"publisher","first-page":"2212","DOI":"10.1016\/j.jbo.2019.100253","volume":"17","author":"TR Ullah","year":"2019","unstructured":"Ullah TR (2019) The role of CXCR4 in multiple myeloma: Cells\u2019 journey from bone marrow to beyond. J Bone Oncol 17:2212\u20131374. https:\/\/doi.org\/10.1016\/j.jbo.2019.100253","journal-title":"J Bone Oncol"},{"key":"630_CR52","doi-asserted-by":"publisher","DOI":"10.1038\/srep30155","author":"RK Mishra","year":"2016","unstructured":"Mishra RK, Shum AK, Platanias LC et al (2016) Discovery and characterization of novel small-molecule CXCR4 receptor agonists and antagonists. Sci Rep. https:\/\/doi.org\/10.1038\/srep30155","journal-title":"Sci Rep"},{"key":"630_CR53","doi-asserted-by":"publisher","first-page":"47","DOI":"10.7150\/thno.5376","volume":"3","author":"B Debnath","year":"2013","unstructured":"Debnath B, Xu S, Grande F et al (2013) Small molecule inhibitors of CXCR4. Theranostics 3:47\u201375. https:\/\/doi.org\/10.7150\/thno.5376","journal-title":"Theranostics"},{"key":"630_CR54","doi-asserted-by":"publisher","first-page":"1895","DOI":"10.1128\/AAC.04654-14","volume":"59","author":"D Das","year":"2015","unstructured":"Das D, Maeda K, Hayashi Y et al (2015) Insights into the mechanism of inhibition of CXCR4: identification of piperidinylethanamine analogs as anti-HIV-1 inhibitors. Antimicrob Agents Chemother 59:1895\u20131904. https:\/\/doi.org\/10.1128\/AAC.04654-14","journal-title":"Antimicrob Agents Chemother"},{"key":"630_CR55","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1038\/nrd.2018.168","volume":"18","author":"S Pushpakom","year":"2018","unstructured":"Pushpakom S, Iorio F, Eyers PA et al (2018) Drug repurposing: progress, challenges and recommendations. Nat Rev Drug Discov 18:41\u201358. https:\/\/doi.org\/10.1038\/nrd.2018.168","journal-title":"Nat Rev Drug Discov"},{"key":"630_CR56","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1758-2946-3-33","volume":"3","author":"NM O\u2019Boyle","year":"2011","unstructured":"O\u2019Boyle NM, Banck M, James CA et al (2011) Open babel: an open chemical toolbox. J Cheminform 3:1\u201314. https:\/\/doi.org\/10.1186\/1758-2946-3-33","journal-title":"J Cheminform"},{"key":"630_CR57","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1002\/(SICI)1096-987X(199604)17:5\/6<587::AID-JCC4>3.0.CO;2-Q","volume":"17","author":"TA Halgren","year":"1996","unstructured":"Halgren TA, Nachbar RB (1996) Merck molecular force field. IV. Conformational energies and geometries for MMFF94. J Comput Chem 17:587\u2013615. https:\/\/doi.org\/10.1002\/(SICI)1096-987X(199604)17:5\/6%3c587::AID-JCC4%3e3.0.CO;2-Q","journal-title":"J Comput Chem"},{"key":"630_CR58","doi-asserted-by":"publisher","first-page":"6582","DOI":"10.1021\/jm300687e","volume":"55","author":"MM Mysinger","year":"2012","unstructured":"Mysinger MM, Carchia M, Irwin JJ, Shoichet BK (2012) Directory of useful decoys, enhanced (DUD-E): Better ligands and decoys for better benchmarking. J Med Chem 55:6582\u20136594. https:\/\/doi.org\/10.1021\/jm300687e","journal-title":"J Med Chem"},{"key":"630_CR59","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1002\/jcc.21707","volume":"32","author":"CW Yap","year":"2011","unstructured":"Yap CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32:1466\u20131474. https:\/\/doi.org\/10.1002\/jcc.21707","journal-title":"J Comput Chem"},{"key":"630_CR60","doi-asserted-by":"publisher","first-page":"314","DOI":"10.1007\/978-1-4899-7687-1_192","volume-title":"Encyclopedia of machine learning and data mining","author":"E Keogh","year":"2017","unstructured":"Keogh E, Mueen A (2017) Curse of dimensionality. Encyclopedia of machine learning and data mining. Springer, US, pp 314\u2013315"},{"key":"630_CR61","unstructured":"Liaw A, news MW-R, 2002 undefined Classification and regression by randomForest. researchgate.net"},{"key":"630_CR62","doi-asserted-by":"publisher","unstructured":"Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, NY, USA, pp 785\u2013794. https:\/\/doi.org\/10.1145\/2939672.2939785","DOI":"10.1145\/2939672.2939785"},{"key":"630_CR63","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/NAR\/28.1.235","volume":"28","author":"HM Berman","year":"2000","unstructured":"Berman HM, Westbrook J, Feng Z et al (2000) The protein data bank. Nucleic Acids Res 28:235\u2013242. https:\/\/doi.org\/10.1093\/NAR\/28.1.235","journal-title":"Nucleic Acids Res"},{"key":"630_CR64","doi-asserted-by":"publisher","first-page":"2326","DOI":"10.2174\/1381612811319120019","volume":"19","author":"Y Yuan","year":"2013","unstructured":"Yuan Y, Pei J, Lai L (2013) Binding site detection and druggability prediction of protein targets for structure- based drug design. Curr Pharm Des 19:2326\u20132333. https:\/\/doi.org\/10.2174\/1381612811319120019","journal-title":"Curr Pharm Des"},{"key":"630_CR65","doi-asserted-by":"publisher","first-page":"2684","DOI":"10.1021\/ci600246s","volume":"46","author":"J Chen","year":"2006","unstructured":"Chen J, Lai L (2006) Pocket vol 2: further developments on receptor-based pharmacophore modeling. J Chem Inf Model 46:2684\u20132691. https:\/\/doi.org\/10.1021\/ci600246s","journal-title":"J Chem Inf Model"},{"key":"630_CR66","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1002\/jcc.21334","volume":"31","author":"O Trott","year":"2010","unstructured":"Trott O, Olson AJ (2010) Software news and update AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 31:455\u2013461. https:\/\/doi.org\/10.1002\/jcc.21334","journal-title":"J Comput Chem"},{"key":"630_CR67","doi-asserted-by":"publisher","DOI":"10.1002\/jcc.21256","author":"GM Morris","year":"2009","unstructured":"Morris GM, Ruth H, Lindstrom W et al (2009) Software news and updates AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J Comput Chem. https:\/\/doi.org\/10.1002\/jcc.21256","journal-title":"J Comput Chem"},{"key":"630_CR68","doi-asserted-by":"publisher","first-page":"2518","DOI":"10.1093\/bioinformatics\/btn479","volume":"24","author":"J Klekota","year":"2008","unstructured":"Klekota J, Roth FP (2008) Chemical substructures that enrich for biological activity. Bioinformatics 24:2518\u20132525. https:\/\/doi.org\/10.1093\/bioinformatics\/btn479","journal-title":"Bioinformatics"},{"key":"630_CR69","doi-asserted-by":"publisher","first-page":"W443","DOI":"10.1093\/nar\/gkv315","volume":"43","author":"S Salentin","year":"2015","unstructured":"Salentin S, Schreiber S, Haupt VJ et al (2015) PLIP: fully automated protein-ligand interaction profiler. Nucleic Acids Res 43:W443\u2013W447. https:\/\/doi.org\/10.1093\/nar\/gkv315","journal-title":"Nucleic Acids Res"},{"key":"630_CR70","doi-asserted-by":"publisher","first-page":"3117","DOI":"10.1021\/JM500132P","volume":"57","author":"N Lagarde","year":"2014","unstructured":"Lagarde N, Ben Nasr N, J\u00e9r\u00e9mie A et al (2014) NRLiSt BDB, the manually curated nuclear receptors ligands and structures benchmarking database. J Med Chem 57:3117\u20133125. https:\/\/doi.org\/10.1021\/JM500132P","journal-title":"J Med Chem"},{"key":"630_CR71","doi-asserted-by":"publisher","first-page":"1538","DOI":"10.1016\/j.drudis.2018.05.010","volume":"23","author":"YC Lo","year":"2018","unstructured":"Lo YC, Rensi SE, Torng W, Altman RB (2018) Machine learning in chemoinformatics and drug discovery. Drug Discov Today 23:1538\u20131546. https:\/\/doi.org\/10.1016\/j.drudis.2018.05.010","journal-title":"Drug Discov Today"},{"key":"630_CR72","volume-title":"Molecular descriptors for chemoinformatics","author":"R Todeschini","year":"2010","unstructured":"Todeschini R, Consonni V (2010) Molecular descriptors for chemoinformatics. Wiley Blackwell"},{"key":"630_CR73","doi-asserted-by":"publisher","first-page":"241","DOI":"10.1016\/S0893-6080(05)80023-1","volume":"5","author":"DH Wolpert","year":"1992","unstructured":"Wolpert DH (1992) Stacked generalization. Neural Netw 5:241\u2013259. https:\/\/doi.org\/10.1016\/S0893-6080(05)80023-1","journal-title":"Neural Netw"},{"key":"630_CR74","doi-asserted-by":"publisher","first-page":"4735","DOI":"10.1021\/ACS.JMEDCHEM.6B01309","volume":"60","author":"M Arimont","year":"2017","unstructured":"Arimont M, Sun S-L, Leurs R et al (2017) Structural analysis of chemokine receptor-ligand interactions. J Med Chem 60:4735. https:\/\/doi.org\/10.1021\/ACS.JMEDCHEM.6B01309","journal-title":"J Med Chem"},{"key":"630_CR75","doi-asserted-by":"publisher","first-page":"1066","DOI":"10.1126\/science.1194396","volume":"330","author":"B Wu","year":"2010","unstructured":"Wu B, Chien EYT, Mol CD et al (2010) Structures of the CXCR4 chemokine GPCR with small-molecule and cyclic peptide antagonists. Science 330:1066\u20131071. https:\/\/doi.org\/10.1126\/science.1194396","journal-title":"Science"},{"key":"630_CR76","doi-asserted-by":"publisher","first-page":"1117","DOI":"10.1126\/science.1261064","volume":"347","author":"L Qin","year":"2015","unstructured":"Qin L, Kufareva I, Holden LG et al (2015) Crystal structure of the chemokine receptor CXCR4 in complex with a viral chemokine. Science (80-) 347:1117\u20131122. https:\/\/doi.org\/10.1126\/science.1261064","journal-title":"Science (80-)"},{"key":"630_CR77","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1124\/mol.119.117168","volume":"96","author":"M Arimont","year":"2019","unstructured":"Arimont M, Hoffmann C, de Graaf C, Leurs R (2019) Chemokine receptor crystal structures: what can be learned from them? Mol Pharmacol 96:765\u2013777. https:\/\/doi.org\/10.1124\/mol.119.117168","journal-title":"Mol Pharmacol"},{"key":"630_CR78","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh454","author":"A Marchler-Bauer","year":"2004","unstructured":"Marchler-Bauer A, Bryant SH (2004) CD-search: protein domain annotations on the fly. Nucleic Acids Res. https:\/\/doi.org\/10.1093\/nar\/gkh454","journal-title":"Nucleic Acids Res"},{"key":"630_CR79","doi-asserted-by":"publisher","first-page":"944","DOI":"10.1021\/ci500091r","volume":"54","author":"PJ Ballester","year":"2014","unstructured":"Ballester PJ, Schreyer A, Blundell TL (2014) Does a more precise chemical description of protein-ligand complexes lead to more accurate prediction of binding affinity? J Chem Inf Model 54:944\u2013955. https:\/\/doi.org\/10.1021\/ci500091r","journal-title":"J Chem Inf Model"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00630-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-022-00630-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00630-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T11:06:17Z","timestamp":1658487977000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-022-00630-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,22]]},"references-count":79,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["630"],"URL":"https:\/\/doi.org\/10.1186\/s13321-022-00630-7","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2022,7,22]]},"assertion":[{"value":"19 November 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 July 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that there are no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"48"}}