{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,10]],"date-time":"2024-09-10T12:36:09Z","timestamp":1725971769364},"reference-count":62,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,2,23]],"date-time":"2023-02-23T00:00:00Z","timestamp":1677110400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,2,23]],"date-time":"2023-02-23T00:00:00Z","timestamp":1677110400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100007219","name":"Natural Science Foundation of Shanghai","doi-asserted-by":"publisher","award":["19ZR1435900"],"id":[{"id":"10.13039\/100007219","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"Abstract<\/jats:title>Existing molecular property prediction methods based on deep learning ignore the generalization ability of the nonlinear representation of molecular features and the reasonable assignment of weights of molecular features, making it difficult to further improve the accuracy of molecular property prediction. To solve the above problems, an end-to-end double-head transformer neural network (DHTNN) is proposed in this paper for high-precision molecular property prediction. For the data distribution characteristics of the molecular dataset, DHTNN specially designs a new activation function, beaf, which can greatly improve the generalization ability of the nonlinear representation of molecular features. A residual network is introduced in the molecular encoding part to solve the gradient explosion problem and ensure that the model can converge quickly. The transformer based on double-head attention is used to extract molecular intrinsic detail features, and the weights are reasonably assigned for predicting molecular properties with high accuracy. Our model, which was tested on the MoleculeNet [1] benchmark dataset, showed significant performance improvements over other state-of-the-art methods.<\/jats:p>","DOI":"10.1186\/s13321-023-00700-4","type":"journal-article","created":{"date-parts":[[2023,2,23]],"date-time":"2023-02-23T14:04:49Z","timestamp":1677161089000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Double-head transformer neural network for molecular property prediction"],"prefix":"10.1186","volume":"15","author":[{"given":"Yuanbing","family":"Song","sequence":"first","affiliation":[]},{"given":"Jinghua","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Wenju","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Gang","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Zhichong","family":"Ma","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,2,23]]},"reference":[{"issue":"2","key":"700_CR1","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) Moleculenet: a benchmark for molecular machine learning. Chem Sci 9(2):513\u2013530","journal-title":"Chem Sci"},{"key":"700_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2021\/7181815","volume":"2021","author":"J Li","year":"2021","unstructured":"Li J, Jiang X (2021) Mol-bert: an effective molecular representation with bert for molecular property prediction. Wirel Commun Mob Comput 2021:1\u20137. https:\/\/doi.org\/10.1155\/2021\/7181815","journal-title":"Wirel Commun Mob Comput"},{"issue":"2","key":"700_CR3","doi-asserted-by":"publisher","first-page":"899","DOI":"10.1007\/s11030-020-10074-6","volume":"25","author":"CA Toussi","year":"2021","unstructured":"Toussi CA, Haddadnia J, Matta CF (2021) Drug design by machine-trained elastic networks: predicting ser\/thr-protein kinase inhibitors\u2019 activities. Mol Divers 25(2):899\u2013909","journal-title":"Mol Divers"},{"issue":"1","key":"700_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s43246-021-00194-3","volume":"2","author":"J Cheng","year":"2021","unstructured":"Cheng J, Zhang C, Dong L (2021) A geometric-information-enhanced crystal graph network for predicting properties of materials. Commun Mater 2(1):1\u201311","journal-title":"Commun Mater"},{"issue":"3","key":"700_CR5","doi-asserted-by":"publisher","first-page":"813","DOI":"10.1093\/bioinformatics\/btz645","volume":"36","author":"G Woo","year":"2020","unstructured":"Woo G, Fernandez M, Hsing M, Lack NA, Cavga AD, Cherkasov A (2020) Deepcop: deep learning-based approach to predict gene regulating effects of small molecules. Bioinformatics 36(3):813\u2013818","journal-title":"Bioinformatics"},{"key":"700_CR6","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-17281-1","volume-title":"A primer on QSAR\/QSPR modeling: fundamental concepts","author":"K Roy","year":"2015","unstructured":"Roy K, Kar S, Das RN (2015) A primer on QSAR\/QSPR modeling: fundamental concepts. Springer, New York"},{"issue":"4","key":"700_CR7","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1039\/cs9952400279","volume":"24","author":"AR Katritzky","year":"1995","unstructured":"Katritzky AR, Lobanov VS, Karelson M (1995) Qspr: the correlation and quantitative prediction of chemical and physical properties from structure. Chem Soc Rev 24(4):279\u2013287","journal-title":"Chem Soc Rev"},{"key":"700_CR8","doi-asserted-by":"crossref","unstructured":"Yee LC, Wei YC (2012) Current modeling methods used in QSAR\/QSPR. In: Statistical modelling of molecular descriptors in QSAR\/QSPR, vol 2, pp 1\u201331","DOI":"10.1002\/9783527645121.ch1"},{"issue":"4","key":"700_CR9","doi-asserted-by":"publisher","first-page":"285","DOI":"10.2174\/138920010791514306","volume":"11","author":"M Tareq Hassan Khan","year":"2010","unstructured":"Tareq Hassan Khan M (2010) Predictions of the admet properties of candidate drug molecules utilizing different qsar\/qspr modelling approaches. Curr Drug Metab 11(4):285\u2013295","journal-title":"Curr Drug Metab"},{"issue":"3","key":"700_CR10","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1002\/jcc.21351","volume":"31","author":"D-S Cao","year":"2010","unstructured":"Cao D-S, Liang Y-Z, Xu Q-S, Li H-D, Chen X (2010) A new strategy of outlier detection for qsar\/qspr. J Comput Chem 31(3):592\u2013602","journal-title":"J Comput Chem"},{"key":"700_CR11","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1016\/j.ddtec.2020.05.001","volume":"32","author":"J Shen","year":"2019","unstructured":"Shen J, Nicolaou CA (2019) Molecular property prediction: recent trends in the era of artificial intelligence. Drug Discov Today Technol 32:29\u201336","journal-title":"Drug Discov Today Technol"},{"issue":"2","key":"700_CR12","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1021\/acs.accounts.0c00699","volume":"54","author":"WP Walters","year":"2020","unstructured":"Walters WP, Barzilay R (2020) Applications of deep learning in molecule generation and molecular property prediction. Acc Chem Res 54(2):263\u2013270","journal-title":"Acc Chem Res"},{"issue":"10","key":"700_CR13","doi-asserted-by":"publisher","first-page":"2520","DOI":"10.3390\/molecules23102520","volume":"23","author":"G Hessler","year":"2018","unstructured":"Hessler G, Baringhaus K-H (2018) Artificial intelligence in drug design. Molecules 23(10):2520","journal-title":"Molecules"},{"issue":"20","key":"700_CR14","doi-asserted-by":"publisher","first-page":"2233","DOI":"10.1002\/cphc.202000518","volume":"21","author":"J Gasteiger","year":"2020","unstructured":"Gasteiger J (2020) Chemistry in times of artificial intelligence. ChemPhysChem 21(20):2233\u20132242","journal-title":"ChemPhysChem"},{"issue":"11","key":"700_CR15","doi-asserted-by":"publisher","first-page":"5255","DOI":"10.1021\/acs.jctc.7b00577","volume":"13","author":"FA Faber","year":"2017","unstructured":"Faber FA, Hutchison L, Huang B, Gilmer J, Schoenholz SS, Dahl GE, Vinyals O, Kearnes S, Riley PF, Von Lilienfeld OA (2017) Prediction errors of molecular machine learning models lower than hybrid dft error. J Chem Theory Comput 13(11):5255\u20135264","journal-title":"J Chem Theory Comput"},{"issue":"2","key":"700_CR16","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J Royal Stat Soc Ser B (Stat Methodol) 67(2):301\u2013320","journal-title":"J Royal Stat Soc Ser B (Stat Methodol)"},{"issue":"8","key":"700_CR17","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1007\/s10822-016-9938-8","volume":"30","author":"S Kearnes","year":"2016","unstructured":"Kearnes S, McCloskey K, Berndl M, Pande V, Riley P (2016) Molecular graph convolutions: moving beyond fingerprints. J Comput Aided Mol Des 30(8):595\u2013608","journal-title":"J Comput Aided Mol Des"},{"issue":"6","key":"700_CR18","doi-asserted-by":"publisher","first-page":"1204","DOI":"10.1016\/j.chempr.2020.05.002","volume":"6","author":"L Pattanaik","year":"2020","unstructured":"Pattanaik L, Coley CW (2020) Molecular representation: going long on fingerprints. Chem 6(6):1204\u20131207","journal-title":"Chem"},{"issue":"8","key":"700_CR19","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M et al (2019) Analyzing learned molecular representations for property prediction. J Chem Inf Model 59(8):3370\u20133388","journal-title":"J Chem Inf Model"},{"issue":"1","key":"700_CR20","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1021\/acs.jctc.7b01157","volume":"14","author":"JL McDonagh","year":"2017","unstructured":"McDonagh JL, Silva AF, Vincent MA, Popelier PL (2017) Machine learning of dynamic electron correlation energies from topological atoms. J Chem Theory Comput 14(1):216\u2013224","journal-title":"J Chem Theory Comput"},{"issue":"2\u20133","key":"700_CR21","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1016\/j.tox.2005.08.019","volume":"217","author":"C Zhao","year":"2006","unstructured":"Zhao C, Zhang H, Zhang X, Liu M, Hu Z, Fan B (2006) Application of support vector machine (svm) for prediction toxic activity of different data sets. Toxicology 217(2\u20133):105\u2013119","journal-title":"Toxicology"},{"key":"700_CR22","doi-asserted-by":"publisher","DOI":"10.1142\/5589","volume-title":"Support vector machine in chemistry","author":"N Chen","year":"2004","unstructured":"Chen N (2004) Support vector machine in chemistry. World Scientific, Singapore"},{"issue":"1","key":"700_CR23","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1517\/17460441.2014.866943","volume":"9","author":"K Heikamp","year":"2014","unstructured":"Heikamp K, Bajorath J (2014) Support vector machines for drug discovery. Expert Opin Drug Discov 9(1):93\u2013104","journal-title":"Expert Opin Drug Discov"},{"issue":"3","key":"700_CR24","doi-asserted-by":"publisher","first-page":"3167","DOI":"10.1021\/acsanm.1c00384","volume":"4","author":"B Zheng","year":"2021","unstructured":"Zheng B, Gu GX (2021) Prediction of graphene oxide functionalization using gradient boosting: implications for material chemical composition identification. ACS Appl Nano Mater 4(3):3167\u20133174","journal-title":"ACS Appl Nano Mater"},{"key":"700_CR25","doi-asserted-by":"publisher","first-page":"104554","DOI":"10.1016\/j.chemolab.2022.104554","volume":"224","author":"J Krmar","year":"2022","unstructured":"Krmar J, D\u017eigal M, Stojkovi\u0107 J, Proti\u0107 A, Ota\u0161evi\u0107 B (2022) Gradient boosted tree model: a fast track tool for predicting the atmospheric pressure chemical ionization-mass spectrometry signal of antipsychotics based on molecular features and experimental settings. Chemom Intell Lab Syst 224:104554","journal-title":"Chemom Intell Lab Syst"},{"issue":"6","key":"700_CR26","doi-asserted-by":"publisher","first-page":"2697","DOI":"10.1021\/acs.jcim.0c01489","volume":"61","author":"D Deng","year":"2021","unstructured":"Deng D, Chen X, Zhang R, Lei Z, Wang X, Zhou F (2021) Xgraphboost: extracting graph neural network-based features for a better prediction of molecular properties. J Chem Inform Model 61(6):2697\u20132705","journal-title":"J Chem Inform Model"},{"key":"700_CR27","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/4987639","author":"J Wu","year":"2022","unstructured":"Wu J, Kong L, Yi M, Chen Q, Cheng Z, Zuo H, Yang Y (2022) Prediction and screening model for products based on fusion regression and xgboost classification. Comput Intell Neurosci. https:\/\/doi.org\/10.1155\/2022\/4987639","journal-title":"Comput Intell Neurosci"},{"key":"700_CR28","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2204.07532","author":"H Tian","year":"2022","unstructured":"Tian H, Ketkar R, Tao P (2022) Accurate admet prediction with xgboost. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.2204.07532","journal-title":"arXiv Preprint"},{"issue":"11\u201312","key":"700_CR29","doi-asserted-by":"publisher","first-page":"1900038","DOI":"10.1002\/minf.201900038","volume":"38","author":"A Paul","year":"2019","unstructured":"Paul A, Furmanchuk A, Liao W-K, Choudhary A, Agrawal A (2019) Property prediction of organic donor molecules for photovoltaic applications using extremely randomized trees. Mol Inform 38(11\u201312):1900038","journal-title":"Mol Inform"},{"issue":"1","key":"700_CR30","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/S0169-7439(97)00061-0","volume":"39","author":"D Svozil","year":"1997","unstructured":"Svozil D, Kvasnicka V, Pospichal J (1997) Introduction to multi-layer feed-forward neural networks. Chemom Intell Lab Syst 39(1):43\u201362","journal-title":"Chemom Intell Lab Syst"},{"issue":"7","key":"700_CR31","doi-asserted-by":"publisher","first-page":"1563","DOI":"10.1021\/ci400187y","volume":"53","author":"A Lusci","year":"2013","unstructured":"Lusci A, Pollastri G, Baldi P (2013) Deep architectures and deep learning in chemoinformatics: the prediction of aqueous solubility for drug-like molecules. J Chem Inform Model 53(7):1563\u20131575","journal-title":"J Chem Inform Model"},{"issue":"1","key":"700_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/ncomms13890","volume":"8","author":"KT Sch\u00fctt","year":"2017","unstructured":"Sch\u00fctt KT, Arbabzadah F, Chmiela S, M\u00fcller KR, Tkatchenko A (2017) Quantum-chemical insights from deep tensor neural networks. Nat Commun 8(1):1\u20138","journal-title":"Nat Commun"},{"key":"700_CR33","unstructured":"Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE (2017) Neural message passing for quantum chemistry. In: international conference on machine learning. PMLR, 1263\u20131272"},{"key":"700_CR34","unstructured":"Ramsundar B (2018) Molecular machine learning with deepchem. PhD thesis, Stanford University"},{"issue":"1","key":"700_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-019-0407-y","volume":"12","author":"M Withnall","year":"2020","unstructured":"Withnall M, Lindel\u00f6f E, Engkvist O, Chen H (2020) Building attention and edge message passing neural networks for bioactivity and physical-chemical property prediction. J Cheminform 12(1):1\u201318","journal-title":"J Cheminform"},{"key":"700_CR36","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2002.08264","author":"\u0141 Maziarka","year":"2020","unstructured":"Maziarka \u0141, Danel T, Mucha S, Rataj K, Tabor J, Jastrz\u0119bski S (2020) Molecule attention transformer. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.2002.08264","journal-title":"arXiv Preprint"},{"issue":"9","key":"700_CR37","doi-asserted-by":"publisher","first-page":"3817","DOI":"10.1021\/acs.jcim.9b00410","volume":"59","author":"X Wang","year":"2019","unstructured":"Wang X, Li Z, Jiang M, Wang S, Zhang S, Wei Z (2019) Molecule property prediction based on spatial graph embedding. J Chem Inform Model 59(9):3817\u20133828","journal-title":"J Chem Inform Model"},{"issue":"1","key":"700_CR38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-20314-w","volume":"12","author":"D Chen","year":"2021","unstructured":"Chen D, Gao K, Nguyen DD, Chen X, Jiang Y, Wei G-W, Pan F (2021) Algebraic graph-assisted bidirectional transformers for molecular property prediction. Nat Commun 12(1):1\u20139","journal-title":"Nat Commun"},{"issue":"17","key":"700_CR39","doi-asserted-by":"publisher","first-page":"1604","DOI":"10.1002\/cmdc.201900458","volume":"14","author":"H Cho","year":"2019","unstructured":"Cho H, Choi IS (2019) Enhanced deep-learning prediction of molecular properties via augmentation of bond topology. ChemMedChem 14(17):1604\u20131609","journal-title":"ChemMedChem"},{"key":"700_CR40","doi-asserted-by":"publisher","unstructured":"Sun F-Y, Hoffmann J, Verma V, Tang J (2019) Infograph: Unsupervised and semi-supervised graph-level representation learning via mutual information maximization. Arxiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.1908.01000","DOI":"10.48550\/arXiv.1908.01000"},{"key":"700_CR41","doi-asserted-by":"crossref","unstructured":"Meng M, Wei Z, Li Z, Jiang M, Bian Y (2019) Property prediction of molecules in graph convolutional neural network expansion. In: 2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS). IEEE, 263\u2013266","DOI":"10.1109\/ICSESS47205.2019.9040723"},{"key":"700_CR42","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1905.12265","author":"W Hu","year":"2019","unstructured":"Hu W, Liu B, Gomes J, Zitnik M, Liang P, Pande V, Leskovec J (2019) Strategies for pre-training graph neural networks. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.1905.12265","journal-title":"arXiv Preprint"},{"key":"700_CR43","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1901.01484","author":"R Liao","year":"2019","unstructured":"Liao R, Zhao Z, Urtasun R, Zemel RS (2019) Lanczosnet: multi-scale deep graph convolutional networks. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.1901.01484","journal-title":"arXiv Preprint"},{"key":"700_CR44","first-page":"10383","volume":"33","author":"Z Chen","year":"2020","unstructured":"Chen Z, Chen L, Villar S, Bruna J (2020) Can graph neural networks count substructures? Adv Neural Inform Process Syst 33:10383\u201310395","journal-title":"Adv Neural Inform Process Syst"},{"key":"700_CR45","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2005.13607","author":"H Ma","year":"2020","unstructured":"Ma H, Bian Y, Rong Y, Huang W, Xu T, Xie W, Ye G, Huang J (2020) Multi-view graph neural networks for molecular property prediction. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.2005.13607","journal-title":"arXiv Preprint"},{"key":"700_CR46","doi-asserted-by":"publisher","unstructured":"Chen B, B\u00e9cigneul G, Ganea O-E, Barzilay R, Jaakkola T (2020) Optimal transport graph neural networks. Arxiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.2006.04804","DOI":"10.48550\/arXiv.2006.04804"},{"issue":"1","key":"700_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-0414-z","volume":"12","author":"B Tang","year":"2020","unstructured":"Tang B, Kramer ST, Fang M, Qiu Y, Wu Z, Xu D (2020) A self-attention based message passing neural network for predicting molecular lipophilicity and aqueous solubility. J Cheminform 12(1):1\u20139","journal-title":"J Cheminform"},{"key":"700_CR48","doi-asserted-by":"publisher","first-page":"128817","DOI":"10.1016\/j.cej.2021.128817","volume":"414","author":"Y Li","year":"2021","unstructured":"Li Y, Li P, Yang X, Hsieh C-Y, Zhang S, Wang X, Lu R, Liu H, Yao X (2021) Introducing block design in graph neural networks for molecular properties prediction. Chem Eng J 414:128817","journal-title":"Chem Eng J"},{"key":"700_CR49","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2006.04804","author":"G Larsson","year":"2016","unstructured":"Larsson G, Maire M, Shakhnarovich G (2016) Fractalnet: ultra-deep neural networks without residuals. arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.2006.04804","journal-title":"arXiv Preprint"},{"issue":"D1","key":"700_CR50","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1093\/nar\/gkr777","volume":"40","author":"A Gaulton","year":"2012","unstructured":"Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B et al (2012) Chembl: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40(D1):1100\u20131107","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"700_CR51","doi-asserted-by":"publisher","first-page":"2977","DOI":"10.1021\/jm030580l","volume":"47","author":"R Wang","year":"2004","unstructured":"Wang R, Fang X, Lu Y, Wang S (2004) The pdbbind database: collection of binding affinities for protein- ligand complexes with known three-dimensional structures. J Med Chem 47(12):2977\u20132980","journal-title":"J Med Chem"},{"issue":"12","key":"700_CR52","doi-asserted-by":"publisher","first-page":"4111","DOI":"10.1021\/jm048957q","volume":"48","author":"R Wang","year":"2005","unstructured":"Wang R, Fang X, Lu Y, Yang C-Y, Wang S (2005) The pdbbind database: methodologies and updates. J Med Chem 48(12):4111\u20134119","journal-title":"J Med Chem"},{"issue":"3","key":"700_CR53","doi-asserted-by":"publisher","first-page":"405","DOI":"10.1093\/bioinformatics\/btu626","volume":"31","author":"Z Liu","year":"2015","unstructured":"Liu Z, Li Y, Han L, Li J, Liu J, Zhao Z, Nie W, Liu Y, Wang R (2015) Pdb-wide collection of binding data: current status of the pdbbind database. Bioinformatics 31(3):405\u2013412","journal-title":"Bioinformatics"},{"issue":"D1","key":"700_CR54","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1093\/nar\/gkr1132","volume":"40","author":"Y Wang","year":"2012","unstructured":"Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Zhou Z, Han L, Karapetyan K, Dracheva S, Shoemaker BA et al (2012) Pubchem\u2019s bioassay database. Nucleic Acids Res 40(D1):400\u2013412","journal-title":"Nucleic Acids Res"},{"issue":"10","key":"700_CR55","doi-asserted-by":"publisher","first-page":"1936","DOI":"10.1021\/acs.jcim.6b00290","volume":"56","author":"G Subramanian","year":"2016","unstructured":"Subramanian G, Ramsundar B, Pande V, Denny RA (2016) Computational modeling of $$\\beta$$-secretase 1 (bace-1) inhibitors using ligand based approaches. J Chem Inform Model 56(10):1936\u20131949","journal-title":"J Chem Inform Model"},{"key":"700_CR56","doi-asserted-by":"publisher","first-page":"85","DOI":"10.3389\/fenvs.2015.00085","volume":"3","author":"R Huang","year":"2016","unstructured":"Huang R, Xia M, Nguyen D-T, Zhao T, Sakamuru S, Zhao J, Shahane SA, Rossoshek A, Simeonov A (2016) Tox21challenge to build predictive models of nuclear receptor and stress response pathways as mediated by exposure to environmental chemicals and drugs. Front Environ Sci 3:85","journal-title":"Front Environ Sci"},{"issue":"D1","key":"700_CR57","doi-asserted-by":"publisher","first-page":"1075","DOI":"10.1093\/nar\/gkv1075","volume":"44","author":"M Kuhn","year":"2016","unstructured":"Kuhn M, Letunic I, Jensen LJ, Bork P (2016) The sider database of drugs and side effects. Nucleic Acids Res 44(D1):1075\u20131079","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"700_CR58","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1021\/acscentsci.6b00367","volume":"3","author":"H Altae-Tran","year":"2017","unstructured":"Altae-Tran H, Ramsundar B, Pappu AS, Pande V (2017) Low data drug discovery with one-shot learning. ACS Cent Sci 3(4):283\u2013293","journal-title":"ACS Cent Sci"},{"key":"700_CR59","doi-asserted-by":"crossref","unstructured":"Davis J, Goadrich M (2006) The relationship between precision-recall and roc curves. In: proceedings of the 23rd international conference on machine learning, 233\u2013240","DOI":"10.1145\/1143844.1143874"},{"key":"700_CR60","unstructured":"Landrum G et al (2013) Rdkit: A software suite for cheminformatics, computational chemistry, and predictive modeling. Academic Press, Cambridge, Massachusetts, USA"},{"issue":"1","key":"700_CR61","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) Smiles, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inform Comput Sci 28(1):31\u201336","journal-title":"J Chem Inform Comput Sci"},{"key":"700_CR62","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1602.06289","author":"S Jastrz\u0119bski","year":"2016","unstructured":"Jastrz\u0119bski S, Le\u015bniak D, Czarnecki WM (2016) Learning to SMILE(S). arXiv Preprint. https:\/\/doi.org\/10.48550\/arXiv.1602.06289","journal-title":"arXiv Preprint"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00700-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00700-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00700-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T15:59:22Z","timestamp":1701964762000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00700-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,23]]},"references-count":62,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["700"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00700-4","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,23]]},"assertion":[{"value":"5 May 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 February 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 February 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"27"}}