{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T16:32:32Z","timestamp":1740155552801,"version":"3.37.3"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,8,13]],"date-time":"2022-08-13T00:00:00Z","timestamp":1660348800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,8,13]],"date-time":"2022-08-13T00:00:00Z","timestamp":1660348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002261","name":"Russian Foundation for Basic Research","doi-asserted-by":"publisher","award":["20-04-60285","20-04-60285","20-04-60285","20-04-60285","20-04-60285"],"id":[{"id":"10.13039\/501100002261","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"Abstract<\/jats:title>\n Motivation<\/jats:title>\n Application of chemical named entity recognition (CNER) algorithms allows retrieval of information from texts about chemical compound identifiers and creates associations with physical\u2013chemical properties and biological activities. Scientific texts represent low-formalized sources of information. Most methods aimed at CNER are based on machine learning approaches, including conditional random fields and deep neural networks. In general, most machine learning approaches require either vector or sparse word representation of texts. Chemical named entities (CNEs) constitute only a small fraction of the whole text, and the datasets used for training are highly imbalanced.<\/jats:p>\n <\/jats:sec>\n Methods and results<\/jats:title>\n We propose a new method for extracting CNEs from texts based on the na\u00efve Bayes classifier combined with specially developed filters. In contrast to the earlier developed CNER methods, our approach uses the representation of the data as a set of fragments of text (FoTs) with the subsequent preparati`on of a set of multi-n<\/jats:italic>-grams (sequences from one to\u00a0n<\/jats:italic>\u00a0symbols) for each FoT. Our approach may provide the recognition of novel CNEs. For CHEMDNER corpus, the values of the sensitivity (recall) was 0.95, precision was 0.74, specificity was 0.88, and balanced accuracy was 0.92 based on five-fold cross validation. We applied the developed algorithm to the extracted CNEs of potential Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) main protease (Mpro) inhibitors. A set of CNEs corresponding to the chemical substances evaluated in the biochemical assays used for the discovery of Mpro inhibitors was retrieved. Manual analysis of the appropriate texts showed that CNEs of potential SARS-CoV-2 Mpro inhibitors were successfully identified by our method.<\/jats:p>\n <\/jats:sec>\n Conclusion<\/jats:title>\n The obtained results show that the proposed method can be used for filtering out words that are not related to CNEs; therefore, it can be successfully applied to the extraction of CNEs for the purposes of cheminformatics and medicinal chemistry.<\/jats:p>\n <\/jats:sec>","DOI":"10.1186\/s13321-022-00633-4","type":"journal-article","created":{"date-parts":[[2022,8,13]],"date-time":"2022-08-13T06:02:45Z","timestamp":1660370565000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Chemical named entity recognition in the texts of scientific publications using the na\u00efve Bayes classifier approach"],"prefix":"10.1186","volume":"14","author":[{"given":"O. A.","family":"Tarasova","sequence":"first","affiliation":[]},{"given":"A. V.","family":"Rudik","sequence":"additional","affiliation":[]},{"given":"N. Yu.","family":"Biziukova","sequence":"additional","affiliation":[]},{"given":"D. A.","family":"Filimonov","sequence":"additional","affiliation":[]},{"given":"V. V.","family":"Poroikov","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,8,13]]},"reference":[{"key":"633_CR1","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1016\/j.tibtech.2010.01.004","volume":"28","author":"L Yao","year":"2010","unstructured":"Yao L, Evans JA, Rzhetsky A (2010) Novel opportunities for computational biology and sociology in drug discovery. Trends Biotechnol 28:161\u2013170. https:\/\/doi.org\/10.1016\/j.tibtech.2010.01.004","journal-title":"Trends Biotechnol"},{"key":"633_CR2","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1021\/acs.jcim.7b00589","volume":"58","author":"SJ Capuzzi","year":"2018","unstructured":"Capuzzi SJ, Thornton TE, Liu K et al (2018) Chemotext: a publicly available web server for mining drug\u2013target\u2013disease relationships in PubMed. J Chem Inf Model 58:212\u2013218. https:\/\/doi.org\/10.1021\/acs.jcim.7b00589","journal-title":"J Chem Inf Model"},{"key":"633_CR3","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1002\/minf.201100023","volume":"30","author":"TI Oprea","year":"2011","unstructured":"Oprea TI, Nielsen SK, Ursu O et al (2011) Associating drugs, targets and clinical outcomes into an integrated network affords a new platform for computer-aided drug repurposing. Mol Inform 30:100\u2013111. https:\/\/doi.org\/10.1002\/minf.201100023","journal-title":"Mol Inform"},{"key":"633_CR4","doi-asserted-by":"publisher","DOI":"10.1289\/EHP3264","volume":"126","author":"G Piir","year":"2018","unstructured":"Piir G, Kahn I, Garc\u00eda-Sosa AT et al (2018) Best practices for QSAR model reporting: physical and chemical properties, ecotoxicity, environmental fate, human health, and toxicokinetics endpoints. Environ Health Perspect 126:126001. https:\/\/doi.org\/10.1289\/EHP3264","journal-title":"Environ Health Perspect"},{"key":"633_CR5","doi-asserted-by":"publisher","first-page":"3635","DOI":"10.1021\/acs.jcim.9b00164","volume":"59","author":"OA Tarasova","year":"2019","unstructured":"Tarasova OA, Biziukova NYu, Filimonov DA et al (2019) Data mining approach for extraction of useful information about biologically active compounds from publications. J Chem Inf Model 59:3635\u20133644. https:\/\/doi.org\/10.1021\/acs.jcim.9b00164","journal-title":"J Chem Inf Model"},{"key":"633_CR6","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1002\/minf.201300144","volume":"33","author":"E Dura","year":"2014","unstructured":"Dura E, Muresan S, Engkvist O et al (2014) Mining molecular pharmacological effects from biomedical text: a case study for eliciting anti-obesity\/diabetes effects of chemical compounds. Mol Inform 33:332\u2013342. https:\/\/doi.org\/10.1002\/minf.201300144","journal-title":"Mol Inform"},{"key":"633_CR7","doi-asserted-by":"publisher","first-page":"7673","DOI":"10.1021\/acs.chemrev.6b00851","volume":"117","author":"M Krallinger","year":"2017","unstructured":"Krallinger M, Rabal O, Louren\u00e7o A et al (2017) Information retrieval and text mining technologies for chemistry. Chem Rev 117:7673\u20137761. https:\/\/doi.org\/10.1021\/acs.chemrev.6b00851","journal-title":"Chem Rev"},{"key":"633_CR8","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1186\/s13321-018-0313-8","volume":"10","author":"P Corbett","year":"2018","unstructured":"Corbett P, Boyle J (2018) Chemlistem: chemical named entity recognition using recurrent neural networks. J Cheminform 10:59. https:\/\/doi.org\/10.1186\/s13321-018-0313-8","journal-title":"J Cheminform"},{"key":"633_CR9","doi-asserted-by":"publisher","first-page":"S10","DOI":"10.1186\/1758-2946-7-S1-S10","volume":"7","author":"SA Akhondi","year":"2015","unstructured":"Akhondi SA, Hettne KM, van der Horst E et al (2015) Recognition of chemical entities: combining dictionary-based and grammar-based approaches. J Cheminform 7:S10. https:\/\/doi.org\/10.1186\/1758-2946-7-S1-S10","journal-title":"J Cheminform"},{"key":"633_CR10","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/1758-2946-6-17","volume":"6","author":"S Eltyeb","year":"2014","unstructured":"Eltyeb S, Salim N (2014) Chemical named entities recognition: a review on approaches and applications. J Cheminform 6:17. https:\/\/doi.org\/10.1186\/1758-2946-6-17","journal-title":"J Cheminform"},{"key":"633_CR11","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1093\/database\/baw049","volume":"2016","author":"Y Zhang","year":"2016","unstructured":"Zhang Y, Xu J, Chen H et al (2016) Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning. Database (Oxford) 2016:49. https:\/\/doi.org\/10.1093\/database\/baw049","journal-title":"Database (Oxford)"},{"key":"633_CR12","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1186\/s12911-017-0468-7","volume":"17","author":"Z Liu","year":"2017","unstructured":"Liu Z, Yang M, Wang X et al (2017) Entity recognition from clinical texts via recurrent neural network. BMC Med Inform Decis Mak 17:67. https:\/\/doi.org\/10.1186\/s12911-017-0468-7","journal-title":"BMC Med Inform Decis Mak"},{"key":"633_CR13","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2020.103539","volume":"110","author":"C Col\u00f3n-Ruiz","year":"2020","unstructured":"Col\u00f3n-Ruiz C, Segura-Bedmar I (2020) Comparing deep learning architectures for sentiment analysis on drug reviews. J Biomed Inform 110:103539. https:\/\/doi.org\/10.1016\/j.jbi.2020.103539","journal-title":"J Biomed Inform"},{"key":"633_CR14","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1186\/s13321-018-0327-2","volume":"11","author":"W Hemati","year":"2019","unstructured":"Hemati W, Mehler A (2019) LSTMVoter: chemical named entity recognition using a conglomerate of sequence labeling tools. J Cheminform 11:3. https:\/\/doi.org\/10.1186\/s13321-018-0327-2","journal-title":"J Cheminform"},{"key":"633_CR15","doi-asserted-by":"publisher","first-page":"506","DOI":"10.1002\/minf.201100005","volume":"30","author":"M Vazquez","year":"2011","unstructured":"Vazquez M, Krallinger M, Leitner F, Valencia A (2011) Text mining for drugs and chemical compounds: methods, tools and applications. Mol Inform 30:506\u2013519. https:\/\/doi.org\/10.1002\/minf.201100005","journal-title":"Mol Inform"},{"key":"633_CR16","doi-asserted-by":"publisher","DOI":"10.3389\/fgene.2020.618862","volume":"11","author":"N Biziukova","year":"2020","unstructured":"Biziukova N, Tarasova O, Ivanov S, Poroikov V (2020) Automated extraction of information from texts of scientific publications: insights Into HIV treatment strategies. Front Genet 11:618862. https:\/\/doi.org\/10.3389\/fgene.2020.618862","journal-title":"Front Genet"},{"key":"633_CR17","doi-asserted-by":"publisher","first-page":"S1","DOI":"10.1186\/1758-2946-7-S1-S1","volume":"7","author":"M Krallinger","year":"2015","unstructured":"Krallinger M, Leitner F, Rabal O et al (2015) CHEMDNER: the drugs and chemical names extraction challenge. J Cheminform 7:S1. https:\/\/doi.org\/10.1186\/1758-2946-7-S1-S1","journal-title":"J Cheminform"},{"key":"633_CR18","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1007\/11683568_8","volume-title":"Knowledge discovery in life science literature","author":"Y-C Wu","year":"2006","unstructured":"Wu Y-C, Fan T-K, Lee Y-S, Yen S-J (2006) Extracting named entities using support vector machines. In: Bremer EG, Hakenberg J, Han E-H et al (eds) Knowledge discovery in life science literature. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 91\u2013103"},{"issue":"1","key":"633_CR19","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1186\/s13321-018-0280-0","volume":"10","author":"I Korvigo","year":"2018","unstructured":"Korvigo I, Holmatov M, Zaikovskii A et al (2018) Putting hands to rest: efficient deep CNN-RNN architecture for chemical named entity recognition with no hand-crafted rules. J Cheminform 10(1):28. https:\/\/doi.org\/10.1186\/s13321-018-0280-0","journal-title":"J Cheminform"},{"key":"633_CR20","doi-asserted-by":"publisher","first-page":"E2751","DOI":"10.3390\/molecules23112751","volume":"23","author":"O Tarasova","year":"2018","unstructured":"Tarasova O, Biziukova N, Filimonov D, Poroikov V (2018) A computational approach for the prediction of HIV resistance based on amino acid and nucleotide descriptors. Molecules 23:E2751. https:\/\/doi.org\/10.3390\/molecules23112751","journal-title":"Molecules"},{"key":"633_CR21","doi-asserted-by":"publisher","DOI":"10.1155\/2021\/7085412","author":"Y Li","year":"2021","unstructured":"Li Y, Yin C (2021) Application of dual-channel convolutional neural network algorithm in semantic feature analysis of english text big data. Comput Intell Neurosci. https:\/\/doi.org\/10.1155\/2021\/7085412","journal-title":"Comput Intell Neurosci"},{"key":"633_CR22","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2020.103381","volume":"103","author":"M Cho","year":"2020","unstructured":"Cho M, Ha J, Park C, Park S (2020) Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition. J Biomed Inform 103:103381. https:\/\/doi.org\/10.1016\/j.jbi.2020.103381","journal-title":"J Biomed Inform"},{"key":"633_CR23","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2019.103252","volume":"96","author":"Y Chen","year":"2019","unstructured":"Chen Y, Zhou C, Li T et al (2019) Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training. J Biomed Inform 96:103252. https:\/\/doi.org\/10.1016\/j.jbi.2019.103252","journal-title":"J Biomed Inform"},{"key":"633_CR24","doi-asserted-by":"publisher","first-page":"103609","DOI":"10.1016\/j.jbi.2020.103609","volume":"112","author":"S Gajendran","year":"2020","unstructured":"Gajendran S, Manjula D, Sugumaran V (2020) Character level and word level embedding with bidirectional LSTM\u2014dynamic recurrent neural network for biomedical named entity recognition from literature. J Biomed Inform 112:103609. https:\/\/doi.org\/10.1016\/j.jbi.2020.103609","journal-title":"J Biomed Inform"},{"key":"633_CR25","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1186\/1471-2105-9-207","volume":"9","author":"M Bundschus","year":"2008","unstructured":"Bundschus M, Dejori M, Stetter M et al (2008) Extraction of semantic biomedical relations from text using conditional random fields. BMC Bioinformatics 9:207. https:\/\/doi.org\/10.1186\/1471-2105-9-207","journal-title":"BMC Bioinformatics"},{"key":"633_CR26","doi-asserted-by":"publisher","first-page":"4943","DOI":"10.1609\/aaai.v33i01.33014943","volume":"33","author":"H Shu","year":"2019","unstructured":"Shu H, Zhu H (2019) Sensitivity analysis of deep neural networks. AAAI 33:4943\u20134950. https:\/\/doi.org\/10.1609\/aaai.v33i01.33014943","journal-title":"AAAI"},{"key":"633_CR27","doi-asserted-by":"publisher","first-page":"1065","DOI":"10.1109\/TMI.2020.3046692","volume":"40","author":"Z Li","year":"2021","unstructured":"Li Z, Kamnitsas K, Glocker B (2021) Analyzing overfitting under class imbalance in neural networks for image segmentation. IEEE Trans Med Imaging 40:1065\u20131077. https:\/\/doi.org\/10.1109\/TMI.2020.3046692","journal-title":"IEEE Trans Med Imaging"},{"key":"633_CR28","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1080\/17460441.2016.1201262","volume":"11","author":"II Baskin","year":"2016","unstructured":"Baskin II, Winkler D, Tetko IV (2016) A renaissance of neural networks in drug discovery. Expert Opin Drug Discov 11:785\u2013795. https:\/\/doi.org\/10.1080\/17460441.2016.1201262","journal-title":"Expert Opin Drug Discov"},{"issue":"4","key":"633_CR29","doi-asserted-by":"publisher","DOI":"10.1016\/j.patter.2022.100488","volume":"3","author":"A Trewartha","year":"2022","unstructured":"Trewartha A, Walker N, Huo H et al (2022) Quantifying the advantage of domain-specific pre-training on named entity recognition tasks in materials science. Patterns 3(4):100488. https:\/\/doi.org\/10.1016\/j.patter.2022.100488","journal-title":"Patterns"},{"key":"633_CR30","doi-asserted-by":"publisher","first-page":"1650040","DOI":"10.1142\/S0219720016500402","volume":"15","author":"O Tarasova","year":"2017","unstructured":"Tarasova O, Filimonov D, Poroikov V (2017) PASS-based approach to predict HIV-1 reverse transcriptase resistance. J Bioinform Comput Biol 15:1650040. https:\/\/doi.org\/10.1142\/S0219720016500402","journal-title":"J Bioinform Comput Biol"},{"key":"633_CR31","doi-asserted-by":"publisher","first-page":"E956","DOI":"10.3390\/molecules23040956","volume":"23","author":"O Tarasova","year":"2018","unstructured":"Tarasova O, Poroikov V (2018) HIV resistance prediction to reverse transcriptase inhibitors: focus on open data. Molecules 23:E956. https:\/\/doi.org\/10.3390\/molecules23040956","journal-title":"Molecules"},{"key":"633_CR32","unstructured":"https:\/\/pubmed.ncbi.nlm.nih.gov\/"},{"key":"633_CR33","unstructured":"https:\/\/www.nltk.org\/api\/nltk.tokenize.html"},{"key":"633_CR34","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1039\/9781847558879-00182","volume-title":"Chemoinformatics approaches to virtual screening","author":"DA Filimonov","year":"2008","unstructured":"Filimonov DA, Poroikov VV (2008) Probabilistic approach in activity prediction. In: Varnek A, Tropsha A (eds) Chemoinformatics approaches to virtual screening. RSC Publishing, Cambridge, pp 182\u2013216"},{"key":"633_CR35","unstructured":"www.uniprot.org"},{"key":"633_CR36","unstructured":"https:\/\/pubchem.ncbi.nlm.nih.gov\/"},{"key":"633_CR37","unstructured":"https:\/\/www.ebi.ac.uk\/chembl\/"},{"key":"633_CR38","doi-asserted-by":"crossref","unstructured":"Self WH, Semler MW, Leither LM, Casey JD, Angus DC, Brower RG, Chang SY, Collins SP, Eppensteiner JC, Filbin MR, Files DC, Gibbs KW, Ginde AA, Gong MN, Harrell FE, Hayden DL, Hough CL, Johnson NJ, Khan A, Diercks D (2020) Effect of hydroxychloroquine on clinical status at 14 days in hospitalized patients with COVID19: a randomized clinical trial. JAMA 324 (21): 2165\u20132176. they are not considered as effective medications against COVID-19","DOI":"10.1001\/jama.2020.22240"},{"key":"633_CR39","doi-asserted-by":"publisher","first-page":"1693","DOI":"10.1016\/j.ijbiomac.2020.07.235","volume":"164","author":"O Abian","year":"2020","unstructured":"Abian O, Ortega-Alarcon D, Jimenez-Alesanco A et al (2020) Structural stability of SARS-CoV-2 3CLpro and identification of quercetin as an inhibitor by experimental screening. Int J Biol Macromol 164:1693\u20131703. https:\/\/doi.org\/10.1016\/j.ijbiomac.2020.07.235","journal-title":"Int J Biol Macromol"},{"key":"633_CR40","doi-asserted-by":"publisher","first-page":"6559","DOI":"10.3390\/molecules26216559","volume":"26","author":"AA Al-Karmalawy","year":"2021","unstructured":"Al-Karmalawy AA, Farid MM, Mostafa A et al (2021) Naturally available flavonoid aglycones as potential antiviral drug candidates against SARS-CoV-2. Molecules 26:6559. https:\/\/doi.org\/10.3390\/molecules26216559","journal-title":"Molecules"},{"key":"633_CR41","doi-asserted-by":"publisher","DOI":"10.1016\/j.phymed.2021.153704","volume":"91","author":"T Xiao","year":"2021","unstructured":"Xiao T, Wei Y, Cui M et al (2021) Effect of dihydromyricetin on SARS-CoV-2 viral replication and pulmonary inflammation and fibrosis. Phytomedicine 91:153704. https:\/\/doi.org\/10.1016\/j.phymed.2021.153704","journal-title":"Phytomedicine"},{"key":"633_CR42","doi-asserted-by":"publisher","first-page":"1683","DOI":"10.1021\/acs.jcim.0c01054","volume":"61","author":"OA Tarasova","year":"2021","unstructured":"Tarasova OA, Biziukova NY, Rudik AV et al (2021) Extraction of data on parent compounds and their metabolites from texts of scientific abstracts. J Chem Inf Model 61:1683\u20131690. https:\/\/doi.org\/10.1021\/acs.jcim.0c01054","journal-title":"J Chem Inf Model"},{"key":"633_CR43","unstructured":"Lafferty J, McCallum A, Pereira FCN (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. University of Pennsylvania. Departmental Papers (CIS). 2001"},{"key":"633_CR44","doi-asserted-by":"publisher","unstructured":"Tang B, Feng Y, Wang X, et al (2015) A comparison of conditional random fields and structured support vector machines for chemical entity recognition in biomedical literature. J Cheminform 7(Suppl 1 Text mining for chemistry and the CHEMDNER track): S8. https:\/\/doi.org\/10.1186\/1758-2946-7-S1-S8","DOI":"10.1186\/1758-2946-7-S1-S8"},{"issue":"8","key":"633_CR45","doi-asserted-by":"publisher","first-page":"1381","DOI":"10.1093\/bioinformatics\/btx761","volume":"34","author":"L Luo","year":"2018","unstructured":"Luo L, Yang Z, Yang P et al (2018) An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition. Bioinformatics 34(8):1381\u20131388. https:\/\/doi.org\/10.1093\/bioinformatics\/btx761","journal-title":"Bioinformatics"},{"issue":"4","key":"633_CR46","doi-asserted-by":"publisher","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","volume":"36","author":"J Lee","year":"2019","unstructured":"Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, Kang J (2019) BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4):1234\u20131240. https:\/\/doi.org\/10.1093\/bioinformatics\/btz682","journal-title":"Bioinformatics"},{"key":"633_CR47","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1517\/17460441.2013.761204","volume":"8","author":"S Kar","year":"2013","unstructured":"Kar S, Roy K (2013) How far can virtual screening take us in drug discovery? Expert Opin Drug Discov 8:245\u2013261. https:\/\/doi.org\/10.1517\/17460441.2013.761204","journal-title":"Expert Opin Drug Discov"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00633-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-022-00633-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00633-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,13]],"date-time":"2022-08-13T06:03:29Z","timestamp":1660370609000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-022-00633-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,13]]},"references-count":47,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["633"],"URL":"https:\/\/doi.org\/10.1186\/s13321-022-00633-4","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2022,8,13]]},"assertion":[{"value":"4 April 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 July 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 August 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"55"}}