{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,11,22]],"date-time":"2023-11-22T00:43:04Z","timestamp":1700613784095},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,7]],"date-time":"2023-09-07T00:00:00Z","timestamp":1694044800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,7]],"date-time":"2023-09-07T00:00:00Z","timestamp":1694044800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["NRF-2020R1A2C2004628","NRF-2020R1A2C2004628","NRF-2020R1A2C2004628"],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010418","name":"Institute for Information and Communications Technology Promotion","doi-asserted-by":"publisher","award":["No.2019-0-01842","No.2019-0-01842","No.2019-0-01842"],"id":[{"id":"10.13039\/501100010418","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"Abstract<\/jats:title>In recent years, the field of computational drug design has made significant strides in the development of artificial intelligence (AI) models for the generation of de novo chemical compounds with desired properties and biological activities, such as enhanced binding affinity to target proteins. These high-affinity compounds have the potential to be developed into more potent therapeutics for a broad spectrum of diseases. Due to the lack of data required for the training of deep generative models, however, some of these approaches have fine-tuned their molecular generators using data obtained from a separate predictor. While these studies show that generative models can produce structures with the desired target properties, it remains unclear whether the diversity of the generated structures and the span of their chemical space align with the distribution of the intended target molecules. In this study, we present a novel generative framework, LOGICS, a framework for Learning Optimal Generative distribution Iteratively for designing target-focused Chemical Structures. We address the exploration\u2014exploitation dilemma, which weighs the choice between exploring new options and exploiting current knowledge. To tackle this issue, we incorporate experience memory and employ a layered tournament selection approach to refine the fine-tuning process. The proposed method was applied to the binding affinity optimization of two target proteins of different protein classes, \u03ba-opioid receptors, and PIK3CA, and the quality and the distribution of the generative molecules were evaluated. The results showed that LOGICS outperforms competing state-of-the-art models and generates more diverse de novo chemical structures with optimized properties. The source code is available at the GitHub repository (https:\/\/github.com\/GIST-CSBL\/LOGICS<\/jats:ext-link>).<\/jats:p>","DOI":"10.1186\/s13321-023-00747-3","type":"journal-article","created":{"date-parts":[[2023,9,7]],"date-time":"2023-09-07T03:14:59Z","timestamp":1694056499000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["LOGICS: Learning optimal generative distribution for designing de novo chemical structures"],"prefix":"10.1186","volume":"15","author":[{"given":"Bongsung","family":"Bae","sequence":"first","affiliation":[]},{"given":"Haelee","family":"Bae","sequence":"additional","affiliation":[]},{"given":"Hojung","family":"Nam","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,7]]},"reference":[{"issue":"6","key":"747_CR1","doi-asserted-by":"publisher","first-page":"895","DOI":"10.1007\/s12257-020-0049-y","volume":"25","author":"H Kim","year":"2020","unstructured":"Kim H, Kim E, Lee I, Bae B, Park M, Nam H (2020) Artificial intelligence in drug discovery: a comprehensive review of data-driven and machine learning approaches. Biotechnol Bioprocess Eng 25(6):895\u2013930","journal-title":"Biotechnol Bioprocess Eng"},{"issue":"4","key":"747_CR2","doi-asserted-by":"publisher","first-page":"828","DOI":"10.1039\/C9ME00039A","volume":"4","author":"DC Elton","year":"2019","unstructured":"Elton DC, Boukouvalas Z, Fuge MD, Chung PW (2019) Deep learning for molecular design-a review of the state of the art. Mol Syst Des Eng 4(4):828\u2013849","journal-title":"Mol Syst Des Eng"},{"issue":"19","key":"747_CR3","doi-asserted-by":"publisher","first-page":"14011","DOI":"10.1021\/acs.jmedchem.1c00927","volume":"64","author":"X Tong","year":"2021","unstructured":"Tong X, Liu X, Tan X, Li X, Jiang J, Xiong Z, Xu T, Jiang H, Qiao N, Zheng M (2021) Generative models for de novo drug design. J Med Chem 64(19):14011\u201314027","journal-title":"J Med Chem"},{"issue":"1","key":"747_CR4","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1038\/s42004-018-0068-1","volume":"1","author":"D Merk","year":"2018","unstructured":"Merk D, Grisoni F, Friedrich L, Schneider G (2018) Tuning artificial intelligence on the de novo design of natural-product-inspired retinoid X receptor modulators. Commun Chem 1(1):68","journal-title":"Commun Chem"},{"issue":"1","key":"747_CR5","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/s13321-019-0328-9","volume":"11","author":"SJ Zheng","year":"2019","unstructured":"Zheng SJ, Yan X, Gu Q, Yang YD, Du YF, Lu YT, Xu J (2019) QBMG: quasi-biogenic molecule generator with deep recurrent neural network. J Cheminform 11(1):5","journal-title":"J Cheminform"},{"issue":"4","key":"747_CR6","doi-asserted-by":"publisher","first-page":"1347","DOI":"10.1021\/acs.jcim.8b00902","volume":"59","author":"M Awale","year":"2019","unstructured":"Awale M, Sirockin F, Stiefl N, Reymond JL (2019) Drug analogs from fragment-based long short-term memory generative neural networks. J Chem Inf Model 59(4):1347\u20131356","journal-title":"J Chem Inf Model"},{"issue":"1","key":"747_CR7","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1186\/s13321-017-0235-x","volume":"9","author":"M Olivecrona","year":"2017","unstructured":"Olivecrona M, Blaschke T, Engkvist O, Chen H (2017) Molecular de-novo design through deep reinforcement learning. J Cheminform 9(1):48","journal-title":"J Cheminform"},{"issue":"7","key":"747_CR8","doi-asserted-by":"publisher","first-page":"eaap7885","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4(7):eaap7885","journal-title":"Sci Adv"},{"issue":"1","key":"747_CR9","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1186\/s13321-019-0355-6","volume":"11","author":"X Liu","year":"2019","unstructured":"Liu X, Ye K, van Vlijmen HWT (2019) AP IJ, van Westen GJP: An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A2A receptor. J Cheminform 11(1):35","journal-title":"J Cheminform"},{"issue":"1","key":"747_CR10","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/s13321-021-00498-z","volume":"13","author":"T Pereira","year":"2021","unstructured":"Pereira T, Abbasi M, Ribeiro B, Arrais JP (2021) Diversity oriented deep reinforcement Learning for targeted molecule generation. J Cheminform 13(1):21","journal-title":"J Cheminform"},{"key":"747_CR11","doi-asserted-by":"publisher","DOI":"10.1016\/j.bmc.2021.116308","volume":"44","author":"K Papadopoulos","year":"2021","unstructured":"Papadopoulos K, Giblin KA, Janet JP, Patronov A, Engkvist O (2021) De novo design with deep generative models based on 3D similarity scoring. Bioorg Med Chem 44:116308","journal-title":"Bioorg Med Chem"},{"issue":"1","key":"747_CR12","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1021\/acscentsci.7b00512","volume":"4","author":"MHS Segler","year":"2018","unstructured":"Segler MHS, Kogej T, Tyrchan C, Waller MP (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent Sci 4(1):120\u2013131","journal-title":"ACS Cent Sci"},{"key":"747_CR13","unstructured":"Ahn S, Kim J, Lee H, Shin J (2020) Guiding deep molecular optimization with genetic exploration. arXiv:2007.04897"},{"key":"747_CR14","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1016\/j.ddtec.2020.09.003","volume":"32\u201333","author":"P Renz","year":"2019","unstructured":"Renz P, Van Rompaey D, Wegner JK, Hochreiter S, Klambauer G (2019) On failure modes in molecule generation and optimization. Drug Discov Today Technol 32\u201333:55\u201363","journal-title":"Drug Discov Today Technol"},{"issue":"1","key":"747_CR15","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1186\/s13321-022-00646-z","volume":"14","author":"M Thomas","year":"2022","unstructured":"Thomas M, O\u2019Boyle NM, Bender A, de Graaf C (2022) Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation. J Cheminform 14(1):68","journal-title":"J Cheminform"},{"key":"747_CR16","doi-asserted-by":"crossref","unstructured":"Guo J, Schwaller P (2023) Augmented memory: capitalizing on experience replay to accelerate de novo molecular design. arXiv:2305.16160","DOI":"10.26434\/chemrxiv-2023-qmqmq-v2"},{"issue":"1\u20132","key":"747_CR17","doi-asserted-by":"publisher","first-page":"1700111","DOI":"10.1002\/minf.201700111","volume":"37","author":"A Gupta","year":"2018","unstructured":"Gupta A, Muller AT, Huisman BJH, Fuchs JA, Schneider P, Schneider G (2018) Generative recurrent networks for de novo drug design. Mol Inform 37(1\u20132):1700111","journal-title":"Mol Inform"},{"key":"747_CR18","doi-asserted-by":"crossref","unstructured":"Thanh-Tung H, Tran T: Catastrophic forgetting and mode collapse in GANs. In: 2020 International Joint Conference on Neural Networks (IJCNN): 19\u201324 July 2020 2020. 1\u201310","DOI":"10.1109\/IJCNN48605.2020.9207181"},{"key":"747_CR19","volume-title":"Reinforcement learning : an introduction","author":"RS Sutton","year":"1998","unstructured":"Sutton RS, Barto AG (1998) Reinforcement learning\u202f: an introduction. MIT Press, Cambridge, Mass"},{"issue":"3","key":"747_CR20","doi-asserted-by":"publisher","first-page":"1096","DOI":"10.1021\/acs.jcim.8b00839","volume":"59","author":"N Brown","year":"2019","unstructured":"Brown N, Fiscato M, Segler MHS, Vaucher AC (2019) GuacaMol: benchmarking models for de novo molecular design. J Chem Inf Model 59(3):1096\u20131108","journal-title":"J Chem Inf Model"},{"key":"747_CR21","unstructured":"RDKit: Open-source cheminformatics (version 2019.03) https:\/\/www.rdkit.org"},{"issue":"D1","key":"747_CR22","doi-asserted-by":"publisher","first-page":"D1102","DOI":"10.1093\/nar\/gky1033","volume":"47","author":"S Kim","year":"2019","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B et al (2019) PubChem 2019 update: improved access to chemical data. Nucleic Acids Res 47(D1):D1102\u2013D1109","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"747_CR23","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1038\/s42256-020-0174-5","volume":"2","author":"PC Kotsias","year":"2020","unstructured":"Kotsias PC, Arus-Pous J, Chen HM, Engkvist O, Tyrchan C, Bjerrum EJ (2020) Direct steering of de novo molecular generation with descriptor conditional recurrent neural networks. Nat Mach Intell 2(5):254\u2013265","journal-title":"Nat Mach Intell"},{"issue":"1","key":"747_CR24","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1186\/s13321-020-00473-0","volume":"12","author":"T Blaschke","year":"2020","unstructured":"Blaschke T, Engkvist O, Bajorath J, Chen H (2020) Memory-assisted reinforcement learning for diverse molecular de novo design. J Cheminform 12(1):68","journal-title":"J Cheminform"},{"issue":"1","key":"747_CR25","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/s13321-019-0393-0","volume":"11","author":"J Arus-Pous","year":"2019","unstructured":"Arus-Pous J, Johansson SV, Prykhodko O, Bjerrum EJ, Tyrchan C, Reymond JL, Chen H, Engkvist O (2019) Randomized SMILES strings improve the quality of molecular generative models. J Cheminform 11(1):71","journal-title":"J Cheminform"},{"issue":"9","key":"747_CR26","doi-asserted-by":"publisher","first-page":"1736","DOI":"10.1021\/acs.jcim.8b00234","volume":"58","author":"K Preuer","year":"2018","unstructured":"Preuer K, Renz P, Unterthiner T, Hochreiter S, Klambauer G (2018) Frechet ChemNet distance: a metric for generative models for molecules in drug discovery. J Chem Inf Model 58(9):1736\u20131741","journal-title":"J Chem Inf Model"},{"issue":"3","key":"747_CR27","first-page":"193","volume":"9","author":"BL Miller","year":"1995","unstructured":"Miller BL, Goldberg DE (1995) Genetic algorithms, tournament selection, and the effects of noise. Complex Syst 9(3):193\u2013212","journal-title":"Complex Syst"},{"key":"747_CR28","unstructured":"Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M. (2013) Playing Atari with Deep Reinforcement Learning. arXiv:1312.5602"},{"key":"747_CR29","doi-asserted-by":"publisher","DOI":"10.3389\/fphar.2020.565644","volume":"11","author":"D Polykovskiy","year":"2020","unstructured":"Polykovskiy D, Zhebrak A, Sanchez-Lengeling B, Golovanov S, Tatanov O, Belyaev S, Kurbanov R, Artamonov A, Aladinskiy V, Veselov M et al (2020) Molecular sets (MOSES): a benchmarking platform for molecular generation models. Front Pharmacol 11:565644","journal-title":"Front Pharmacol"},{"issue":"2","key":"747_CR30","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1110\/ps.21302","volume":"11","author":"B Ma","year":"2002","unstructured":"Ma B, Shatsky M, Wolfson HJ, Nussinov R (2002) Multiple diverse ligands binding at a single protein site: a matter of pre-existing populations. Protein Sci 11(2):184\u2013197","journal-title":"Protein Sci"},{"key":"747_CR31","doi-asserted-by":"crossref","unstructured":"Peyr\u00e9 G, Cuturi M (2018) Computational Optimal Transport. arXiv:1803.00567","DOI":"10.1561\/9781680835519"},{"key":"747_CR32","unstructured":"Solomon J (2018) Optimal Transport on Discrete Domains. arXiv:1801.07745"},{"key":"747_CR33","volume-title":"Handbook of combinatorial optimization: supplement","author":"RE Burkard","year":"1999","unstructured":"Burkard RE, \u00c7ela E (1999) Linear assignment problems and extensions. In: Du D-Z, Pardalos PM (eds) Handbook of combinatorial optimization: supplement, vol A. Boston. MA, Springer, US"},{"key":"747_CR34","volume-title":"Advances in neural information processing systems","author":"A Srivastava","year":"2017","unstructured":"Srivastava A, Valkov L, Russell C, Gutmann MU, Sutton C (2017) VEEGAN: Reducing mode collapse in gans using implicit variational learning. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R (eds) Advances in neural information processing systems. New York Curran Associates Inc"},{"issue":"13","key":"747_CR35","doi-asserted-by":"publisher","first-page":"3521","DOI":"10.1073\/pnas.1611835114","volume":"114","author":"J Kirkpatrick","year":"2017","unstructured":"Kirkpatrick J, Pascanu R, Rabinowitz N, Veness J, Desjardins G, Rusu AA, Milan K, Quan J, Ramalho T, Grabska-Barwinska A et al (2017) Overcoming catastrophic forgetting in neural networks. Proc Natl Acad Sci U S A 114(13):3521\u20133526","journal-title":"Proc Natl Acad Sci U S A"},{"key":"747_CR36","unstructured":"Multicore-TSNE https:\/\/github.com\/DmitryUlyanov\/Multicore-TSNE"},{"issue":"2","key":"747_CR37","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"GR Bickerton","year":"2012","unstructured":"Bickerton GR, Paolini GV, Besnard J, Muresan S, Hopkins AL (2012) Quantifying the chemical beauty of drugs. Nat Chem 4(2):90\u201398","journal-title":"Nat Chem"},{"issue":"1","key":"747_CR38","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1758-2946-1-8","volume":"1","author":"P Ertl","year":"2009","unstructured":"Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1(1):8","journal-title":"J Cheminform"},{"key":"747_CR39","unstructured":"Schrodinger, LLC: The PyMOL molecular graphics system, Version 1.8. In.; 2015."},{"issue":"12","key":"747_CR40","doi-asserted-by":"publisher","first-page":"2577","DOI":"10.1002\/bip.360221211","volume":"22","author":"W Kabsch","year":"1983","unstructured":"Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12):2577\u20132637","journal-title":"Biopolymers"},{"issue":"1","key":"747_CR41","doi-asserted-by":"publisher","first-page":"8799","DOI":"10.1038\/s41598-023-35648-w","volume":"13","author":"E Mazuz","year":"2023","unstructured":"Mazuz E, Shtar G, Shapira B, Rokach L (2023) Molecule generation using transformers and policy gradient reinforcement learning. Sci Rep 13(1):8799","journal-title":"Sci Rep"},{"key":"747_CR42","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btac814","author":"Z Liao","year":"2023","unstructured":"Liao Z, Xie L, Mamitsuka H, Zhu S (2023) Sc2Mol: a scaffold-based two-step molecule generator with variational autoencoder and transformer. Bioinformatics. https:\/\/doi.org\/10.1093\/bioinformatics\/btac814","journal-title":"Bioinformatics"},{"key":"747_CR43","unstructured":"Levy D, Rector-Brooks J (2023) Molecular fragment-based diffusion model for drug discovery. In: Notin P (ed) ICLR 2023 - Machine learning for drug discovery workshop: 2023"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00747-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00747-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00747-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,21]],"date-time":"2023-11-21T22:24:51Z","timestamp":1700605491000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00747-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,7]]},"references-count":43,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["747"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00747-3","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,7]]},"assertion":[{"value":"6 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"77"}}