{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T16:33:07Z","timestamp":1740155587976,"version":"3.37.3"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,9,2]],"date-time":"2022-09-02T00:00:00Z","timestamp":1662076800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,9,2]],"date-time":"2022-09-02T00:00:00Z","timestamp":1662076800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81903438"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004731","name":"Natural Science Foundation of Zhejiang Province","doi-asserted-by":"publisher","award":["LD22H300004"],"id":[{"id":"10.13039\/501100004731","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"Abstract<\/jats:title>Deep learning methods, such as reaction prediction and retrosynthesis analysis, have demonstrated their significance in the chemical field. However, the de novo generation of novel reactions using artificial intelligence technology requires further exploration. Inspired by molecular generation, we proposed a novel task of reaction generation. Herein, Heck reactions were applied to train the transformer model, a state-of-art natural language process model, to generate 4717 reactions after sampling and processing. Then, 2253 novel Heck reactions were confirmed by organizing chemists to judge the generated reactions. More importantly, further organic synthesis experiments were performed to verify the accuracy and feasibility of representative reactions. The total process, from Heck reaction generation to experimental verification, required only 15\u00a0days, demonstrating that our model has well-learned reaction rules in-depth and can contribute to novel reaction discovery and chemical space exploration.<\/jats:p>","DOI":"10.1186\/s13321-022-00638-z","type":"journal-article","created":{"date-parts":[[2022,9,2]],"date-time":"2022-09-02T10:05:41Z","timestamp":1662113141000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["From theory to experiment: transformer-based generation enables rapid discovery of novel reactions"],"prefix":"10.1186","volume":"14","author":[{"given":"Xinqiao","family":"Wang","sequence":"first","affiliation":[]},{"given":"Chuansheng","family":"Yao","sequence":"additional","affiliation":[]},{"given":"Yun","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Jiahui","family":"Yu","sequence":"additional","affiliation":[]},{"given":"Haoran","family":"Qiao","sequence":"additional","affiliation":[]},{"given":"Chengyun","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Yejian","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Renren","family":"Bai","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9194-0115","authenticated-orcid":false,"given":"Hongliang","family":"Duan","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,9,2]]},"reference":[{"issue":"3","key":"638_CR1","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1039\/b104620a","volume":"34","author":"MH Todd","year":"2005","unstructured":"Todd MH (2005) Computer-aided organic synthesis. Chem Soc Rev 34(3):247","journal-title":"Chem Soc Rev"},{"issue":"1","key":"638_CR2","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1002\/wcms.61","volume":"2","author":"A Cook","year":"2012","unstructured":"Cook A, Johnson AP, Law J, Mirzazadeh M, Ravitz O, Simon A (2012) Computer-aided synthesis design: 40 years on. Wiley Interdiscip Rev Comput Mol Sci 2(1):79","journal-title":"Wiley Interdiscip Rev Comput Mol Sci"},{"issue":"14","key":"638_CR3","doi-asserted-by":"publisher","first-page":"4515","DOI":"10.1002\/anie.201806920","volume":"58","author":"W Beker","year":"2019","unstructured":"Beker W, Gajewska EP, Badowski T, Grzybowski BA (2019) Prediction of major regio-, site-, and diastereoisomers in Diels-Alder reactions by using machine-learning: the importance of physically meaningful descriptors. Angew Chem Int Ed Engl 58(14):4515","journal-title":"Angew Chem Int Ed Engl"},{"issue":"16","key":"638_CR4","doi-asserted-by":"publisher","first-page":"8667","DOI":"10.1021\/acs.jmedchem.9b02120","volume":"63","author":"TJ Struble","year":"2020","unstructured":"Struble TJ, Alvarez JC, Brown SP, Chytil M, Cisar J, DesJarlais RL, Engkvist O, Frank SA, Greve DR, Griffin DJ, Hou X, Johannes JW, Kreatsoulas C, Lahue B, Mathea M, Mogk G, Nicolaou CA, Palmer AD, Price DJ, Robinson RI, Salentin S, Xing L, Jaakkola T, Green WH, Barzilay R, Coley CW, Jensen KF (2020) Current and future roles of artificial intelligence in medicinal chemistry synthesis. J Med Chem 63(16):8667","journal-title":"J Med Chem"},{"issue":"7","key":"638_CR5","doi-asserted-by":"publisher","first-page":"1415","DOI":"10.1039\/D0QO01636E","volume":"8","author":"Y Zhang","year":"2021","unstructured":"Zhang Y, Wang L, Wang X, Zhang C, Ge J, Tang J, Su A, Duan H (2021) Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes. Org Chem Front 8(7):1415","journal-title":"Org Chem Front"},{"issue":"34","key":"638_CR6","doi-asserted-by":"publisher","first-page":"4114","DOI":"10.1039\/D1CC00586C","volume":"57","author":"Y Wu","year":"2021","unstructured":"Wu Y, Zhang C, Wang L, Duan H (2021) A graph-convolutional neural network for addressing small-scale reaction prediction. Chem Commun 57(34):4114","journal-title":"Chem Commun"},{"issue":"3","key":"638_CR7","doi-asserted-by":"publisher","first-page":"593","DOI":"10.1021\/ci800228y","volume":"49","author":"J Law","year":"2009","unstructured":"Law J, Zsoldos Z, Simon A, Reid D, Liu Y, Khew SY, Johnson AP, Major S, Wade RA, Ando HY (2009) Route designer: a retrosynthetic analysis tool utilizing automated retrosynthetic rule generation. J Chem Inf Model 49(3):593","journal-title":"J Chem Inf Model"},{"issue":"6","key":"638_CR8","doi-asserted-by":"publisher","first-page":"2529","DOI":"10.1021\/acs.jcim.9b00286","volume":"59","author":"CW Coley","year":"2019","unstructured":"Coley CW, Green WH, Jensen KF (2019) RDChiral: An RDKit wrapper for handling stereochemistry in retrosynthetic template extraction and application. J Chem Inf Model 59(6):2529","journal-title":"J Chem Inf Model"},{"key":"638_CR9","unstructured":"Sun R, Dai H, Li L, Kearnes S, Dai B (2020) Energy-based View of Retrosynthesis. arXiv preprint arXiv: 2007.13437"},{"key":"638_CR10","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.1c01065","author":"P Seidl","year":"2022","unstructured":"Seidl P, Renz P, Dyubankova N, Neves P, Verhoeven J, Wegner JK, Segler M, Hochreiter S, Klambauer G (2022) Improving Few-and Zero-Shot Reaction Template Prediction Using Modern Hopfield Networks. J Chem Inf Model. https:\/\/doi.org\/10.1021\/acs.jcim.1c01065","journal-title":"J Chem Inf Model"},{"issue":"11","key":"638_CR11","doi-asserted-by":"publisher","first-page":"2043","DOI":"10.1021\/jo01299a001","volume":"45","author":"TD Salatin","year":"1980","unstructured":"Salatin TD, Jorgensen WL (1980) Computer-assisted mechanistic evaluation of organic reactions. 1. overview. J Org Chem 45(11):2043\u20132051","journal-title":"J Org Chem"},{"key":"638_CR12","doi-asserted-by":"publisher","first-page":"1237","DOI":"10.1021\/acscentsci.7b00355","volume":"3","author":"CW Coley","year":"2017","unstructured":"Coley CW, Rogers L, Green WH, Jensen KF (2017) Computer-assisted retrosynthesis based on molecular similarity. ACS Cent Sci 3:1237\u20131245","journal-title":"ACS Cent Sci"},{"key":"638_CR13","unstructured":"Yan C, Zhao P, Lu C, Yu Y, Huang J. (2021). RetroComposer: Discovering Novel Reactions by Composing Templates for Retrosynthesis Prediction. arXiv preprint arXiv:2112.11225"},{"key":"638_CR14","doi-asserted-by":"publisher","unstructured":"Wan Y, Li X, Wang X, Yao X, Liao B, Hsieh CY, Zhang S. (2021) NeuralTPL: a deep learning approach for efficient reaction space exploration. ChemRxiv preprint ChemRxiv:. https:\/\/doi.org\/10.26434\/chemrxiv-2021-xvcwb","DOI":"10.26434\/chemrxiv-2021-xvcwb"},{"key":"638_CR15","unstructured":"Jin W, Coley CW, Barzilay R, Jaakkola T (2017) Predicting organic reaction outcomes with weisfeiler-lehman network. In: Advances in Neural Information Processing Systems. p. 2607"},{"issue":"2","key":"638_CR16","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1039\/C8SC04228D","volume":"10","author":"CW Coley","year":"2019","unstructured":"Coley CW, Jin W, Rogers L, Jamison TF, Jaakkola TS, Green WH, Barzilay R, Jensen KF (2019) A graph-convolutional neural network model for the prediction of chemical reactivity. Chem Sci 10(2):370","journal-title":"Chem Sci"},{"key":"638_CR17","unstructured":"Nam J, Kim J (2016) Linking the neural machine translation and the prediction of organic chemistry reactions. arXiv preprint arXiv:.09529"},{"issue":"28","key":"638_CR18","doi-asserted-by":"publisher","first-page":"6091","DOI":"10.1039\/C8SC02339E","volume":"9","author":"P Schwaller","year":"2018","unstructured":"Schwaller P, Gaudin T, Lanyi D, Bekas C, Laino T (2018) \u201cFound in Translation\u201d:predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models. Chem Sci 9(28):6091","journal-title":"Chem Sci"},{"issue":"10","key":"638_CR19","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1021\/acscentsci.7b00303","volume":"3","author":"B Liu","year":"2017","unstructured":"Liu B, Ramsundar B, Kawthekar P, Shi J, Gomes J, Luu Nguyen Q, Ho S, Sloane J, Wender P, Pande V (2017) Retrosynthetic reaction prediction using neural sequence-to-sequence models. ACS Cent Sci 3(10):1103","journal-title":"ACS Cent Sci"},{"issue":"9","key":"638_CR20","doi-asserted-by":"publisher","first-page":"1572","DOI":"10.1021\/acscentsci.9b00576","volume":"5","author":"P Schwaller","year":"2019","unstructured":"Schwaller P, Laino T, Gaudin T, Bolgar P, Hunter CA, Bekas C, Lee AA (2019) Molecular transformer: a model for uncertainty-calibrated chemical reaction prediction. ACS Cent Sci 5(9):1572","journal-title":"ACS Cent Sci"},{"issue":"8","key":"638_CR21","doi-asserted-by":"publisher","first-page":"649","DOI":"10.1038\/nrd1799","volume":"4","author":"G Schneider","year":"2005","unstructured":"Schneider G, Fechner U (2005) Computer-based de novo design of drug-like molecules. Nat Rev Drug Discov 4(8):649","journal-title":"Nat Rev Drug Discov"},{"issue":"5","key":"638_CR22","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1002\/wcms.49","volume":"1","author":"M Hartenfeller","year":"2011","unstructured":"Hartenfeller M, Schneider G (2011) Enabling future drug discovery by de novo design. Wiley Interdiscip Rev Comput Mol Sci 1(5):742","journal-title":"Wiley Interdiscip Rev Comput Mol Sci"},{"key":"638_CR23","unstructured":"Wang Z, He W, Wu H, Wu H, Li W, Wang H, Chen E (2016) Chinese poetry generation with planning based neural network. arXiv preprint arXiv:.09889"},{"issue":"1","key":"638_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-021-81889-y","volume":"11","author":"W Bort","year":"2021","unstructured":"Bort W, Baskin II, Gimadiev T, Mukanov A, Nugmanov R, Sidorov P, Marcou G, Horvath D, Klimchuk O, Madzhidov T (2021) Discovery of novel chemical reactions by deep generative recurrent neural network. Sci Rep 11(1):1","journal-title":"Sci Rep"},{"key":"638_CR25","unstructured":"Vaswani A. Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141, Polosukhin I (2017) Attention is all you need. In Advances in neural information processing systems. p. 5998"},{"key":"638_CR26","doi-asserted-by":"publisher","DOI":"10.33774\/chemrxiv-2021-fxvwg","author":"C Zhang","year":"2021","unstructured":"Zhang C, Cai X, Qiao H, Zhang Y, Wu Y, Wang X, Xie H, Luo F, Duan H (2021) Self-supervised molecular pretraining strategy for reaction prediction in low-resource scenarios. ChemRxiv preprint ChemRxiv. https:\/\/doi.org\/10.33774\/chemrxiv-2021-fxvwg","journal-title":"ChemRxiv preprint ChemRxiv"},{"issue":"1","key":"638_CR27","doi-asserted-by":"publisher","first-page":"2573","DOI":"10.1038\/s41467-021-22951-1","volume":"12","author":"AC Vaucher","year":"2021","unstructured":"Vaucher AC, Schwaller P, Geluykens J, Nair VH, Iuliano A, Laino T (2021) Inferring experimental procedures from text-based representations of chemical reactions. Nat Commun 12(1):2573","journal-title":"Nat Commun"},{"key":"638_CR28","doi-asserted-by":"crossref","unstructured":"Dai Z, Yang Z, Yang Y, Carbonell J, Le QV, Salakhutdinov R. (2019) Transformer-xl: Attentive language models beyond a fixed-length context. arXiv preprint arXiv:.02860","DOI":"10.18653\/v1\/P19-1285"},{"issue":"20","key":"638_CR29","doi-asserted-by":"publisher","first-page":"5518","DOI":"10.1021\/ja01022a034","volume":"90","author":"RF Heck","year":"1968","unstructured":"Heck RF (1968) Acylation, methylation, and carboxyalkylation of olefins by Group VIII metal derivatives. J Am Chem Soc 90(20):5518","journal-title":"J Am Chem Soc"},{"key":"638_CR30","first-page":"11","volume":"9","author":"L Van der Maaten","year":"2008","unstructured":"Van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Research 9:11","journal-title":"J Mach Learn Research"},{"key":"638_CR31","unstructured":"Hinton G, Roweis ST (2002) Stochastic neighbor embedding. In NIPS p 833"},{"issue":"2","key":"638_CR32","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1038\/s42256-020-00284-w","volume":"3","author":"P Schwaller","year":"2021","unstructured":"Schwaller P, Probst D, Vaucher AC, Nair VH, Kreutter D, Laino T, Reymond JL (2021) Mapping the space of chemical reactions using attention-based neural networks. Nat Mach Intell 3(2):144","journal-title":"Nat Mach Intell"},{"issue":"1","key":"638_CR33","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1186\/s13321-020-0416-x","volume":"12","author":"D Probst","year":"2020","unstructured":"Probst D, Reymond JL (2020) Visualization of very large high-dimensional data sets as minimum spanning trees. J Cheminform 12(1):12","journal-title":"J Cheminform"},{"issue":"12","key":"638_CR34","doi-asserted-by":"publisher","first-page":"3298","DOI":"10.1039\/C6OB00164E","volume":"14","author":"X Cheng","year":"2016","unstructured":"Cheng X, Chen Z, Gao Y, Xue F, Jiang C (2016) Aminoquinoline-assisted vinylic C-H arylation of unsubstituted acrylamide for the selective synthesis of Z olefins. Org Biomol Chem 14(12):3298","journal-title":"Org Biomol Chem"},{"key":"638_CR35","first-page":"78","volume":"1697","author":"R Grigg","year":"1986","unstructured":"Grigg R, Sridharan V, Stevenson P, Worakun T (1986) Palladium (II) catalysed construction of tetrasubstituted carbon centres, and spiro and bridged-ring compounds from enamides of 2-lodobenzoic acids. J Chem Soc Chem Commun 1697:78","journal-title":"J Chem Soc Chem Commun"},{"issue":"24","key":"638_CR36","doi-asserted-by":"publisher","first-page":"8362","DOI":"10.1039\/D1SC01050F","volume":"12","author":"O Dollar","year":"2021","unstructured":"Dollar O, Joshi N, Beck DAC, Pfaendtner J (2021) Attention-based generative models for de novo molecular design. Chem Sci 12(24):8362\u20138372","journal-title":"Chem Sci"},{"key":"638_CR37","unstructured":"Poem generation GitHub. https:\/\/github.com\/GaoPeng97\/Transformer-xl-chinese.git"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00638-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-022-00638-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00638-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,2]],"date-time":"2022-09-02T10:08:22Z","timestamp":1662113302000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-022-00638-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,2]]},"references-count":37,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["638"],"URL":"https:\/\/doi.org\/10.1186\/s13321-022-00638-z","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2022,9,2]]},"assertion":[{"value":"11 December 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 August 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 September 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"There are no conflicts to declare.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"60"}}