{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,5]],"date-time":"2025-04-05T00:06:53Z","timestamp":1743811613187,"version":"3.37.3"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,9,28]],"date-time":"2022-09-28T00:00:00Z","timestamp":1664323200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,9,28]],"date-time":"2022-09-28T00:00:00Z","timestamp":1664323200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["441958208"],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001663","name":"Volkswagen Foundation","doi-asserted-by":"publisher","award":["Momentum project"],"id":[{"id":"10.13039\/501100001663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008769","name":"Julius-Maximilians-Universit\u00e4t W\u00fcrzburg","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100008769","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"Abstract<\/jats:title>TUCAN is a canonical serialization format that is independent of domain-specific concepts of structure and bonding. The atomic number is the only chemical feature that is used to derive the TUCAN format. Other than that, the format is solely based on the molecular topology. Validation is reported on a manually curated test set of molecules as well as a library of non-chemical graphs. The serialization procedure generates a canonical \u201ctuple-style\u201d output which is bidirectional, allowing the TUCAN string to serve as both identifier and descriptor. Use of the Python NetworkX graph library facilitated a compact and easily extensible implementation.<\/jats:p>Graphical Abstract<\/jats:bold><\/jats:p>","DOI":"10.1186\/s13321-022-00640-5","type":"journal-article","created":{"date-parts":[[2022,9,28]],"date-time":"2022-09-28T16:05:10Z","timestamp":1664381110000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["TUCAN: A molecular identifier and descriptor applicable to the whole periodic table from hydrogen to oganesson"],"prefix":"10.1186","volume":"14","author":[{"given":"Jan C.","family":"Brammer","sequence":"first","affiliation":[]},{"given":"Gerd","family":"Blanke","sequence":"additional","affiliation":[]},{"given":"Claudia","family":"Kellner","sequence":"additional","affiliation":[]},{"given":"Alexander","family":"Hoffmann","sequence":"additional","affiliation":[]},{"given":"Sonja","family":"Herres-Pawlis","sequence":"additional","affiliation":[]},{"given":"Ulrich","family":"Schatzschneider","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,9,28]]},"reference":[{"key":"640_CR1","doi-asserted-by":"crossref","unstructured":"Gasteiger J (ed) (2003) Handbook of Chemoinformatics: From data to knowledge in 4 volumes. Wiley-VCH, Weinheim","DOI":"10.1002\/9783527618279"},{"issue":"12","key":"640_CR2","doi-asserted-by":"publisher","first-page":"3149","DOI":"10.1021\/ci200488k","volume":"51","author":"AM Clark","year":"2011","unstructured":"Clark AM (2011) Accurate specification of molecular structures: The case for zero-order bonds and explicit hydrogen counting. J Chem Inf Model 51(12):3149\u20133157","journal-title":"J Chem Inf Model"},{"issue":"9","key":"640_CR3","doi-asserted-by":"publisher","first-page":"1469","DOI":"10.1002\/anie.200603600","volume":"46","author":"BO Roos","year":"2007","unstructured":"Roos BO, Borin AC, Gagliardi L (2007) Reaching the maximum multiplicity of the covalent chemical bond. Angew Chem Int Ed 46(9):1469\u20131472","journal-title":"Angew Chem Int Ed"},{"issue":"10","key":"640_CR4","doi-asserted-by":"publisher","first-page":"1897","DOI":"10.1351\/pac200678101897","volume":"78","author":"J Brecher","year":"2006","unstructured":"Brecher J (2006) Graphical representation of stereochemical configuration. Pure Appl Chem 78(10):1897\u20131970","journal-title":"Pure Appl Chem"},{"issue":"6","key":"640_CR5","doi-asserted-by":"publisher","first-page":"1569","DOI":"10.1002\/bkcs.10298","volume":"36","author":"SP Mbue","year":"2015","unstructured":"Mbue SP, Cho K-H (2015) Identification of isomers of organometallic compounds. Bull Korean Chem Soc 36(6):1569\u20131574","journal-title":"Bull Korean Chem Soc"},{"issue":"4","key":"640_CR6","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1016\/S0010-8545(00)80259-3","volume":"13","author":"JH Enemark","year":"1974","unstructured":"Enemark JH, Feltham RD (1974) Principles of structure, bonding, and reactivity for metal nitrosyl complexes. Coord Chem Rev 13(4):339\u2013406","journal-title":"Coord Chem Rev"},{"issue":"1","key":"640_CR7","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28(1):31\u201336","journal-title":"J Chem Inf Comput Sci"},{"key":"640_CR8","unstructured":"Daylight theory manual, https:\/\/www.daylight.com\/dayhtml\/doc\/theory\/index.pdf"},{"issue":"10","key":"640_CR9","doi-asserted-by":"publisher","first-page":"1779","DOI":"10.1351\/pac200779101779","volume":"79","author":"RM Hartshorn","year":"2007","unstructured":"Hartshorn RM, Hey-Hawkins E, Kalio R, Leigh GJ (2007) Representation of configuration in coordination polyhedra and the extension of current methodology to coordination numbers greater than six. Pure Appl Chem 79(10):1779\u20131799","journal-title":"Pure Appl Chem"},{"key":"640_CR10","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1186\/s13321-015-0068-4","volume":"7","author":"SR Heller","year":"2015","unstructured":"Heller SR, McNaught A, Pletnev I, Stein S, Tchekhovskoi D (2015) InChI, the IUPAC international chemical identifier. J Cheminform 7:23","journal-title":"J Cheminform"},{"issue":"5","key":"640_CR11","doi-asserted-by":"publisher","first-page":"787","DOI":"10.1021\/ci00027a001","volume":"35","author":"A Dietz","year":"1995","unstructured":"Dietz A (1995) Yet another representation of molecular structure. J Chem Inf Comput Sci 35(5):787\u2013802","journal-title":"J Chem Inf Comput Sci"},{"key":"640_CR12","unstructured":"Coordination complexes for InChI: preliminary study. https:\/\/github.com\/aclarkxyz\/data_coordinchi"},{"issue":"42","key":"640_CR13","doi-asserted-by":"publisher","first-page":"11140","DOI":"10.1002\/anie.201405820","volume":"53","author":"DA Evans","year":"2014","unstructured":"Evans DA (2014) History of the Harvard ChemDraw project. Angew Chem Int Ed 53(42):11140\u201311145","journal-title":"Angew Chem Int Ed"},{"issue":"2","key":"640_CR14","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1021\/ci00007a012","volume":"32","author":"A Dalby","year":"1992","unstructured":"Dalby A, Nourse JG, Hounshell WD, Gushurst AK, Grier DL, Leland BA, Laufer J (1992) Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited. J Chem Inf Comput Sci 32(2):244\u2013255","journal-title":"J Chem Inf Comput Sci"},{"key":"640_CR15","unstructured":"CTFile formats. Biovia; 2020 https:\/\/discover.3ds.com\/sites\/default\/files\/2020-08\/biovia_ctfileformats_2020.pdf"},{"key":"640_CR16","unstructured":"Chemical representation. Biovia; 2021 http:\/\/help.accelrysonline.com\/insight\/2021\/content\/pdf_files\/bioviachemicalrepresentation.pdf"},{"key":"640_CR17","volume-title":"Chemical graph theory","author":"N Trinajstic","year":"1992","unstructured":"Trinajstic N (1992) Chemical graph theory, 2nd edn. CRC Press, Boca Raton","edition":"2"},{"key":"640_CR18","doi-asserted-by":"crossref","unstructured":"Hagberg A, Schult DA, Swart PJ (2008) Exploring network structure, dynamics, and function using NetworkX. In: Proceedings of the 7th Python in science conference, Pasadena, CA USA, pp 11\u201315","DOI":"10.25080\/TCWV9851"},{"issue":"4","key":"640_CR19","doi-asserted-by":"publisher","first-page":"497","DOI":"10.1002\/andp.18310970402","volume":"97","author":"WC Zeise","year":"1831","unstructured":"Zeise WC (1831) Von der Wirkung zwischen Platinchlorid und Alkohol, und von den dabei entstehenden neuen Substanzen. Ann Phys Chem 97(4):497\u2013541","journal-title":"Ann Phys Chem"},{"issue":"11","key":"640_CR20","doi-asserted-by":"publisher","first-page":"2653","DOI":"10.1021\/ic50153a012","volume":"14","author":"RA Love","year":"1975","unstructured":"Love RA, Koetzle TF, Williams GJB, Andrews LC, Bau R (1975) Neutron diffraction study of the structure of Zeise\u2019s salt, KPtCl3(C2H4)\u00b7H2O. Inorg Chem 14(11):2653\u20132657","journal-title":"Inorg Chem"},{"issue":"2","key":"640_CR21","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1021\/c160017a018","volume":"5","author":"HL Morgan","year":"1965","unstructured":"Morgan HL (1965) The generation of a unique machine description for chemical structures\u2014a technique developed at Chemical Abstracts Service. J Chem Doc 5(2):107\u2013113","journal-title":"J Chem Doc"},{"issue":"2","key":"640_CR22","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1021\/ci60010a014","volume":"17","author":"C Jochum","year":"1977","unstructured":"Jochum C, Gasteiger J (1977) Canonical numbering and constitutional symmetry. J Chem Inf Comput Sci 17(2):113\u2013117","journal-title":"J Chem Inf Comput Sci"},{"issue":"10","key":"640_CR23","doi-asserted-by":"publisher","first-page":"2111","DOI":"10.1021\/acs.jcim.5b00543","volume":"55","author":"N Schneider","year":"2015","unstructured":"Schneider N, Sayle RA, Landrum GA (2015) Get your atoms in order\u2014an open-source implementation of a novel and robust molecular canonicalization algorithm. J Chem Inf Model 55(10):2111\u20132120","journal-title":"J Chem Inf Model"},{"issue":"6","key":"640_CR24","doi-asserted-by":"publisher","first-page":"1326","DOI":"10.1021\/ja01084a030","volume":"87","author":"R Breslow","year":"1965","unstructured":"Breslow R, Altman LJ, Krebs A, Mohacsi E, Murata I, Peterson RA, Posner J (1965) Substituted cyclopropenones. J Am Chem Soc 87(6):1326\u20131331","journal-title":"J Am Chem Soc"},{"issue":"9","key":"640_CR25","first-page":"12","volume":"2","author":"B Weisfeiler","year":"1968","unstructured":"Weisfeiler B, Leman AA (1968) The reduction of a graph to canonical form and the algebra which appears therein. NTI Series 2(9):12\u201316","journal-title":"NTI Series"},{"key":"640_CR26","doi-asserted-by":"crossref","unstructured":"Kiefer S (2020) Power and limits of the Weisfeiler\u2013Leman algorithm. PhD thesis, RWTH Aachen","DOI":"10.1145\/3436980.3436982"},{"issue":"2","key":"640_CR27","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1021\/ci00012a003","volume":"33","author":"M Razinger","year":"1993","unstructured":"Razinger M, Balasubramanian K, Munk ME (1993) Graph automorphism perception algorithms in computer-enhanced structure elucidation. J Chem Inf Comput Sci 33(2):197\u2013201","journal-title":"J Chem Inf Comput Sci"},{"key":"640_CR28","doi-asserted-by":"crossref","unstructured":"Junttila T, Kaski P (2007) Engineering an efficient canonical labeling tool for large and sparse graphs. In: Proceedings of the workshop on algorithm engineering and experiments (ALENEX). pp 135\u2013149","DOI":"10.1137\/1.9781611972870.13"},{"key":"640_CR29","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1186\/s13321-020-00460-5","volume":"12","author":"L David","year":"2020","unstructured":"David L, Thakkar A, Mercado R, Engkvist O (2020) Molecular representations in AI-driven drug discovery: a review and practical guide. J Cheminform 12:56","journal-title":"J Cheminform"},{"issue":"3","key":"640_CR30","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1021\/c160014a015","volume":"4","author":"H Hiz","year":"1964","unstructured":"Hiz H (1964) A linearization of chemical graphs. J Chem Doc 4(3):173\u2013180","journal-title":"J Chem Doc"},{"issue":"3","key":"640_CR31","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1021\/c160014a017","volume":"4","author":"SH Eisman","year":"1964","unstructured":"Eisman SH (1964) A Polish-type notation for chemical structures. J Chem Doc 4(3):186\u2013190","journal-title":"J Chem Doc"},{"issue":"3","key":"640_CR32","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1021\/c160030a007","volume":"8","author":"WJ Wiswesser","year":"1968","unstructured":"Wiswesser WJ (1968) 107 years of line-formula notations (1861\u20131968). J Chem Doc 8(3):146\u2013150","journal-title":"J Chem Doc"},{"issue":"8","key":"640_CR33","doi-asserted-by":"publisher","first-page":"478","DOI":"10.1021\/ja02046a005","volume":"22","author":"EA Hill","year":"1900","unstructured":"Hill EA (1900) A system of indexing chemical literature; adopted by the classification division of the US patent office. J Am Chem Soc 22(8):478\u2013494","journal-title":"J Am Chem Soc"},{"issue":"2","key":"640_CR34","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1021\/ci60014a015","volume":"18","author":"RE Carhart","year":"1978","unstructured":"Carhart RE (1978) Erroneous claims concerning the perception of topological symmetry. J Chem Inf Comput Sci 18(2):108\u2013110","journal-title":"J Chem Inf Comput Sci"},{"key":"640_CR35","unstructured":"Neuen D, Schweitzer P (2017) Benchmark graphs for practical graph isomorphism. arXiv:1705.03686"},{"key":"640_CR36","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1186\/s13321-020-00453-4","volume":"12","author":"DG Krotko","year":"2020","unstructured":"Krotko DG (2020) Atomic ring invariant and modified CANON extended connectivity algorithm for symmetry perception in molecular graphs and rigorous canonicalization of SMILES. J Cheminform 12:48","journal-title":"J Cheminform"},{"issue":"1","key":"640_CR37","doi-asserted-by":"publisher","first-page":"1102","DOI":"10.1093\/nar\/gky1033","volume":"47","author":"S Kim","year":"2019","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B et al (2019) PubChem 2019 update: improved access to chemical data. Nucleic Acids Res 47(1):1102\u20131109","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"640_CR38","doi-asserted-by":"publisher","first-page":"2698","DOI":"10.1016\/S0021-9258(18)67888-3","volume":"238","author":"RE Canfield","year":"1963","unstructured":"Canfield RE (1963) The amino acid sequence of egg white lysozyme. J Biol Chem 238(8):2698\u20132707","journal-title":"J Biol Chem"},{"issue":"4","key":"640_CR39","doi-asserted-by":"publisher","DOI":"10.1088\/2632-2153\/aba947","volume":"1","author":"M Krenn","year":"2020","unstructured":"Krenn M, H\u00e4se F, Nigan AK, Friedrich P, Aspuru-Guzik A (2020) Self-referencing embedded strings (SELFIES): a 100% robust molecular string representation. Mach Learn Sci Technol 1(4):045024","journal-title":"Mach Learn Sci Technol"},{"key":"640_CR40","unstructured":"Fu T, Gao W, Xiao C, Yasonik J, Coley CW, Sun J (2021) Differentiable scaffolding tree for molecular optimization. arXiv:2109.10469"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00640-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-022-00640-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00640-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,4]],"date-time":"2024-10-04T21:31:51Z","timestamp":1728077511000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-022-00640-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,28]]},"references-count":40,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["640"],"URL":"https:\/\/doi.org\/10.1186\/s13321-022-00640-5","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2022,9,28]]},"assertion":[{"value":"18 March 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 August 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 September 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"66"}}