{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T18:29:05Z","timestamp":1732040945031},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"25","license":[{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["ERC-2015-STG-679528"],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Universidad Nacional de Educacion Distancia"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput & Applic"],"published-print":{"date-parts":[[2023,9]]},"abstract":"The splitting of words into stressed and unstressed syllables is the foundation for the scansion of poetry, a process that aims at determining the metrical pattern of a line of verse within a poem. Intricate language rules and their exceptions, as well as poetic licenses exerted by the authors, make calculating these patterns a nontrivial task. Some rhetorical devices shrink the metrical length, while others might extend it. This opens the door for interpretation and further complicates the creation of automated scansion algorithms useful for automatically analyzing corpora on a distant reading fashion. In this paper, we compare the automated metrical pattern identification systems available for Spanish, English, and German, against fine-tuned monolingual and multilingual language models trained on the same task. 
Despite being initially conceived as models suitable for semantic tasks, our results suggest that transformers-based models retain enough structural information to perform reasonably well for Spanish on a monolingual setting, and outperforms both for English and German when using a model trained on the three languages, showing evidence of the benefits of cross-lingual transfer between the languages.","DOI":"10.1007\/s00521-021-06692-2","type":"journal-article","created":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T03:02:36Z","timestamp":1636945356000},"page":"18171-18176","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models"],"prefix":"10.1007","volume":"35","author":[{"ORCID":"http:\/\/orcid.org\/0000-0002-9143-5573","authenticated-orcid":false,"given":"Javier","family":"de la Rosa","sequence":"first","affiliation":[]},{"given":"\u00c1lvaro","family":"P\u00e9rez","sequence":"additional","affiliation":[]},{"given":"Mirella","family":"de Sisto","sequence":"additional","affiliation":[]},{"given":"Laura","family":"Hern\u00e1ndez","sequence":"additional","affiliation":[]},{"given":"Aitor","family":"D\u00edaz","sequence":"additional","affiliation":[]},{"given":"Salvador","family":"Ros","sequence":"additional","affiliation":[]},{"given":"Elena","family":"Gonz\u00e1lez-Blanco","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,11,15]]},"reference":[{"key":"6692_CR1","doi-asserted-by":"crossref","unstructured":"Agirrezabal M, Alegria I, Hulden M (2017) A comparison of feature-based and neural scansion of poetry. 
In: Proceedings of the international conference recent advances in natural language processing, Ranlp 2017, pp 18\u201323","DOI":"10.26615\/978-954-452-049-6_003"},{"key":"6692_CR2","doi-asserted-by":"crossref","unstructured":"Agirrezabal M, Astigarraga A, Arrieta B, Hulden M (2016) Zeuscansion: a tool for scansion of english poetry. J Lang Model 4","DOI":"10.15398\/jlm.v4i1.102"},{"key":"6692_CR3","unstructured":"Algee-Hewitt M, Heuser R, Kraxenberger M, Porter J, Sensenbaugh J, Tackett J (2014) The stanford literary lab transhistorical poetry project phase II: metrical form. In: DH"},{"key":"6692_CR4","doi-asserted-by":"crossref","unstructured":"Anttila A, Heuser R (2016) Phonological and metrical variation across genres. In: Proceedings of the annual meetings on phonology, vol\u00a03","DOI":"10.3765\/amp.v3i0.3679"},{"key":"6692_CR5","first-page":"119","volume-title":"Current trends in metrical analysis","author":"K Bobenhausen","year":"2011","unstructured":"Bobenhausen K (2011) The metricalizer2-automated metrical markup of German poetry. Current trends in metrical analysis. Peter Lang, Bern, pp 119\u2013131"},{"key":"6692_CR6","doi-asserted-by":"publisher","first-page":"67","DOI":"10.3917\/lang.199.0067","volume":"199","author":"K Bobenhausen","year":"2015","unstructured":"Bobenhausen K, Hammerich B (2015) Literary metrics, linguistic metrics, and the algorithmic analysis of German poetry using metricalizer (2). Langages 199:67","journal-title":"Langages"},{"key":"6692_CR7","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T (2017) Enriching word vectors with subword information. Trans Assoc Comput Linguist 5:135\u2013146","journal-title":"Trans Assoc Comput Linguist"},{"key":"6692_CR8","unstructured":"Ca\u00f1ete J, Chaperon G, Fuentes R, Ho JH, Kang H, P\u00e9rez J (2020). Spanish pre-trained bert model and evaluation data. 
In: PML4DC at ICLR 2020"},{"key":"6692_CR9","doi-asserted-by":"publisher","unstructured":"Chan, B., Schweter, S., M\u00f6ller, T (2020) German\u2019s next language model. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 6788\u20136796. International Committee on Computational Linguistics, Barcelona, Spain (Online) . https:\/\/doi.org\/10.18653\/v1\/2020.coling-main.598. https:\/\/www.aclweb.org\/anthology\/2020.coling-main.598","DOI":"10.18653\/v1\/2020.coling-main.598"},{"key":"6692_CR10","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzm\u00e1n, F., Grave, E., Ott, M., Zettlemoyer, L., Stoyanov, V (2019) Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"6692_CR11","doi-asserted-by":"crossref","unstructured":"Conneau, A., Kruszewski, G., Lample, G., Barrault, L., Baroni, M (2018) What you can cram into a single vector: Probing sentence embeddings for linguistic properties. arXiv preprint arXiv:1805.01070","DOI":"10.18653\/v1\/P18-1198"},{"key":"6692_CR12","unstructured":"Devlin, J., Chang, M.W., Lee, K., Toutanova, K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805"},{"key":"6692_CR13","first-page":"1","volume":"73","author":"F Dimpel","year":"2015","unstructured":"Dimpel F (2015) Automatische mittelhochdeutsche metrik 2.0. Phil. Netz 73:1\u201326","journal-title":"Netz"},{"key":"6692_CR14","doi-asserted-by":"crossref","unstructured":"Estes A, Hench C (2016) Supervised machine learning for hybrid meter. In: Proceedings of the fifth workshop on computational linguistics for literature, pp 1\u20138","DOI":"10.18653\/v1\/W16-0201"},{"key":"6692_CR15","doi-asserted-by":"crossref","unstructured":"Gerv\u00e1s P (2000). A logic programming application for the analysis of Spanish verse. 
In: International conference on computational logic, pp 1330\u20131344. Springer","DOI":"10.1007\/3-540-44957-4_89"},{"key":"6692_CR16","unstructured":"Groves PL (1998) Strange music: the metre of the English heroic line. English Literary Studies (University of Victoria)"},{"key":"6692_CR17","unstructured":"Haider, T., Eger, S., Kim, E., Klinger, R., Menninghaus, W (2020) Po-emo: conceptualization, annotation, and modeling of aesthetic emotions in German and English poetry. arXiv preprint arXiv:2003.07723"},{"key":"6692_CR18","unstructured":"Haider, T., Kuhn, J (2018) Supervised rhyme detection with Siamese recurrent networks. In: Proceedings of the second joint SIGHUM workshop on computational linguistics for cultural heritage, social sciences, humanities and literature, pp 81\u201386"},{"key":"6692_CR19","unstructured":"Hartman, C.O (2005) The Scandroid 1.1. http:\/\/oak.conncoll.edu\/cohar\/Programs.htm. Accessed 20 July 2020"},{"key":"6692_CR20","unstructured":"Hewitt J, Manning CD (2019) A structural probe for finding syntax in word representations. pp 4129\u20134138"},{"key":"6692_CR21","doi-asserted-by":"crossref","unstructured":"Howard, J., Ruder, S (2018) Universal language model fine-tuning for text classification. arXiv preprint arXiv:1801.06146","DOI":"10.18653\/v1\/P18-1031"},{"key":"6692_CR22","unstructured":"Huber A (2020) Eighteenth-century poetry archive"},{"key":"6692_CR23","doi-asserted-by":"crossref","unstructured":"Joulin A, Grave E, Bojanowski P, Mikolov T (2016) Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759","DOI":"10.18653\/v1\/E17-2068"},{"key":"6692_CR24","doi-asserted-by":"crossref","unstructured":"Kim Y (2014) Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882","DOI":"10.3115\/v1\/D14-1181"},{"key":"6692_CR25","unstructured":"Le Q, Mikolov T (2014) Distributed representations of sentences and documents. 
In: International conference on machine learning, pp 1188\u20131196"},{"key":"6692_CR26","doi-asserted-by":"crossref","unstructured":"Liu NF, Gardner M, Belinkov Y, Peters ME, Smith NA (2019) Linguistic knowledge and transferability of contextual representations. arXiv preprint arXiv:1903.08855","DOI":"10.18653\/v1\/N19-1112"},{"issue":"1","key":"6692_CR27","first-page":"19","volume":"15","author":"HM Logan","year":"1988","unstructured":"Logan HM (1988) Computer analysis of sound and meter in poetry. Coll Lit 15(1):19\u201324","journal-title":"Coll Lit"},{"key":"6692_CR28","unstructured":"Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111\u20133119"},{"key":"6692_CR29","volume-title":"Distant reading","author":"F Moretti","year":"2013","unstructured":"Moretti F (2013) Distant reading. Verso Books, Brooklyn"},{"issue":"1","key":"6692_CR30","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1093\/llc\/fqx009","volume":"33","author":"B Navarro-Colorado","year":"2017","unstructured":"Navarro-Colorado B (2017) A metrical scansion system for fixed-metre Spanish poetry. Digit Scholarsh Humanit 33(1):112\u2013127","journal-title":"Digit Scholarsh Humanit"},{"key":"6692_CR31","unstructured":"Navarro-Colorado B, Lafoz MR, S\u00e1nchez N (2016) Metrical annotation of a large corpus of spanish sonnets: representation, scansion and evaluation. In: International conference on language resources and evaluation, pp 4360\u20134364"},{"key":"6692_CR32","doi-asserted-by":"crossref","unstructured":"Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L (2018) Deep contextualized word representations. 
arXiv preprint arXiv:1802.05365","DOI":"10.18653\/v1\/N18-1202"},{"key":"6692_CR33","unstructured":"Radford, A., Narasimhan, K., Salimans, T., Sutskever, I (2018) Improving language understanding by generative pre-training"},{"key":"6692_CR34","first-page":"83","volume":"65","author":"J de la Rosa","year":"2020","unstructured":"de la Rosa J, P\u00e9rez \u00c1, Hern\u00e1ndez L, Ros S, Gonz\u00e1lez-Blanco E (2020) Rantanplan, fast and accurate syllabification and scansion of Spanish poetry. Procesamiento del Lenguaje Natural 65:83\u201390","journal-title":"Procesamiento del Lenguaje Natural"},{"issue":"2","key":"6692_CR35","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1353\/vp.2011.0016","volume":"49","author":"HF Tucker","year":"2011","unstructured":"Tucker HF (2011) Poetic data and the news from poems: a for better for verse memoir. Vic Poet 49(2):267\u2013281","journal-title":"Vic Poet"},{"key":"6692_CR36","doi-asserted-by":"crossref","unstructured":"Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M et al (2019) Huggingface\u2019s transformers: state-of-the-art natural language processing. 
ArXiv arXiv-1910","DOI":"10.18653\/v1\/2020.emnlp-demos.6"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06692-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-021-06692-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06692-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,22]],"date-time":"2023-08-22T08:14:22Z","timestamp":1692692062000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-021-06692-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,15]]},"references-count":36,"journal-issue":{"issue":"25","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["6692"],"URL":"https:\/\/doi.org\/10.1007\/s00521-021-06692-2","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"value":"0941-0643","type":"print"},{"value":"1433-3058","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,15]]},"assertion":[{"value":"2 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 October 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of 
interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}}]}}