Abstract
Localization refers to the adaptation of a document’s content to meet the linguistic, cultural, and other requirements of a specific target market―a locale. Transcreation describes the process of adapting a message from one language to another, while maintaining its intent, style, tone, and context. In recent years, pre-trained language models have pushed the limits of natural language understanding and generation and dominated the NLP progress. We foresee that the AI-based pre-trained language models (e.g. masked language modeling) and other existing and upcoming language modeling techniques will be integrated as effective tools to support localization/transcreation efforts in the coming years. To support localization/transcreation tasks, we use AI-based Masked Language Modeling (MLM) to provide a powerful human-machine teaming tool to query language models for the most proper words/phrases to reflect the proper linguistical and cultural characteristics of the target language. For linguistic applications, we list examples on logical connectives, pronouns and antecedents, and unnecessary redundant nouns and verbs. For intercultural conceptualization applications, we list examples of cultural event schema, role schema, emotional schema, and propositional schema. There are two possible approaches to determine where to put masks: a human-based approach or an algorithm-based approach. For the algorithm-based approach, constituency parsing can be used to break a text into sub-phrases, or constituents, after which typical linguistic patterns can be detected and then finally masking tasks can be attempted on the related texts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wang, C., Li, M., Smola, A.: Language Models with Transformers, https://arxiv.org/pdf/1904.09408.pdf, Accessed Oct 2019
Devlin, J., Chang, M-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. https://arxiv.org/abs/1810.04805, Accessed Oct 2018
Masked language modeling demo, AllenNLP, Allen Institute for AI. https://demo.allennlp.org/
Joan Pinkham with the collaboration of Guihua Jiang, The translator’s Guide to Chinglish, Foreign Language. Teaching and Researching Press, China (2000)
Sharifian, F.: Cultural linguistics. In: Chapelle, C.A. (ed.) The encyclopedia of applied linguistics (pp. 1590–1596). New Jersey, Boston, Oxford: Blackwell Publishing Ltd., 2013.
Xu, Z., Sharifian, F.: Unpacking cultural conceptualizations in Chinese English. J. Asian Pac. Commun. 27(1), 65–84 (2017)
Donahue, C., Lee, M., Liang, P.: Models to Fill Enabling Language in the Blanks. arXiv preprint arXiv:2005.05339, Acessed 11 May 2020
Official website of the State Council Information Office, China. White Paper ‘‘China's Energy Development in the New Era’’ (2020)
Constituency Parsing demo, AllenNLP, Allen Institute for AI. https://demo.allennlp.org/constituency-parsing
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Qian, M., Liu, J. (2021). Assisting Text Localization and Transcreation Tasks Using AI-Based Masked Language Modeling. In: Degen, H., Ntoa, S. (eds) Artificial Intelligence in HCI. HCII 2021. Lecture Notes in Computer Science(), vol 12797. Springer, Cham. https://doi.org/10.1007/978-3-030-77772-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-77772-2_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77771-5
Online ISBN: 978-3-030-77772-2
eBook Packages: Computer ScienceComputer Science (R0)