Abstract
This paper presents an agent-based model of the emergence and evolution of a language system for Boolean coordination. The model assumes the agents have cognitive capacities for invention, adoption, abstraction, repair and adaptation, a common lexicon for basic concepts, and the ability to construct complex concepts using recursive combinations of basic concepts and logical operations such as negation, conjunction or disjunction. It also supposes the agents initially have neither a lexicon for logical operations nor the ability to express logical combinations of basic concepts through language. The results of the experiments we have performed show that a language system for Boolean coordination emerges as a result of a process of self-organisation of the agents’ linguistic interactions when these agents adapt their preferences for vocabulary, syntactic categories and word order to those they observe are used more often by other agents. Such a language system allows the unambiguous communication of higher-order logic terms representing logical combinations of basic properties with non-trivial recursive structure, and it can be reliably transmitted across generations according to the results of our experiments. Furthermore, the conceptual and linguistic systems, and simplification and repair operations of the agent-based model proposed are more general than those defined in previous works, because they not only allow the simulation of the emergence and evolution of a language system for the Boolean coordination of basic properties, but also for the Boolean coordination of higher-order logic terms of any Boolean type which can represent the meaning of nouns, sentences, verbs, adjectives, adverbs, prepositions, prepositional phrases and subexpressions not traditionally analysed as forming constituents, using linguistic devices such as syntactic categories, word order and function words.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
\(\text{ Bool }\) is the type of Boolean values and \(\text{ Ind }\) the type of individuals. See Sect. 2.1.1 for a definition of symple types in the \(\lambda \)-Calculus.
\(\text{ up }\) and \(\text{ le }\) are constants of type \(\text{ Ind }\rightarrow \text{ Bool }\) denoting the properties of being in an upper and left position respectively. If the agents used First Order Logic instead of Higher Order Logic to represent their meanings, they would approximate the meaning of this noun phrase by using formula \((\text{ up }(x) \vee \text{ le }(x)) \wedge \lnot (\text{ up }(x) \wedge \text{ le }(x))\).
When we omit parentheses from functional types, we assume associativity is applied to the right, so that \(\text{ Ind }\rightarrow \text{ Ind }\rightarrow \text{ Bool }\) is equivalent to \(\text{ Ind }\rightarrow (\text{ Ind }\rightarrow \text{ Bool })\).
Assuming square and upper have the standard natural language interpretations, \(square(o_1)\) will be true if \(o_1\) denotes an object that is a square, upper(square) will denote the property of being a square situated in an upper position, and \(upper(square(o_1))\) will be true if \(o_1\) is a square which is in an upper position.
Treating move as a binary relation means interpreting \(move(x,r_1)\) as ‘\(r_1\) moves x’. Currying move implies that move(x) means ‘moves x’, and \(move(x)(r_1)\) means ‘\(r_1\) moves x’.
Generalized quantifiers, e.g. \(\mathtt{every}_{\tau }\) or \(\mathtt{some}_{\tau }\), quantify over objects of type \(\tau \).
See definition 6 on page 9 for formal definitions of lexicon and lexical entry.
See Sect. 3.1 for a formal definition of the set of syntactic categories used in this paper.
We will omit parentheses within categories, taking the forward slash to be left associative, the backward slash to be right associative, and the backward slash bind more tightly than the forward slash. Thus \((\text{ np }\backslash \text{ s })/ \text{ np }\) is equivalent to \(\text{ np }\backslash \text{ s }/ \text{ np }\).
Lambek extended pure applicative categorial grammar according to a simple algebraic interpretation of the slashes. In the Lambek Calculus an expression is assigned to category \(A\slash B\) (respectively, to category \(B\backslash A\)) if and only if when it is followed (respectively, preceded) by an expression of category B, it produces an expression of category A. However, applicative categorial grammar only respects one half of the biconditional.
Syntactic categories pr / pr or pr\(\backslash \)pr are used to place the word associated with not before or after the expression associated with its argument. Categories pr / pr / pr, pr\(\backslash \)pr / pr and pr\(\backslash \)pr\(\backslash \)pr place the words associated with and,or before, between or after the expressions associated with their arguments.
Affixes are groups of letters attached to words.
A conceptualisation of a subset of objects (i.e. a meaning) is a higher-order logic term that is true for all the objects in the subset and false for the rest of the objects in the speaker’s and hearer’s context.
If F is a constant of type \(\mathtt{\text{ Ind }\rightarrow \text{ Bool }}\), i.e. a basic property, invention is not necessary, because an entry for F already exists in the common lexicon.
The reason for this is that the lexical entries for basic properties are part of the common lexicon of basic concepts initially shared by all the agents in the population.
It should be noted that although the choice of words for expressing different Boolean operators is independent of each other, the choice of a lexical entry for expressing a particular operator in a given sentence is always the same for all the occurrences of such an operator in the higher-order term expressed by such a sentence. Likewise, the choice of a lexical entry for parsing a particular word in a given sentence is always consistent for all the occurrences of such word in the sentence.
In particular, the Ciao Prolog System [10], available from www.clip.dia.fi.upm.es, has been used.
The same arguments as those used in Sect. 4.1 can be used to justify the method used to compute the score of a particular meaning obtained by parsing a given expression.
In the experiments, the syntactic category C of the expressions invented by the agents is pr.
In the experiments, the agents do not try to simplify associations between expressions and more complex meanings, except in the case of learning the expressions used by other agents to refer to the grouping operator id, which we discuss in the next subsection in the context of repair operations.
On the other hand, if a repair operation had been applied to a sentence that contained the logical operator ’or’ before any repair operation had been applied to a sentence that contained the logical operator ’and’, lexical entry 10 would have been replaced with lexical entry 14 in step 1, and lexical entry 13 would have been added to the agent’s lexicon in step 2. Later on, if the agent applies a repair operation to a sentence containing ’and’, it would replace lexical entry 9 with lexical entry 12 and step 2 would not be applied.
We describe repair operations for arbitrary syntactic categories c, because they are applicable to lexical entries for coordinators of expressions of any Boolean type. However, in the experiments, c is always pr.
They could have been triggered by the generation of ambiguous expressions for other types of meanings, e.g. \(\mathtt{and(\alpha )(not(\beta ))}\) or \(\mathtt{and(\alpha )(or(\beta )(\gamma ))}\). But in the present experiments this is not the case.
We use again an arbitrary syntactic category c to describe the adoption of lexical entries constructed during the application of repair operations, because the mechanisms proposed are applicable to lexical entries for coordinators of expressions of any Boolean type. However, in the experiments, c is always pr.
\(\mu \) is the initial score agents assign to the lexical entries they create during simplification or repair.
Adaptation captures the speed at which an agent can adapt to a new situation, which determines the time it takes for a population to agree on a common language. Amplification relates to the extent to which an agent is able to escape from a behaviour which represents a suboptimal fixed point of its response function. This is achieved by amplifying small deviations from suboptimal equilibrium states, thus making them unstable.
Note that null lexical variability means that all the agents in the population prefer the same expression (both word and position) for naming the Boolean operators (and,or,not) and the grouping operator (id).
In [18], the number of words invented per object in the naming game is estimated to be in O(n), where n is the population size, and the time for n / 2 words to spread in the population in \(O(n^2 \cdot \ln (n))\). The asymptotic notation O(g(n)) is defined as follows. For a given function g(n), we denote by O(g(n)) the set of functions \(O(g(n)) = \{f(n) \; : \; \text{ there } \text{ exist } \text{ positive } \text{ constants } \; c \; \text{ and } \; n_0 \; \text{ such } \text{ that } \; 0 \le f(n) \le c \cdot g(n) \; \text{ for } \text{ all } \; n \ge n_0\}\).
The initial groups of elder, adult and young agents contain \(\frac{n}{3}, \frac{n}{3}+n\;{ mod}(3)\) and \(\frac{n}{3}\) agents respectively, where n is the population size, \(\frac{n}{3}\) is the integer division of n by 3 and \(n\;\text{ mod }(3)\) the remainder.
See appendix A of [42] for a discussion of the expressiveness of the language systems constructed in the experiments reported in that paper, and a proof of the ability of such language systems to unambiguously express every propositional logic formula, which are represented in [42] using Lisp-like notation [34, 35].
References
Ajdukiewicz, K. (1935). Die syntaktische konnexitat. Studia Philosophica, 1(1), 1–27.
Alfonseca, M., & Soler-Gil, F. J. (2013). Evolving an ecology of mathematical expressions with grammatical evolution. Biosystems, 111(2), 111–119.
Alfonseca, M., & Soler-Gil, F. J. (2015). Evolving a predator-prey ecosystem of mathematical expressions with grammatical evolution. Complexity, 20(3), 66–83. https://doi.org/10.1002/cplx.21507. Published online.
Allen, J. (1995). Natural language understanding (2nd ed.). San Francisco: The Benjamin/Cummings Publishing Company.
Bar-Hillel, Y. (1950). On syntactical categories. Journal of Symbolic Logic, 15, 1–16.
Baroncheli, A., Felici, M., Caglioti, E., Loreto, V., & Steels, L. (2006). Sharp transition towards shared vocabularies in multi-agent systems. Journal of Statistical Mechanics, 2006, P06014.
Batali, J. (1998). Computational simulations of the emergence of grammar. In: J. R. Hurforf, M. Studdert-Kennedy, & C. Knight (Eds.), Approaches to the evolution of language: Social and cognitive bases (pp. 405–426). Cambridge University Press.
Beuls, K., & Steels, L. (2013). Agent-based models of strategies for the emergence and evolution of grammatical agreement. PLoS ONE, 8(3), e58960.
Briscoe, T. (Ed.). (2002). Linguistic Evolution through Language Acquisition: Formal and Computational Models. Cambridge: Cambridge University Press.
Bueno, F. Cabeza, D. Carro, M. Hermenegildo, M. López-García P. & Puebla, G. (August 1997) The Ciao Prolog system. reference manual. Technical Report CLIP3/97.1, School of Computer Science, Technical University of Madrid (UPM), Available from http://www.clip.dia.fi.upm.es/.
Carpenter, B. (1997). Type-logical semantics. Cambridge: MIT Press.
Cartmill, E. A., Roberts, S., Lyn, H., & Cornish, H. (Eds.). (2014). Evolution of Language, Proceedings of the Tenth International Conference EVOLANG. Singapore: World Scientific.
Church, A. (1940). A formulation of a simple theory of types. Journal of Symbolic Logic, 5, 56–68.
Clocksin, W. F., & Mellish, C. S. (Eds.). (1996). Programming in prolog (4th ed.). Berlin: Springer.
Colmerauer, A. Kanoui, H. Pasero, R. & Roussel, P. (1973). Un système de Communication Homme-machine en Francais, Research Report. Technical report, Groupe Intelligence Artificielle, Université Aix-Marseille II, France.
Dall’Asta, L., Baronchelli, A., Barrat, A., & Loreto, V. (2006). Non-equilibrium dynamics of language games on complex networks. Physical Review, 74(3), 036105.
de Boer, B. (2001). The origins of vowels systems. Oxford: Oxford University Press.
de Vylder, B. (2008). The Evolution of conventions in multi-agent systems. Ph.D. thesis, Artificial Intelligence Lab, Free University of Brussels.
Di Sciullo, M., & Boeckx, C. (Eds.). (2011). The biolinguistic enterprise. New perspectives on the evolution and nature of the human language faculty. Oxford: Oxford University Press.
Garcia-Casademont, E., & Steels, L. (2016). Insight grammar learning. Journal of Cognitive Science, 17(1), 27–62.
Gazdar, G. (1980). A cross-categorial semantics for coordination. Linguistics and Philosophy, 3(3), 407–409.
Gerasymova, K. Spranger, M. & Beuls, K. (2012). A language strategy for aspect: Encoding aktionsarten through morphology. In Experiments in cultural language evolution (pp. 257–276). Springer.
Grosz, B., Jones, K., & Webber, B. (Eds.). (1986). Readings in natural language processing. Burlington: Morgan Kaufmann.
Hurford, J., Studdert-Kennedy, M., & Kight, C. (Eds.). (1998). Approaches to the evolution of language: Social and cognitive bases. Cambridge University Press.
Kay, M. (1973). The MIND system. In Natural language processing, pp. 155–188. Algorithmics Press.
Kay, M. (1980). Algorithm schemata and data structures in syntactic processing. Technical Report CSL-80-12, Xerox Corporation, Reprinted in [22].
Keenan, E. L., & Faltz, L. M. (1985). Boolean semantics for natural language. In L. McNally, Y. Sharvit, & Z. Szabo (Eds.), Studies in Linguistics and Philosophy (vol. 23). Netherlands: Springer.
Kirby, S. (2002). Learning, bottlenecks and the evolution of recursive syntax. In Linguistic evolution through language acquisition: Formal and computational models, pp. 96–109. Cambridge University Press.
Lambek, J. (1958). The mathematics of sentence structure. American Mathematical Monthly, 65, 154–169.
Lambek, J. (1961). On the calculus of syntactic types. In R. Jakobson (Ed.) Structure of language and its mathematical aspects. Proceedings of symposia in applied mathematics (pp. 166–178). American Mathematical Society.
Lara, J. & Alfonseca, M. (2000). Some strategies for the simulation of vocabulary agreement in multi-agents communities. Journal of Artificial Societies and Social Simulation, 3(4). Retrieved January 1, 2013 from http://jasss.soc.surrey.ac.uk/3/4/2.html.
Lara, J. & Alfonseca, M. (2002). The role of oblivion, memory size and spatial separation in dynamic language games. Journal of Artificial Societies and Social Simulation, 5(2). Retrieved January 1, 2013 from http://jasss.soc.surrey.ac.uk/5/2/1.html.
Lyon, C., Nehaniv, C., & Cangelosi, A. (Eds.). (2007). Emergence of language and communication., Lecture notes in computer science Berlin: Springer.
McCarthy, J. (1960). Recursive functions of symbolic expressions and their computation by machine, part I. Communications of the ACM, 3(4), 184–195.
McCarthy, J. (1990). Formalizing Common Sense. Papers by John McCarthy. Ablex. Edited by Vladimir Lifschitz.
Minett, J. W., & Wang, W. S. Y. (Eds.). (2005). Language acquisition, change and emergence: Essays in evolutionary linguistics. Hong Kong: City University of Hong Kong Press.
Montague, R. (1973). The proper treatment of quantification in ordinary English. In Approaches to Natural Language: Proceedings of the 1970 Stanford Workshop on Grammar and Semantics. Dordrech: Reidel, Reprinted in Thomanson. R. editor, Formal Philosophy, pp. 247–270. New Haven: Yale University Press.
Piaget, J. (1985). The Equilibration of Cognitive Structures: The Central Problem of Intellectual Development. Chicago: University of Chicago Press.
Sierra-Santibáñez, J. (2001). Grounded models as a basis for intuitive reasoning. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, IJCAI-2001, pp. 401–406.
Sierra-Santibáñez, J. (2002) Grounded models as a basis for intuitive and deductive reasoning: The acquisition of logical categories. In Proceedings of the European Conference on Artificial Intelligence, ECAI-2002, pp. 93–97.
Sierra-Santibáñez, J. (2014). An agent-based model studying the acquisition of a language system of logical constructions. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI-2014, pp. 350–357. AAAI Press.
Sierra-Santibáñez, J. (2015). An agent-based model of the emergence and transmission of a language system for the expression of logical combinations. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI-2015, pp. 492–499. AAAI Press.
Steedman, M. (1985). Dependency and coordination in the grammar of Dutch and English. Language, 61, 523–568.
Steels, L. (1995). A self-organizing spatial vocabulary. Artificial Life, 2(3), 319–332.
Steels, L. (1998). The origins of ontologies and communication conventions in multi-agent systems. Autonomous Agents and Multi-Agent Systems, 1(2), 169–194.
Steels, L. (1998). The origins of syntax in visually grounded robotic agents. Artificial Intelligence, 103(1–2), 133–156.
Steels, L. (2011). Modeling the cultural evolution of language. Physics of Life Reviews, 8, 339–356.
Steels, L. (Ed.). (2012). Experiments in cultural language evolution. Amsterdam: John Benjamins.
Steels, L. (2015). The Talking Heads experiment: Origins of words and meanings (Computational Models of Language Evolution) (Volume 1). Berlin: Language Science Press.
Steels, L. (2016). Agent-based models for the emergence and evolution of grammar. Phil. Trans. R. Soc. B, 371(20150447), 1–9.
Steels, L., & Belpaeme, T. (2005). Coordinating perceptually grounded categories through language: a case study for colour. Behavioral and Brain Sciences, 28, 469–529.
Steels, L. & Garcia-Casademont, E. (2015) How to play the syntax game. In Proceedings of the European Conference on Artificial Life, pp. 479–486. MIT Press.
Steels, L. & Vogt, P. (1997) Grounding adaptive language games in robotic agents. In Proceedings of the European Conference on Artificial Life, MIT Press.
Tomasello, M. (2003). Constructing a Language: A Usage-Based Theory of Language Acquisition. Cambridge: Harvard Univ Press.
Tomasello, M. (2006). Acquiring linguistic constructions. In Handbook of child psychology. Wiley Online Library.
Vogt, P. (2005). The emergence of compositional structures in perceptually grounded language games. Artificial Intelligence, 167(1–2), 206–242.
Williams, M. A., McCarthy, J., Gardenfors, P., Stanton, C., & Karol, A. (2009). A grouding framework. Autonomous Agents and Multi-Agent Systems, 19, 272–296.
Wittgenstein, L. (1953). Philosophical investigations. New York: Macmillan.
Zuidema, W., & de Boer, B. (2010). Multi-agent simulations of the evolution of combinatorial phonology. Adaptive Behavior, 18(2), 141–154.
Acknowledgements
I should like to express my gratitude to Manuel Alfonseca for reading several versions of this paper and providing valuable comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work has been partially supported by funds from the Spanish Ministry for Economy and Competitiveness (MINECO) and the European Union (FEDER funds) under grant GRAMM (TIN2017-86727-C2-1-R), grant APCOM (TIN2014-57226-P), and Generalitat de Catalunya (AGAUR) under projects ALBCOM (2017 SGR 786) and LARCA (2014 SGR 890).
Rights and permissions
About this article
Cite this article
Sierra-Santibáñez, J. An agent-based model of the emergence and evolution of a language system for boolean coordination. Auton Agent Multi-Agent Syst 32, 417–458 (2018). https://doi.org/10.1007/s10458-018-9384-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10458-018-9384-1