Abstract
We provide an in-depth study of Nash equilibria in multi-objective normal-form games (MONFGs), i.e., normal-form games with vectorial payoffs. Taking a utility-based approach, we assume that each player’s utility can be modelled with a utility function that maps a vector to a scalar utility. In the case of a mixed strategy, it is meaningful to apply such a scalarisation both before calculating the expectation of the payoff vector as well as after. This distinction leads to two optimisation criteria. With the first criterion, players aim to optimise the expected value of their utility function applied to the payoff vectors obtained in the game. With the second criterion, players aim to optimise the utility of expected payoff vectors given a joint strategy. Under this latter criterion, it was shown that Nash equilibria need not exist. Our first contribution is to provide a sufficient condition under which Nash equilibria are guaranteed to exist. Secondly, we show that when Nash equilibria do exist under both criteria, no equilibrium needs to be shared between the two criteria, and even the number of equilibria can differ. Thirdly, we contribute a study of pure strategy Nash equilibria under both criteria. We show that when assuming quasiconvex utility functions for players, the sets of pure strategy Nash equilibria under both optimisation criteria are equivalent. This result is further extended to games in which players adhere to different optimisation criteria. Finally, given these theoretical results, we construct an algorithm to compute all pure strategy Nash equilibria in MONFGs where players have a quasiconvex utility function.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bergstresser, K., & Yu, P. L. (1977). Domination structures and multicriteria problems in n-person games. Theory and Decision, 8(1), 5–48. https://doi.org/10.1007/BF00133085.
Blackwell, D. (1954). An analog of the minimax theorem for vector payoffs. Pacific Journal of Mathematics, 6(1), 1–8. https://doi.org/10.2140/pjm.1956.6.1.
Breinbjerg, J. (2017). Equilibrium arrival times to queues with general service times and non-linear utility functions. European Journal of Operational Research, 261(2), 595–605. https://doi.org/10.1016/j.ejor.2017.03.010.
Busoniu, L., Babuska, R., & Schutter, B. D. (2008). A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 38(2), 156–172. https://doi.org/10.1109/TSMCC.2007.913919.
Corley, H. W. (1985). Games with vector payoffs. Journal of Optimization Theory and Applications, 47(4), 491–498. https://doi.org/10.1007/BF00942194.
Corley, H. W. (2020). A regret-based algorithm for computing all pure nash equilibria for noncooperative games in normal form. Theoretical Economics Letters, 10(06), 1253–1259. https://doi.org/10.4236/tel.2020.106076.
Dragomir, S. S., & Pearce, C. E. (2012). Jensen’s inequality for quasiconvex functions. Numerical Algebra, Control and Optimization, 2(2), 279–291. https://doi.org/10.3934/naco.2012.2.279.
Fudenberg, D., & Tirole, J. (1991). Game theory. MIT Press.
Hayes, C. F., Rădulescu, R., Bargiacchi, E., Källström, J., Macfarlane, M., Reymond, M., et al. (2022). A practical guide to multi-objective reinforcement learning and planning. Autonomous Agents and Multi-Agent Systems, 36(1), 26. https://doi.org/10.1007/s10458-022-09552-y.
Hayes, C. F., Reymond, M., Roijers, D. M., Howley, E., & Mannion, P. (2021). Distributional Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement Learning. In Proceedings of the 20th international conference on autonomous agents and multiagent systems (AAMAS 2021) (p. 3). IFAAMAS
He, J., Li, Y., Li, H., Tong, H., Yuan, Z., Yang, X., & Huang, W. (2020). Application of game theory in integrated energy system systems: A review. IEEE Access, 8, 93380–93397. https://doi.org/10.1109/ACCESS.2020.2994133.
Ismaili, A. (2018). On existence, mixtures, computation and efficiency in multi-objective games. In T. Miller, N. Oren, Y. Sakurai, I. Noda, B. T. R. Savarimuthu, & T. Cao Son (Eds.), PRIMA 2018: Principles and practice of multi-agent systems (pp. 210–225). Springer.
Koppelman, F. S. (1981). Non-linear utility functions in models of travel choice behavior. Transportation, 10(2), 127–146. https://doi.org/10.1007/BF00165262.
Leyton-Brown, K., & Shoham, Y. (2008). Essentials of game theory: A concise multidisciplinary introduction. Synthesis Lectures on Artificial Intelligence and Machine Learning, 2(1), 1–88.
Lozovanu, D., Solomon, D., & Zelikovsky, A. (2005). Multiobjective games and determining Pareto–Nash equilibria. The Bulletin of Academy of Sciences of Moldova, Mathematics, 3(49), 115–122.
Nash, J. (1951). Non-cooperative games. The Annals of Mathematics, 54(2), 286. https://doi.org/10.2307/1969529.
Nowé, A., Vrancx, P., & De Hauwere, Y. M. (2012). Game theory and multi-agent reinforcement learning. In M. Wiering & M. van Otterlo (Eds.), Reinforcement learning: State-of-the-art (pp. 441–470). Springer. https://doi.org/10.1007/978-3-642-27645-3_14.
Porter, R., Nudelman, E., & Shoham, Y. (2008). Simple search methods for finding a Nash equilibrium. Games and Economic Behavior, 63(2), 642–662. https://doi.org/10.1016/j.geb.2006.03.015.
Rădulescu, R. (2021). Decision making in multi-objective multi-agent systems: A utility-based perspective. Ph.D. thesis, Vrije Universiteit Brussel, Brussels
Rădulescu, R., Mannion, P., Roijers, D. M., & Nowé, A. (2020). Multi-objective multi-agent decision making: A utility-based analysis and survey. Autonomous Agents and Multi-Agent Systems, 34(1), 10–10. https://doi.org/10.1007/s10458-019-09433-x.
Rădulescu, R., Mannion, P., Zhang, Y., Roijers, D. M., & Nowé, A. (2020). A utility-based analysis of equilibria in multi-objective normal-form games. The Knowledge Engineering Review, 35, e32. https://doi.org/10.1017/S0269888920000351.
Rădulescu, R., Verstraeten, T., Zhang, Y., Mannion, P., Roijers, D. M., & Nowé, A. (2022). Opponent learning awareness and modelling in multi-objective normal form games. Neural Computing and Applications, 34(3), 1759–1781. https://doi.org/10.1007/s00521-021-06184-3.
Roijers, D. M., Steckelmacher, D., & Nowé, A. (2018). Multi-objective reinforcement learning for the expected utility of the return. In Adaptive learning agents workshop at AAMAS Stockholm, Sweden
Roijers, D. M., Vamplew, P., Whiteson, S., & Dazeley, R. (2013). A survey of multi-objective sequential decision-making. Journal of Artificial Intelligence Research, 48, 67–113. https://doi.org/10.1613/jair.3987.
Roijers, D. M., & Whiteson, S. (2017). Multi-objective decision making. Synthesis Lectures on Artificial Intelligence and Machine Learning, 11(1), 1–129.
Roijers, D. M., Zintgraf, L. M., Libin, P., Reymond, M., Bargiacchi, E., & Nowé, A. (2021). Interactive multi-objective reinforcement learning in multi-armed bandits with gaussian process utility models. In F. Hutter, K. Kersting, J. Lijffijt, & I. Valera (Eds.), Machine learning and knowledge discovery in databases (pp. 463–478). Springer. https://doi.org/10.1007/978-3-030-67664-3_28.
Rosenthal, R. W. (1973). A class of games possessing pure-strategy Nash equilibria. International Journal of Game Theory, 2(1), 65–67. https://doi.org/10.1007/BF01737559.
Roughgarden, T. (2016). Introduction and Examples (pp. 1–10). Cambridge University Press. https://doi.org/10.1017/CBO9781316779309.002.
Röpke, W. (2021). Communication in multi-objective games. Master’s thesis, Vrije Universiteit Brussel
Röpke, W., Roijers, D. M., Nowé, A., & Rădulescu, R. (2021). On nash equilibria for multi-objective normal-form games under scalarised expected returns versus expected scalarised returns. In MODeM workshop
Röpke, W., Rădulescu, R., Roijers, D. M., & Nowé, A. (2021). Communication strategies in multi-objective normal-form games. In Adaptive and learning agents workshop at AAMAS
Samuelson, L. (2016). Game theory in economics and beyond. Journal of Economic Perspectives, 30(4), 107–130. https://doi.org/10.1257/jep.30.4.107.
Shapley, L. S., & Rigby, F. D. (1959). Equilibrium points in games with vector payoffs. Naval Research Logistics Quarterly, 6(1), 57–61. https://doi.org/10.1002/nav.3800060107.
Stein, N. D., Ozdaglar, A., & Parrilo, P. A. (2008). Separable and low-rank continuous games. International Journal of Game Theory, 37(4), 475–504. https://doi.org/10.1007/s00182-008-0129-2.
Varian, H. R. (2014). Chapter 3—preferences. In Intermediate microeconomics: A modern approach: Ninth Edition. W. W. Norton & Company
Voorneveld, M., Grahn, S., & Dufwenberg, M. (2000). Ideal equilibria in noncooperative multicriteria games. Mathematical Methods of Operations Research, 52(1), 65–77. https://doi.org/10.1007/s001860000069.
Wierzbicki, A. P. (1995). Multiple criteria games—Theory and applications. Journal of Systems Engineering and Electronics, 6(2), 65–81.
Zapata, A., Mármol, A. M., Monroy, L., & Caraballo, M. A. (2019). A maxmin approach for the equilibria of vector-valued games. Group Decision and Negotiation, 28(2), 415–432. https://doi.org/10.1007/s10726-018-9608-4.
Zeleny, M. (1975). Games with multiple payoffs. International Journal of Game Theory, 4(4), 179–191. https://doi.org/10.1007/BF01769266.
Zintgraf, L. M., Roijers, D. M., Linders, S., Jonker, C. M., & Nowé, A. (2018). Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making. In Proceedings of the 17th international conference on autonomous agents and multiagent systems, AAMAS ’18 (pp. 1477–1485). International foundation for autonomous agents and multiagent systems, Stockholm, Sweden
Acknowledgements
The first author is supported by the Research Foundation – Flanders (FWO), grant number 1197622N. This research was supported by funding from the Flemish Government under the “Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen” program.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Part of this work was carried out by the first author for his thesis [29] under the supervision of the other authors. Some preliminary results in this article were presented in the Multi-Objective Decision Making Workshop 2021 [30].
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Röpke, W., Roijers, D.M., Nowé, A. et al. On nash equilibria in normal-form games with vectorial payoffs. Auton Agent Multi-Agent Syst 36, 53 (2022). https://doi.org/10.1007/s10458-022-09582-6
Accepted:
Published:
DOI: https://doi.org/10.1007/s10458-022-09582-6