{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,1,12]],"date-time":"2024-01-12T00:00:10Z","timestamp":1705017610913},"reference-count":15,"publisher":"World Scientific Pub Co Pte Lt","issue":"01","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2012,2]]},"abstract":"This paper contributes to solve effectively stochastic resource allocation problems in multiagent environments. To address it, a distributed Q-values approach is proposed when the resources are distributed among agents a priori, but the actions made by an agent may influence the reward obtained by at least another agent. This distributed Q-values approach allows to coordinate agents' reward and thus permits to reduce the set of states and actions to consider. On the other hand, when the resources are available to all agents, no distributed Q-values is possible and tight lower and upper bounds are proposed for existing heuristic search algorithms. Our experimental results demonstrate the efficiency of our distributed Q-values in terms of planning time as well as our tight bounds in terms of fast convergence and reduction of backups.","DOI":"10.1142\/s0218213012500030","type":"journal-article","created":{"date-parts":[[2012,1,6]],"date-time":"2012-01-06T00:57:11Z","timestamp":1325811431000},"page":"1250003","source":"Crossref","is-referenced-by-count":4,"title":["STOCHASTIC RESOURCE ALLOCATION IN MULTIAGENT ENVIRONMENTS: AN APPROACH BASED ON DISTRIBUTED Q-VALUES AND BOUNDED REAL-TIME DYNAMIC PROGRAMMING"],"prefix":"10.1142","volume":"21","author":[{"given":"PIERRICK","family":"PLAMONDON","sequence":"first","affiliation":[{"name":"DAMAS Laboratory, Laval University, G1K 7P4, Qu\u00e9bec, Canada"}]},{"given":"BRAHIM","family":"CHAIB-DRAA","sequence":"additional","affiliation":[{"name":"DAMAS Laboratory, Laval University, G1K 7P4, Qu\u00e9bec, Canada"}]}],"member":"219","published-online":{"date-parts":[[2012,4,5]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1002\/9780470182963"},{"key":"rf2","volume-title":"Reinforcement Learning: An Introduction","volume":"116","author":"Sutton R. S.","year":"1998"},{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.2200\/S00268ED1V01Y201005AIM009"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(94)00011-O"},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(01)00106-0"},{"key":"rf10","unstructured":"S.\u00a0Singh and D.\u00a0Cohn, Advances in Neural Information Processing Systems\u00a010 (MIT Press, Cambridge, MA, USA, 1998)\u00a0pp. 1057\u20131063."},{"key":"rf11","volume-title":"Microeconomics","author":"Pindyck R. S.","year":"2000"},{"key":"rf12","volume-title":"Dynamic Programming","author":"Bellman R.","year":"1957"},{"key":"rf13","volume-title":"Dynamic Programming and Markov Processes","author":"Howard R. A.","year":"1960"},{"key":"rf14","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316887"},{"key":"rf16","first-page":"279","volume":"8","author":"Watkins C. J.","journal-title":"Machine learning"},{"key":"rf17","volume-title":"Artificial Intelligence: A Modern Approach","author":"Russell S. J.","year":"2009"},{"key":"rf18","first-page":"100","volume":"4","author":"Raphael B.","journal-title":"IEEE Trans. Syst. Science and Cybernetics"},{"key":"rf19","doi-asserted-by":"publisher","DOI":"10.1145\/3828.3830"},{"key":"rf20","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(90)90054-4"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213012500030","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T16:42:04Z","timestamp":1565109724000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218213012500030"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,2]]},"references-count":15,"journal-issue":{"issue":"01","published-online":{"date-parts":[[2012,4,5]]},"published-print":{"date-parts":[[2012,2]]}},"alternative-id":["10.1142\/S0218213012500030"],"URL":"https:\/\/doi.org\/10.1142\/s0218213012500030","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,2]]}}}