doi: 10.4304/jsw.9.3.634-640
Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup
Abstract—This paper proposes statistic learning based Q-learning algorithm for Multi-Agent System, the agent can learn other agents’ action policies through observing and counting the joint action, a concise but useful hypothesis is adopted to denote the optimal policies of other agents, the full joint probability of policies distribution guarantees the optimal action choice to the learning agent. The algorithm can also improve the learning speed because the conventional Q-learning space is cut from exponential one to linear one. The convergence of the algorithm has been proved; the successful application of this algorithm in the RoboCup shows its good learning performance.
Index Terms—Q-learning, Statistics, Multi-agent, RoboCup
Cite: Ya Xie, Zhonghua Huang, "Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup," Journal of Software vol. 9, no. 3, pp. 634-640, 2014.
General Information
ISSN: 1796-217X (Online)
Abbreviated Title: J. Softw.
Frequency: Quarterly
APC: 500USD
DOI: 10.17706/JSW
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Cecilia Xie
Abstracting/ Indexing: DBLP, EBSCO,
CNKI, Google Scholar, ProQuest,
INSPEC(IET), ULRICH's Periodicals
Directory, WorldCat, etcE-mail: jsweditorialoffice@gmail.com
-
Oct 22, 2024 News!
Vol 19, No 3 has been published with online version [Click]
-
Jan 04, 2024 News!
JSW will adopt Article-by-Article Work Flow
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Jun 12, 2024 News!
Vol 19, No 2 has been published with online version [Click]