doi: 10.4304/jsw.9.3.634-640
Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup
Abstract—This paper proposes statistic learning based Q-learning algorithm for Multi-Agent System, the agent can learn other agents’ action policies through observing and counting the joint action, a concise but useful hypothesis is adopted to denote the optimal policies of other agents, the full joint probability of policies distribution guarantees the optimal action choice to the learning agent. The algorithm can also improve the learning speed because the conventional Q-learning space is cut from exponential one to linear one. The convergence of the algorithm has been proved; the successful application of this algorithm in the RoboCup shows its good learning performance.
Index Terms—Q-learning, Statistics, Multi-agent, RoboCup
Cite: Ya Xie, Zhonghua Huang, "Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup," Journal of Software vol. 9, no. 3, pp. 634-640, 2014.
General Information
ISSN: 1796-217X (Online)
Abbreviated Title: J. Softw.
Frequency: Biannually
APC: 500USD
DOI: 10.17706/JSW
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Cecilia Xie
Abstracting/ Indexing: DBLP, EBSCO,
CNKI, Google Scholar, ProQuest,
INSPEC(IET), ULRICH's Periodicals
Directory, WorldCat, etcE-mail: jsweditorialoffice@gmail.com
-
Mar 07, 2025 News!
Vol 19, No 4 has been published with online version [Click]
-
Mar 07, 2025 News!
JSW had implemented online submission system [Click]
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Oct 22, 2024 News!
Vol 19, No 3 has been published with online version [Click]