This paper explores the use of a learning algorithm in the “guarding a territory” game. The game is played in continuous time: a single learning invader tries to get as close as possible to a territory before being captured by a guard. Previous research has approached the problem by letting only the guard learn. We examine the opposite case, in which only the invader learns. Furthermore, in our case the guard is superior to (faster than) the invader. We also consider models with non-holonomic constraints. A control system is designed and optimized for the invader to play the game and reach the Nash equilibrium. The paper shows how the learning system is able to adapt itself. The system’s performance is evaluated through different simulations and compared to the Nash equilibrium. Experiments with real robots were conducted and verified our simulations in a real-life environment. Our results show that the learning invader behaved rationally in different circumstances.

Additional Metadata
Keywords Adaptive control, Continuous time, Fuzzy Q-learning, Machine intelligence, Nash equilibrium, Non-holonomic, Reinforcement learning
Persistent URL dx.doi.org/10.1007/s10846-015-0317-9
Journal Journal of Intelligent and Robotic Systems: Theory and Applications
Citation
Raslan, H. (Hashem), Schwartz, H.M., & Givigi, S. (Sidney). (2016). A Learning Invader for the “Guarding a Territory” Game: A Reinforcement Learning Problem. Journal of Intelligent and Robotic Systems: Theory and Applications, 83(1), 55–70. doi:10.1007/s10846-015-0317-9