An adaptive actor-critic algorithm with multi-step simulated experiences for controlling nonholonomic mobile robots

Main Author: Syam, Rafiuddin
Format: Article
Bahasa: eng
Terbitan: Springer, Germany, Vol. 11 No. 1 January 2007 , 2012
Subjects:
Online Access: http://repository.unhas.ac.id/handle/123456789/2640
Daftar Isi:
  • In this paper, we propose a new algorithm of an adaptive actor-critic method with multi-step simulated experiences, as a kind of temporal difference (TD) method. In our approach, the TD-error is composed of two valuefunctions and m utility functions, where m denotes the number ofmulti-steps inwhich the experience should be simulated. The value-function is constructed from the critic formulated by a radial basis function neural network (RBFNN), which has a simulated experience as an input, generated from a predictive model based on a kinematic model. Thus, since our approach assumes that the model is available to simulate the m-step experiences and to design a controller, such a kinematic model is also applied to construct the actor and the resultant model based actor (MBA) is also regarded as a network, i.e., it is just viewed as a resolved velocity control network. We implement this approach to control nonholonomic mobile robot, especially in a trajectory tracking control problem for the position coordinates and azimuth. Some simulations show the effectiveness of the proposed method for controlling a mobile robot with two-independent driving wheels.