Stabilization of Biped Robot based on Two mode Q-learning
نویسندگان
چکیده
In this paper, two mode Q-learning, an extension of Q-learning, is used to stabilize the Zero Moment Point (ZMP) of a biped robot in the standing posture. In two mode Q-learning, the experiences of both success and failure of an agent are used for fast convergence. To demonstrate the effectiveness of two mode Q-learning against conventional Q-learning, the property of convergence is investigated through simulation in a grid world. This paper also presents the experimental results of the ZMP compensation in the standing posture of a biped robot. The external force generated from sagittal plane impacts to biped robot and the ZMP compensation scheme, based on two mode Q-learning, is employed. The effectiveness of two mode Q-learning is veri£ed by the use of real experiment.
منابع مشابه
A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking
A simple easy to implement algorithm is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. Fuzzy if-then rules provide a reliable decision maki...
متن کاملReconstructing human push recovery reactions using a three dimensional under-actuated bipedal robot
This paper presents the ability of hybrid zero dynamics (HZD) feedback control method to reproduce human like movements for walking push recovery of an under-actuated 3D biped model. The balance recovery controller is implemented on a three-dimensional under-actuated bipedal model subjected to a push disturbance. The biped robot model is considered as a hybrid system with eight degrees of freed...
متن کاملOn the Desing and Test of a Prototype of Biped Actuated by Shape Memory Alloys
In this paper the design of a biped robot actuated with Shape Memory Alloy (SMA) springs with minimum degrees of freedom is presented. SMA springs are a class of smart materials that are known for their high power to mass and volume ratios. It was shown that utilizing spring type of SMAs have many advantages as large deformation, smooth motion, silent and clean movement compared to ordinary typ...
متن کاملBiped Balance Control by Reinforcement Learning
This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm makes a robot learn to walk without any previous knowledge of dynamics model. This balance control with single support shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the p...
متن کاملStable Gait Planning and Robustness Analysis of a Biped Robot with One Degree of Underactuation
In this paper, stability analysis of walking gaits and robustness analysis are developed for a five-link and four-actuator biped robot. Stability conditions are derived by studying unactuated dynamics and using the Poincaré map associated with periodic walking gaits. A stable gait is designed by an optimization process satisfying physical constraints and stability conditions. Also, considering...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004