Master's Thesis
Author of thesis: Bc. Jiří Tomášek
Acad. year: 2025/2026
Supervisor: doc. Ing. Stanislav Věchet, Ph.D.
Reviewer: doc. Ing. Jiří Krejsa, Ph.D.
This thesis investigates the application of reinforcement learning to the control of bipedal robot locomotion. The PAWO robot, a small bipedal platform developed at the Faculty of Mechanical Engineering, Brno University of Technology, is used as the evaluation platform. A custom Proximal Policy Optimization (PPO) algorithm is implemented and used to train control policies in a MuJoCo simulation of the robot. Three approaches to learning bipedal gaits are designed, implemented, and compared: behavioral cloning combined with PPO fine-tuning, reference-free reinforcement learning, and imitation learning with kinematic reference motions generated by an inverse kinematics tool. The reference-free approach is also used to develop a robust standing policy with active balance recovery under external perturbations. The results show that the imitation learning approach produces the most natural and stable gait, supports multiple motions within a single policy framework, and provides a foundation for future extension to additional behaviors and transfer to the physical robot.
bipedal robot, reinforcement learning, proximal policy optimization, imitation learning, locomotion control, PAWO
Date of defence
16.06.2026
Result of the defence
Defended (thesis was successfully defended)
Grading
A
Process of defence
During the defence, the student first presented his master's thesis; the reports were then read, and the student answered the reviewer's questions. The committee members then asked the following questions: Which physics engine do you use? Can you compare the engine you used with Simscape Multibody? The committee rated the defence as excellent.
Language of thesis
English
Faculty
Faculty of Mechanical Engineering
Department
Institute of Solid Mechanics, Mechatronics and Biomechanics
Study programme
Mechatronics (N-MET-P)
Composition of Committee
RNDr. Vladimír Opluštil (chair)
doc. Ing. Robert Grepl, Ph.D. (vice-chair)
doc. Ing. Jiří Krejsa, Ph.D. (member)
doc. Ing. Radoslav Cipín, Ph.D. (member)
Ing. Dalibor Červinka, Ph.D. (member)
Ing. Michal Bastl, Ph.D. (member)
Ing. Peter Zavadinka, Ph.D. (member)
doc. Ing. David Fojtík, Ph.D. (member)
Supervisor’s report: doc. Ing. Stanislav Věchet, Ph.D.
Grade proposed by supervisor: A
Reviewer’s report: doc. Ing. Jiří Krejsa, Ph.D.
Grade proposed by reviewer: A
Responsibility: Mgr. et Mgr. Hana Odstrčilová