下偏矩近端策略最佳化：提升機器人在平衡板上的穩定性

廖翊承; Liao, Yi-Cheng

下偏矩近端策略最佳化：提升機器人在平衡板上的穩定性

dc.contributor	包傑奇	zh_TW
dc.contributor	Jacky Baltes	en_US
dc.contributor.author	廖翊承	zh_TW
dc.contributor.author	Liao, Yi-Cheng	en_US
dc.date.accessioned	2025-12-09T08:03:03Z
dc.date.available	2025-06-30
dc.date.issued	2025
dc.description.abstract	none	zh_TW
dc.description.abstract	This study proposes an improved version of the Proximal Policy Optimization (PPO) algorithm by incorporating the Lower Partial Moment (LPM) method. The added loss function penalizes low advantage values, aiming to enhance the policy’s robustness against noise and performance. The new LPM-PPO algorithm is compared with leading methods such as SAC, DDPG, TRPO, and RPO across multiple Isaac Gym simulation environments to verify its effectiveness. For the Sim2Real transfer, the research applies the balance board task to a real-world humanoid robot. This process accounts for complex physical factors like friction, inertia, mass distribution, and motor dynamics. To accurately collect observations, the study uses OpenCV for vision-based tracking, forward kinematics for position estimation, and adds noise during training to mimic real-world sensor errors—improving the robot’s real-world adaptability and robustness.	en_US
dc.description.sponsorship	電機工程學系	zh_TW
dc.identifier	61175052H-47262
dc.identifier.uri	https://etds.lib.ntnu.edu.tw/thesis/detail/56bc277e0be6270e6a86aad3a13f69d1/
dc.identifier.uri	http://rportal.lib.ntnu.edu.tw/handle/20.500.12235/125044
dc.language	英文
dc.subject	none	zh_TW
dc.subject	Humanoid Robots	en_US
dc.subject	LPM-PPO	en_US
dc.subject	Reinforcement Learning	en_US
dc.subject	Sim2Real	en_US
dc.subject	Balance Board	en_US
dc.title	下偏矩近端策略最佳化：提升機器人在平衡板上的穩定性	zh_TW
dc.title	Lower Partial Moment Proximal Policy Optimization: Enhancing Robot Stability on Balance Boards	en_US
dc.type	學術論文

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 202500047262-109628.pdf
Size:: 7.04 MB
Format:: Adobe Portable Document Format
Description:: 學術論文

Download

Collections

學位論文