下偏矩近端策略最佳化:提升機器人在平衡板上的穩定性

dc.contributor包傑奇zh_TW
dc.contributorJacky Baltesen_US
dc.contributor.author廖翊承zh_TW
dc.contributor.authorLiao, Yi-Chengen_US
dc.date.accessioned2025-12-09T08:03:03Z
dc.date.available2025-06-30
dc.date.issued2025
dc.description.abstractnonezh_TW
dc.description.abstractThis study proposes an improved version of the Proximal Policy Optimization (PPO) algorithm by incorporating the Lower Partial Moment (LPM) method. The added loss function penalizes low advantage values, aiming to enhance the policy’s robustness against noise and performance. The new LPM-PPO algorithm is compared with leading methods such as SAC, DDPG, TRPO, and RPO across multiple Isaac Gym simulation environments to verify its effectiveness. For the Sim2Real transfer, the research applies the balance board task to a real-world humanoid robot. This process accounts for complex physical factors like friction, inertia, mass distribution, and motor dynamics. To accurately collect observations, the study uses OpenCV for vision-based tracking, forward kinematics for position estimation, and adds noise during training to mimic real-world sensor errors—improving the robot’s real-world adaptability and robustness.en_US
dc.description.sponsorship電機工程學系zh_TW
dc.identifier61175052H-47262
dc.identifier.urihttps://etds.lib.ntnu.edu.tw/thesis/detail/56bc277e0be6270e6a86aad3a13f69d1/
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw/handle/20.500.12235/125044
dc.language英文
dc.subjectnonezh_TW
dc.subjectHumanoid Robotsen_US
dc.subjectLPM-PPOen_US
dc.subjectReinforcement Learningen_US
dc.subjectSim2Realen_US
dc.subjectBalance Boarden_US
dc.title下偏矩近端策略最佳化:提升機器人在平衡板上的穩定性zh_TW
dc.titleLower Partial Moment Proximal Policy Optimization: Enhancing Robot Stability on Balance Boardsen_US
dc.type學術論文

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
202500047262-109628.pdf
Size:
7.04 MB
Format:
Adobe Portable Document Format
Description:
學術論文

Collections