基於單目視覺之社會群體交互與視角自適應的行人軌跡預測模型

dc.contributor王偉彥zh_TW
dc.contributorWang, Wei-Yenen_US
dc.contributor.author章莉揚zh_TW
dc.contributor.authorChang, Li-Yangen_US
dc.date.accessioned2025-12-09T08:03:01Z
dc.date.available2026-08-01
dc.date.issued2025
dc.description.abstract行人作為智慧型運輸系統中一重要的參與者,其行為影響交通安全系統與自動駕駛控制的反應決策。本研究提供了一種端對端的社會交互模型來達成即時的行人軌跡預測,適用於智慧型運輸系統(Intelligent Transportation Systems , ITS)中影像設備的建置環境。透過BoxMOT框架串接的多物件連續追蹤模組保證了即時運算與模型輕量化的能力,本研究基於此追蹤結果接續建立行人的社會交互特徵並透過深度網路預測未來可能路徑。本研究特別針對現實場域與預設視角不同而產生的多視角挑戰,提出一套視角自適應的視覺策略,能夠重建行人的相對位置關係,旨在簡化實際部署時座標重校正程序,使此套方法在不同相機傾角下仍可維持穩定的資料分析能力。此外,針對行人社會交互特徵的學習策略,本研究進一步改良 Social-STGCNN 架構,導入群體標註資料,強化模型在進行單一行人預測時對群體行為的理解能力。zh_TW
dc.description.abstractPedestrians, as key participants in Intelligent Transportation Systems (ITS), significantly influence the decision-making processes of traffic safety systems and autonomous driving control. This research proposes an end-to-end social interaction model for real-time pedestrian trajectory prediction, tailored for deployment in ITS environments equipped with visual sensors. Leveraging the BoxMOT framework, the integrated multi-object tracking module ensures real-time performance and model efficiency. Building upon these tracking results, the model extracts social interaction features among pedestrians and employs a deep neural network to predict future trajectories.To address the challenge of multi-view discrepancies arising from varying real-world camera angles, we introduce a view-adaptive visual strategy that reconstructs relative pedestrian positions. This approach simplifies coordinate recalibration during deployment and maintains robust analytical performance across diverse camera perspectives. Furthermore, this research enhances the Social-STGCNN architecture by incorporating group annotation data, thereby improving the model’s capacity to understand group behavior during individual trajectory prediction.en_US
dc.description.sponsorship電機工程學系zh_TW
dc.identifier61175014H-48051
dc.identifier.urihttps://etds.lib.ntnu.edu.tw/thesis/detail/e8d7fa9061593d07eb5c971336230697/
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw/handle/20.500.12235/125036
dc.language中文
dc.subject多物件追蹤zh_TW
dc.subject行人軌跡預測zh_TW
dc.subject社會交互建模zh_TW
dc.subject空間映射zh_TW
dc.subjectMulti-Object Trackingen_US
dc.subjectPedestrian Trajectory Predictionen_US
dc.subjectSocial Interaction Modelingen_US
dc.subjectSpatial Mappingen_US
dc.title基於單目視覺之社會群體交互與視角自適應的行人軌跡預測模型zh_TW
dc.titleA Monocular Vision-Based Pedestrian Trajectory Prediction Model with Social Group Interaction and Adaptive Viewpointsen_US
dc.type學術論文

Files

Collections