Deep Learning-Based Badminton Motion Analysis System
Date
2024
Abstract
At the 2020 Tokyo Olympics, Taiwan won one gold and one silver medal in badminton, and the number of badminton players in Taiwan has continued to rise since those results. This study therefore proposes a deep learning-based badminton motion analysis system: the user inputs a video of a badminton movement, and the system analyzes whether the movement is performed correctly so that injuries can be avoided. It can also save users the cost of expensive coaching and venue fees.
The system consists of three main parts: data preprocessing, a badminton motion recognition subsystem, and a 3D human model construction and analysis subsystem. Badminton is the fastest racket sport in the world, so moving objects are easily blurred during filming; the data preprocessing stage addresses these blurry images. The recognition subsystem uses the Frame Flexible Network architecture to learn feature maps from different frequencies, and then applies the Temporal Shift Module, which shifts the feature maps of a subset of channels to achieve temporal fusion. The analysis subsystem uses recent 3D human model technology: from its 24 human keypoints, Procrustes analysis is used to output the joints that are prone to injury.
This study also builds a badminton motion dataset, named CVIU Badminton Datasets, containing seven common badminton actions: backhand stroke, forehand stroke, right lift, left lift, low serve, high serve, and defensive action. On this dataset the system reaches a Top-1 accuracy of 91.87% and a class accuracy of 85.71%. Further experiments show that each of the proposed improvements increases performance.
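The temporal fusion step mentioned in the abstract can be illustrated with a minimal NumPy sketch of a temporal shift. This is not the thesis implementation: the function name `temporal_shift`, the `(T, C, H, W)` tensor layout, and the choice of shifting one eighth of the channels in each direction are assumptions made for illustration only.

```python
import numpy as np


def temporal_shift(x, fold_div=8):
    """Shift part of the channels along the time axis (illustrative sketch).

    x        : array of shape (T, C, H, W), features of T consecutive frames.
    fold_div : assumed hyperparameter; C // fold_div channels are shifted
               in each temporal direction, the rest are left untouched.
    """
    channels = x.shape[1]
    fold = channels // fold_div
    out = np.zeros_like(x)

    # First group: each frame receives the features of the NEXT frame.
    out[:-1, :fold] = x[1:, :fold]
    # Second group: each frame receives the features of the PREVIOUS frame.
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]
    # Remaining channels are copied unchanged.
    out[:, 2 * fold:] = x[:, 2 * fold:]
    return out


# Hypothetical input: 4 frames, 8 channels, 1x1 spatial size.
frames = np.arange(4 * 8, dtype=np.float32).reshape(4, 8, 1, 1)
shifted = temporal_shift(frames)
```

Positions that have no neighbouring frame to borrow from (the last frame of the first group, the first frame of the second group) are zero-filled, so the output keeps the input shape and a following per-frame layer sees a mix of past, present, and future features.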
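The abstract states that Procrustes analysis is applied to 24 human keypoints to find joints that are prone to injury, without giving details. The sketch below shows one plausible reading under stated assumptions: a performed pose is aligned to a reference pose by translation, uniform scaling, and rotation, and the joints with the largest remaining distance are reported. The function names, the use of a reference pose, and the per-joint residual as the indicator are all hypothetical and not taken from the thesis.

```python
import numpy as np


def procrustes_align(reference, pose):
    """Align `pose` to `reference` with a similarity transform.

    Both inputs are (J, 3) arrays of 3D joint positions (J = 24 here).
    Returns the normalized reference and the aligned pose.
    """
    # Remove translation by centering both point sets.
    ref_c = reference - reference.mean(axis=0)
    pose_c = pose - pose.mean(axis=0)
    # Remove scale by normalizing to unit Frobenius norm.
    ref_n = ref_c / np.linalg.norm(ref_c)
    pose_n = pose_c / np.linalg.norm(pose_c)

    # Optimal rotation from the SVD of the cross-covariance matrix.
    u, s, vt = np.linalg.svd(pose_n.T @ ref_n)
    # Flip the last axis if needed so the result is a rotation, not a reflection.
    d = np.sign(np.linalg.det(u @ vt))
    correction = np.array([1.0, 1.0, d])
    rotation = u @ np.diag(correction) @ vt
    scale = float((s * correction).sum())

    aligned = scale * (pose_n @ rotation)
    return ref_n, aligned


def joint_deviations(reference, pose):
    """Per-joint Euclidean distance after Procrustes alignment (assumed metric)."""
    ref_n, aligned = procrustes_align(reference, pose)
    return np.linalg.norm(aligned - ref_n, axis=1)


def most_deviating_joints(reference, pose, top_k=3):
    """Indices of the `top_k` joints that differ most from the reference."""
    deviations = joint_deviations(reference, pose)
    return np.argsort(deviations)[::-1][:top_k]
```

Because the alignment removes position, size, and orientation differences, the remaining per-joint distances reflect differences in the pose itself rather than in camera placement or body size.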
Keywords
Badminton, Badminton Motion Recognition, Badminton Motion Analysis, 3D Human Model Analysis, Data Augmentation, Computer Vision