Deep Learning-Based Individual Identification System for Cetaceans

Date

2024

Abstract

This research presents a deep learning system for identifying individual cetaceans. Identifying individuals makes it possible to track migration paths and estimate population sizes, which in turn supports assessing and protecting the health of marine ecosystems. The system must distinguish the biological features of different individuals within the same species, and it must also cope with the variation in image features when the same individual is photographed under different conditions. Because the cetacean dataset suffers from unstable image quality and a highly uneven number of images per individual, the work addresses these issues through data preprocessing, improvements to the model, and testing from several perspectives.

The system runs in three stages: data preprocessing, cetacean detection, and individual identification. Preprocessing consists of data cleaning and data augmentation, which target latent problems in the dataset. In the detection stage, YOLOv5 locates the cetacean in each image and filters out background noise, which speeds up model training. In the identification stage, a backbone model extracts features from the cetacean image and a head model predicts the individual's identity. EfficientNetV1-B4 serves as the backbone, and the head uses the Additive Angular Margin Loss (ArcFace). The head is then modified to address the dataset issues. Adding sub-center vectors to ArcFace handles the variation in image features of the same individual across shooting conditions, which improves identification accuracy. Introducing a dynamic margin handles the highly uneven number of images per individual during training and speeds up convergence.

Experimental results show that the improved sub-center ArcFace reaches mAP scores of 68.63%, 81.60%, and 35.70% in three test scenarios: real-world application, a majority synthetic dataset, and a partial synthetic dataset (individuals with three or more images). These are improvements of 4.83%, 6.08%, and 8.19%, respectively, over the original ArcFace. Applying the dynamic margin to sub-center ArcFace maintains comparable accuracy while reducing training time by 28%. The results indicate that the proposed improvements handle the dataset issues appropriately and raise the accuracy of individual cetacean identification.
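The two head-model changes named in the abstract, sub-centers and a dynamic margin, can be illustrated with a minimal pure-Python sketch. It is not the thesis implementation. The abstract does not give the dynamic-margin formula, so `dynamic_margins` uses a hypothetical rule in which the margin shrinks with the fourth root of the per-individual image count. The function names, the margin range, and the scale value are likewise assumptions chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def dynamic_margins(image_counts, m_min=0.2, m_max=0.5):
    """Hypothetical rule (assumption): individuals with fewer images
    get a larger margin. Raw values count**-0.25 are rescaled to
    the range [m_min, m_max]."""
    raw = [count ** -0.25 for count in image_counts]
    lo, hi = min(raw), max(raw)
    if hi == lo:
        return [m_max for _ in raw]
    return [m_min + (r - lo) / (hi - lo) * (m_max - m_min) for r in raw]


def subcenter_arcface_logits(embedding, sub_centers, label, margins, scale=30.0):
    """Logits for one sample.

    sub_centers[c] is a list of K center vectors for individual c.
    Each class score is the maximum cosine over its K sub-centers, so
    images of one individual taken under different conditions can
    attach to different sub-centers. The angular margin is added only
    to the target class. The usual guard for theta + margin > pi is
    omitted to keep the sketch short.
    """
    logits = []
    for class_id, centers in enumerate(sub_centers):
        cos_theta = max(cosine(embedding, c) for c in centers)
        cos_theta = max(-1.0, min(1.0, cos_theta))  # keep acos in domain
        if class_id == label:
            theta = math.acos(cos_theta)
            cos_theta = math.cos(theta + margins[label])
        logits.append(scale * cos_theta)
    return logits


# Toy example: individual 0 has 1 image, individual 1 has 16 images.
margins = dynamic_margins([1, 16])
centers = [
    [[1.0, 0.0], [0.0, 1.0]],    # two sub-centers for individual 0
    [[-1.0, 0.0], [0.0, -1.0]],  # two sub-centers for individual 1
]
logits = subcenter_arcface_logits([0.0, 2.0], centers, label=0, margins=margins)
```

In training, these logits would be passed to a softmax cross-entropy loss. The margin makes the target class harder to satisfy, and under the assumed rule the rare individual in the toy example receives the larger margin.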

Keywords

Whale, Dolphin, Individual Identification, Deep Learning, Image Retrieval, Additive Angular Margin Loss, Sub-center Additive Angular Margin Loss with Dynamic Margin
