Clustering analysis of trajectory data: Comparison of mixture of regression models and hierarchical clustering with dynamic time warping

No Thumbnail Available

Date

2019

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

路徑資料為對應著時間的曲線資料,常見於許多領域如氣候、時間序列等。而路徑資料的分群為統計分析中重要的一環,透過分群我們將相似的資料分為一群,藉此我們可以分析各群的性質甚至預測下一個資料屬於的集群。這篇論文中我們使用了兩種分群方法,混合回歸模型(mixture of regression models)和應用動態時間扭曲法的階層分群法(hierarchical clustering with dynamic time warping),透過模擬以及實際資料的分析將之做比較。 在模擬中我們以分群的正確率來比較兩個方法在不同情況下的表現,以及討論了混合回歸模型在不同情況下參數估計的結果。根據模擬結果,兩個方法並沒有絕對的優劣,而是在不同情況下擁有各自的優勢。最後則是將這兩個方法分別應用在實際資料的分析上。
The clustering of trajectory data is an important part of statistical analysis. Trajectory data is curve data corresponding to time. Through clustering, we divide similar curves into groups, so that we can analyze the properties of each group. Two methods are studied: one is model-based clustering, mixture of regression models, and the other is hierarchical clustering with dynamic time warping. These two methods are compared by simulation study. In the simulation, we discuss the results of the parameter estimation of the mixture of regression models, and compare the performance of the two methods in different situations by the correct clustering rate. According to the simulation results, the two methods have their own advantages in different situations. Additionally, the two clustering methods are applied to a practical data.

Description

Keywords

混合回歸模型, 階層式分群法, 動態時間扭曲法, Mixture of regression models, Hierarchical clustering, Dynamic time warping

Citation

Collections