A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets

熊薇; Nonhlanhla Shongwe

A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets

dc.contributor	Jia-Ling Koh	zh_TW
dc.contributor	Chung-Wen Cho	zh_TW
dc.contributor	Jia-Ling Koh	en_US
dc.contributor	Chung-Wen Cho	en_US
dc.contributor.author	熊薇	zh_TW
dc.contributor.author	Nonhlanhla Shongwe	en_US
dc.date.accessioned	2019-09-05T11:40:57Z
dc.date.available	2014-09-03
dc.date.available	2019-09-05T11:40:57Z
dc.date.issued	2011
dc.description.abstract	In this thesis, we propose a multi-level hierarchical index structure to support efficient similarity search for tagsets. The proposed method is designed based on a previous method which supports similarity search in transaction databases with a two-level bounding mechanism. Similar to the previous method, the tagsets are incrementally grouped into clusters. However, a cluster may have sub-clusters in our approach. The tagsets in a leaf-cluster are grouped into batches. Three different thresholds are used to control the degree of similarity at each level of the index structure. Furthermore, we require the tagsets in the same cluster containing at least one common tag to prevent from grouping unrelated tagsets into a cluster. The experimental results show that the proposed multi-level hierarchical index structure provides better performance on execution time of searching than both the proposed method and the naïve method significantly. Besides, with the assistant of an inverted list of clusters, the execution time of the proposed method for deletion and updating is also much better than the other two methods.	zh_TW
dc.description.abstract	In this thesis, we propose a multi-level hierarchical index structure to support efficient similarity search for tagsets. The proposed method is designed based on a previous method which supports similarity search in transaction databases with a two-level bounding mechanism. Similar to the previous method, the tagsets are incrementally grouped into clusters. However, a cluster may have sub-clusters in our approach. The tagsets in a leaf-cluster are grouped into batches. Three different thresholds are used to control the degree of similarity at each level of the index structure. Furthermore, we require the tagsets in the same cluster containing at least one common tag to prevent from grouping unrelated tagsets into a cluster. The experimental results show that the proposed multi-level hierarchical index structure provides better performance on execution time of searching than both the proposed method and the naïve method significantly. Besides, with the assistant of an inverted list of clusters, the execution time of the proposed method for deletion and updating is also much better than the other two methods.	en_US
dc.description.sponsorship	資訊工程學系	zh_TW
dc.identifier	GN0698470726
dc.identifier.uri	http://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22GN0698470726%22.&%22.id.&
dc.identifier.uri	http://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/106876
dc.language	英文
dc.subject	multi-level hierarchical index structure	zh_TW
dc.subject	two-level bounding mechanism	zh_TW
dc.subject	tagsets	zh_TW
dc.subject	clusters	zh_TW
dc.subject	batches	zh_TW
dc.subject	inverted list	zh_TW
dc.subject	multi-level hierarchical index structure	en_US
dc.subject	two-level bounding mechanism	en_US
dc.subject	tagsets	en_US
dc.subject	clusters	en_US
dc.subject	batches	en_US
dc.subject	inverted list	en_US
dc.title	A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets	zh_TW
dc.title	A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets	en_US

Collections

學位論文

A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets

Files

Collections