A Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsets

dc.contributorJia-Ling Kohzh_TW
dc.contributorChung-Wen Chozh_TW
dc.contributorJia-Ling Kohen_US
dc.contributorChung-Wen Choen_US
dc.contributor.author熊薇zh_TW
dc.contributor.authorNonhlanhla Shongween_US
dc.date.accessioned2019-09-05T11:40:57Z
dc.date.available2014-09-03
dc.date.available2019-09-05T11:40:57Z
dc.date.issued2011
dc.description.abstractIn this thesis, we propose a multi-level hierarchical index structure to support efficient similarity search for tagsets. The proposed method is designed based on a previous method which supports similarity search in transaction databases with a two-level bounding mechanism. Similar to the previous method, the tagsets are incrementally grouped into clusters. However, a cluster may have sub-clusters in our approach. The tagsets in a leaf-cluster are grouped into batches. Three different thresholds are used to control the degree of similarity at each level of the index structure. Furthermore, we require the tagsets in the same cluster containing at least one common tag to prevent from grouping unrelated tagsets into a cluster. The experimental results show that the proposed multi-level hierarchical index structure provides better performance on execution time of searching than both the proposed method and the naïve method significantly. Besides, with the assistant of an inverted list of clusters, the execution time of the proposed method for deletion and updating is also much better than the other two methods.zh_TW
dc.description.abstractIn this thesis, we propose a multi-level hierarchical index structure to support efficient similarity search for tagsets. The proposed method is designed based on a previous method which supports similarity search in transaction databases with a two-level bounding mechanism. Similar to the previous method, the tagsets are incrementally grouped into clusters. However, a cluster may have sub-clusters in our approach. The tagsets in a leaf-cluster are grouped into batches. Three different thresholds are used to control the degree of similarity at each level of the index structure. Furthermore, we require the tagsets in the same cluster containing at least one common tag to prevent from grouping unrelated tagsets into a cluster. The experimental results show that the proposed multi-level hierarchical index structure provides better performance on execution time of searching than both the proposed method and the naïve method significantly. Besides, with the assistant of an inverted list of clusters, the execution time of the proposed method for deletion and updating is also much better than the other two methods.en_US
dc.description.sponsorship資訊工程學系zh_TW
dc.identifierGN0698470726
dc.identifier.urihttp://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22GN0698470726%22.&%22.id.&
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/106876
dc.language英文
dc.subjectmulti-level hierarchical index structurezh_TW
dc.subjecttwo-level bounding mechanismzh_TW
dc.subjecttagsetszh_TW
dc.subjectclusterszh_TW
dc.subjectbatcheszh_TW
dc.subjectinverted listzh_TW
dc.subjectmulti-level hierarchical index structureen_US
dc.subjecttwo-level bounding mechanismen_US
dc.subjecttagsetsen_US
dc.subjectclustersen_US
dc.subjectbatchesen_US
dc.subjectinverted listen_US
dc.titleA Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsetszh_TW
dc.titleA Multi-level Hierarchical Index Structure for Supporting Efficient Similarity Search of Tagsetsen_US

Files

Collections