柯佳伶 Jia-Ling Koh
梁哲瑋 Che-Wei Liang
2019-09-05
2011-2-8
2019-09-05
2010
http://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22GN0696470368%22.&%22.id.&
http://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/106731
Micro-blogging has become increasingly common in recent years. Users share their interests, feelings, and information with friends through micro-blog posts, and the topics those posts cover usually reflect what the user is interested in. The goal of this study is therefore to discover a user's interests by mining the topics of the articles the user posts. The proposed method first extracts the important terms from a micro-blog user's articles, then uses the Wikipedia category network to look up the categories each term belongs to, and from these infers the user's likely interest categories. For a term that cannot be found in Wikipedia directly, Wikipedia is queried online to find the categories of the term it redirects to. For a non-Wikipedia term, related terms are clustered, and the other terms in the same cluster are used to obtain the corresponding categories. An evaluation method is also proposed to measure the topic concentration degree of a micro-blog user's articles. Experimental results show that the proposed method judges the topic concentration degree of micro-blog users with high precision, and that the interest categories the system assigns automatically are consistent with the categories chosen by the test subjects.
micro-blogging, Wikipedia, text mining
運用維基百科進行個人微網誌內容主題分析
Mining Topic Interests of Users from Micro-blogs based on Wikipedia