柯佳伶Jia-Ling Koh戴衣菱Yi-Ling Tai2019-09-052013-8-182019-09-052010http://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22GN0697470066%22.&%22.id.&http://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/106765本論文研究以電子書作為內容來源,針對兩個特定領域專有詞彙的概念解釋句,進行自動擷取以及分群組織整理。為了克服傳統上使用字詞頻率建構特徵向量卻忽略隱含語意關係的缺點,本論文提出計算句子中出現的所有字詞對選取的特徵字詞之語意相似關係,來對句子建立MI特徵向量,進行句子分群。從分群的結果中選定可以代表分群概念的標籤,使用標籤來重新組織概念架構,並且在分群中挑出可以代表兩個專有詞彙的比較句。In this thesis, we use PDF textbook as data resource, focus on comparing the conceptual sentences of two domain-specific terms .We first calculate the mutual information of every word in sentence and selected feature words to build MI vector space model. The vector space model is used to evaluate the similarity of two sentences for the hierarchical clustering algorithm. After clustering, we choose representative labels and comparative sentence pair for every cluster. According representative labels, the clusters which have the same labels will be grouped as a new concept hierarchy.資料探勘資訊檢索句子分群自動摘要Data MiningInformation RetrievalSentence ClusteringAutomatic Summarization多個專有詞彙概念解釋句語意關連自動分析組織之研究Semantic Association Analysis for Organizing Related Sentences of Multiple Domain-Specific Terms