自動擷取英文搭配語及中英文例句:雙語辭典編纂學的計算工具

dc.contributor.author: 高照明 [zh_tw]
dc.contributor.author: Zhao-Ming Gao [en_US]
dc.date.accessioned: 2016-05-10T01:37:47Z
dc.date.available: 2016-05-10T01:37:47Z
dc.date.issued: 2014-05-??
dc.description.abstract: 本文描述英中雙語搭配語自動編纂線上系統EXEC 的設計流程。EXEC 由一千三百萬英文詞及二千七百萬中文字的中英雙語平行語料庫建立而成,結合英語搭配語檢索和中英雙語檢索功能。EXEC 利用統計以及具有依存關係的英文句法剖析器擷取英文搭配語。使用者在查詢時輸入關鍵詞和關鍵詞的詞性以及所搜尋的搭配語的詞性,程式依據英文句法剖析器的依存關係和mutual information、t-score、loglikelihood ratio 等統計訊息自動擷取可能的英文搭配語,並連結包含英文搭配語的英文例句及中文翻譯。實驗顯示EXEC 在擷取的正確率和辭典的涵蓋率都超過80%且可以很有效率地自動從平行語料擷取英文搭配語、例句、及中文翻譯。 [zh_tw]
dc.description.abstract: This paper describes the procedures involved in developing EXEC, a web-based system which can automatically extract English collocations and their Chinese-English bilingual examples from parallel corpora. The system draws on statistics, dependency parsing, and Chinese-English parallel corpora of more than 13 million English words and 27 million Chinese characters. By taking a word as well as the parts-of-speech of the word and its collocate as input, the system can automatically generate collocation candidates based on syntactic dependency relations as well as statistical information regarding mutual information, t-scores, and log likelihood ratios. In conjunction with a Chinese-English bilingual concordancer, it can further extract English sentences containing identified collocations along with their Chinese translations. Our evaluations suggest that the proposed system performs reasonably well in terms of accuracy and efficiency. EXEC can be used in facilitating automatic compilation of bilingual collocation dictionaries as well as in overcoming the L2 language barrier for Chinese learners of English. [en_US]
dc.identifier: BE3A12CE-3804-9001-0A4B-2436A2449EB9
dc.identifier.uri: http://rportal.lib.ntnu.edu.tw/handle/20.500.12235/78711
dc.language: 英文
dc.publisher: 英語學系 [zh_tw]
dc.publisher: Department of English, NTNU [en_US]
dc.relation: 40(1),95-121
dc.relation.ispartof: 同心圓:語言學研究 [zh_tw]
dc.subject.other: 搭配語 [zh_tw]
dc.subject.other: 依存關係 [zh_tw]
dc.subject.other: 計算辭典編纂學 [zh_tw]
dc.subject.other: 雙語平行語料庫 [zh_tw]
dc.subject.other: mutual information [zh_tw]
dc.subject.other: t-score [zh_tw]
dc.subject.other: log likelihood ratio [zh_tw]
dc.subject.other: collocation [en_US]
dc.subject.other: dependency relation [en_US]
dc.subject.other: computational lexicography [en_US]
dc.subject.other: parallel corpora [en_US]
dc.subject.other: mutual information [en_US]
dc.subject.other: t-score [en_US]
dc.subject.other: log likelihood ratio [en_US]
dc.title: 自動擷取英文搭配語及中英文例句:雙語辭典編纂學的計算工具 [zh-tw]
dc.title.alternative: Automatic Extraction of English Collocations and their Chinese-English Bilingual Examples: A Computational Tool for Bilingual Lexicography [zh_tw]
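
The abstract names three association measures (mutual information, t-score, and log likelihood ratio) used to rank collocation candidates. The record does not give the exact formulas or thresholds EXEC uses, so the following is only a minimal sketch of the standard textbook definitions computed from co-occurrence counts. The function name `association_scores` and its parameters are hypothetical and chosen for illustration; they are not taken from the paper.

```python
import math


def association_scores(pair_count, word_count, collocate_count, total):
    """Return (mutual information, t-score, log likelihood ratio).

    Hypothetical helper, not EXEC's actual implementation.
    pair_count      -- times the word and the collocate co-occur (must be > 0)
    word_count      -- times the word occurs
    collocate_count -- times the collocate occurs
    total           -- total number of observations in the corpus sample
    """
    # Co-occurrences expected if the two words were independent.
    expected = word_count * collocate_count / total

    # Pointwise mutual information: log2 of observed over expected.
    mi = math.log2(pair_count / expected)

    # t-score: difference from expectation, scaled by sqrt of the observed count.
    t_score = (pair_count - expected) / math.sqrt(pair_count)

    # Log likelihood ratio over the 2x2 contingency table.
    observed = [
        pair_count,                                           # word and collocate
        word_count - pair_count,                              # word without collocate
        collocate_count - pair_count,                         # collocate without word
        total - word_count - collocate_count + pair_count,    # neither
    ]
    rows = [word_count, total - word_count]
    cols = [collocate_count, total - collocate_count]
    expected_cells = [
        rows[0] * cols[0] / total,
        rows[0] * cols[1] / total,
        rows[1] * cols[0] / total,
        rows[1] * cols[1] / total,
    ]
    # Cells with a zero observed count contribute nothing to the sum.
    llr = 2 * sum(
        o * math.log(o / e) for o, e in zip(observed, expected_cells) if o > 0
    )
    return mi, t_score, llr


if __name__ == "__main__":
    # Illustrative counts only, not data from the paper.
    print(association_scores(50, 100, 100, 10_000))
```

In a pipeline like the one the abstract describes, such scores would presumably be computed only for word pairs that a dependency parser has already linked by a syntactic relation, and candidates would then be ranked or filtered by score. How EXEC combines the three measures is not stated in this record.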
