運用鄰近與概念資訊於語言模型調適之研究

dc.contributor陳柏琳zh_TW
dc.contributor.author郝柏翰zh_TW
dc.date.accessioned2019-09-05T11:18:37Z
dc.date.available2014-2-20
dc.date.available2019-09-05T11:18:37Z
dc.date.issued2014
dc.description.abstract本論文研究語言模型調適技術用於中文大詞彙連續語音辨識,其主要貢獻有兩個部分:第一部分探討主題模型(Topic Models)之延伸與改進,除了希望能放寬詞袋假設的限制之外,更藉由融入鄰近資訊(Proximity Information)期望使主題模型有更好的預測效能;第二部分提出概念模型(Concept Language Model, CLM),其主要目的為近似使用者心中所想之概念,並藉此觀察較為相關之用詞;同時,本論文更嘗試以不同方式來估測概念模型。本論文實驗以字錯誤率(Character Error Rate, CER)與語言複雜度(Perplexity)為評估依據;結果顯示本論文所提出方法對辨識效能之提升有明顯的幫助。zh_TW
dc.description.abstractThis thesis investigates and develops language model adaptation techniques for Mandarin large vocabulary continuous speech recognition (LVCSR) and its main contribution is two-fold. First, the so-called “bag-of-words” assumption of conventional topic models is relaxed by additionally incorporating word proximity cues into the model formulation. By doing so, the resulting topic models can achieve better prediction capabilities for use in LVCSR. Second, we propose a novel concept language modeling (CLM) approach to rendering the relationships between a search history and an upcoming word. The instantiations of CLM can be constructed with different levels of lexical granularities, such as words and document clusters. A series of experiments on a LVCSR task demonstrate that our proposed language models can offer substantial improvements over the baseline N-gram system, and achieve performance competitive to, or better than, some state-of-the-art language models.en_US
dc.description.sponsorship資訊工程學系zh_TW
dc.identifierGN060047082S
dc.identifier.urihttp://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22GN060047082S%22.&%22.id.&
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/106586
dc.language中文
dc.subject語音辨識zh_TW
dc.subject語言模型zh_TW
dc.subject鄰近資訊zh_TW
dc.subject概念資訊zh_TW
dc.subjectAutomatic Speech Recognitionen_US
dc.subjectLanguage Modelingen_US
dc.subjectProximity Cuesen_US
dc.subjectConcept Informationen_US
dc.title運用鄰近與概念資訊於語言模型調適之研究zh_TW
dc.titleLeveraging Proximity Cues and Concept Information for Language Model Adaptation in Speech Recognitionen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
n060047082s01.pdf
Size:
1.73 MB
Format:
Adobe Portable Document Format

Collections