探索基於生成對抗網路之新穎強健性技術
於語音辨識的應用

dc.contributor陳伯琳zh_TW
dc.contributorChen, Berlinen_US
dc.contributor.author楊明璋zh_TW
dc.contributor.authorYang, Ming-Jhangen_US
dc.date.accessioned2020-12-14T09:07:55Z
dc.date.available2019-08-17
dc.date.available2020-12-14T09:07:55Z
dc.date.issued2019
dc.description.abstract近年深度學習技術在許多領域有重大突破,在各種實際應用中也大放異彩,於自動語音辨識的應用中也一樣有優秀表現。雖然主流語音辨識系統在某些指標性任務上已經可達到和人類聽覺相當的辨識效果,然而它們卻不像人類一樣對於環境干擾具有強健性,也就是說儘管語音辨識系統有了大幅度的改進,「噪聲」仍舊一定程度的干擾語音辨識之準確度。諸如:背景人聲,火車,公車站牌,汽車噪音,餐館背景雜音…以上皆為常見的環境噪聲干擾。所以強健性技術的研究在當今語音辨識系統發展中扮演著重要角色。有鑑於此,本論文著手研究在語音特徵向量序列之調變頻譜上基於生成對抗網路之有效的增益方法。並在Aurora4語料庫上進行一系列實驗顯示本研究使用的方法可以增進語音辨識的效果。zh_TW
dc.description.abstractNowadays deep learning technologies have achieved record-breaking results in a wide array of realistic applications, such as automatic speech recognition (ASR). Even though mainstream ASR systems evaluated on a few benchmark tasks have already reached human-like performance, they, in reality, are not robust to environmental distortions in the manner that humans are. In view of this, this thesis sets out to develop effective enhancement methods, stemming from the so-called generative adversarial networks (GAN), for use in the modulation domain of speech feature vector sequences. A series of experiments conducted on the Aurora-4 database and task seem to demonstrate the utility of our proposed methods.en_US
dc.description.sponsorship資訊工程學系zh_TW
dc.identifierG060547076S
dc.identifier.urihttp://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22G060547076S%22.&
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/111696
dc.language中文
dc.subject自動語音辨識zh_TW
dc.subject強健式語音辨識zh_TW
dc.subject生成對抗網路zh_TW
dc.subject深度學習技術zh_TW
dc.subject特徵強健性技術zh_TW
dc.subject調變頻譜zh_TW
dc.subjectAutomatic Speech Recognitionen_US
dc.subjectRobustnessen_US
dc.subjectGenerative Adversarial Networksen_US
dc.subjectDeep Learningen_US
dc.subjectModulation Spectrumen_US
dc.title探索基於生成對抗網路之新穎強健性技術
於語音辨識的應用zh_TW
dc.titleExploring Generative Adversarial Network Based Robustness Techniques for Automatic Speech Recognitionen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
060547076s01.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format

Collections