探索基於生成對抗網路之新穎強健性技術 於語音辨識的應用

楊明璋; Yang, Ming-Jhang

探索基於生成對抗網路之新穎強健性技術 於語音辨識的應用

dc.contributor	陳伯琳	zh_TW
dc.contributor	Chen, Berlin	en_US
dc.contributor.author	楊明璋	zh_TW
dc.contributor.author	Yang, Ming-Jhang	en_US
dc.date.accessioned	2020-12-14T09:07:55Z
dc.date.available	2019-08-17
dc.date.available	2020-12-14T09:07:55Z
dc.date.issued	2019
dc.description.abstract	近年深度學習技術在許多領域有重大突破，在各種實際應用中也大放異彩，於自動語音辨識的應用中也一樣有優秀表現。雖然主流語音辨識系統在某些指標性任務上已經可達到和人類聽覺相當的辨識效果，然而它們卻不像人類一樣對於環境干擾具有強健性，也就是說儘管語音辨識系統有了大幅度的改進，「噪聲」仍舊一定程度的干擾語音辨識之準確度。諸如:背景人聲，火車，公車站牌，汽車噪音，餐館背景雜音…以上皆為常見的環境噪聲干擾。所以強健性技術的研究在當今語音辨識系統發展中扮演著重要角色。有鑑於此，本論文著手研究在語音特徵向量序列之調變頻譜上基於生成對抗網路之有效的增益方法。並在Aurora4語料庫上進行一系列實驗顯示本研究使用的方法可以增進語音辨識的效果。	zh_TW
dc.description.abstract	Nowadays deep learning technologies have achieved record-breaking results in a wide array of realistic applications, such as automatic speech recognition (ASR). Even though mainstream ASR systems evaluated on a few benchmark tasks have already reached human-like performance, they, in reality, are not robust to environmental distortions in the manner that humans are. In view of this, this thesis sets out to develop effective enhancement methods, stemming from the so-called generative adversarial networks (GAN), for use in the modulation domain of speech feature vector sequences. A series of experiments conducted on the Aurora-4 database and task seem to demonstrate the utility of our proposed methods.	en_US
dc.description.sponsorship	資訊工程學系	zh_TW
dc.identifier	G060547076S
dc.identifier.uri	http://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22G060547076S%22.&
dc.identifier.uri	http://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/111696
dc.language	中文
dc.subject	自動語音辨識	zh_TW
dc.subject	強健式語音辨識	zh_TW
dc.subject	生成對抗網路	zh_TW
dc.subject	深度學習技術	zh_TW
dc.subject	特徵強健性技術	zh_TW
dc.subject	調變頻譜	zh_TW
dc.subject	Automatic Speech Recognition	en_US
dc.subject	Robustness	en_US
dc.subject	Generative Adversarial Networks	en_US
dc.subject	Deep Learning	en_US
dc.subject	Modulation Spectrum	en_US
dc.title	探索基於生成對抗網路之新穎強健性技術 於語音辨識的應用	zh_TW
dc.title	Exploring Generative Adversarial Network Based Robustness Techniques for Automatic Speech Recognition	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 060547076s01.pdf
Size:: 2.23 MB
Format:: Adobe Portable Document Format

Download

Collections

學位論文

探索基於生成對抗網路之新穎強健性技術 於語音辨識的應用

Files

Original bundle

Collections

探索基於生成對抗網路之新穎強健性技術 於語音辨識的應用