基於AlphaZero作法之國際跳棋程式開發及研究

簡沅亨; Chien, Yuan-Heng

dc.contributor	林順喜	zh_TW
dc.contributor	Lin Shun-Shii	en_US
dc.contributor.author	簡沅亨	zh_TW
dc.contributor.author	Chien, Yuan-Heng	en_US
dc.date.accessioned	2020-10-19T06:59:22Z
dc.date.available	2022-02-14
dc.date.available	2020-10-19T06:59:22Z
dc.date.issued	2020
dc.description.abstract	國際跳棋是由民族跳棋演變而來的。據說在一七二三年，居住在法國的一名波蘭軍官把六十四格的棋盤改為一百格，因此又被稱為「波蘭跳棋」。國際跳棋擁有flying king和連吃的特殊規則，使得下法有趣多變，深受大眾的喜愛。近年來，AlphaZero演算法在多種棋類AI訓練上，都獲得極大的成功。因此，本研究使用AlphaZero的架構來實作國際跳棋的AI。然而，國際跳棋擁有連吃路徑的問題，無法以單次神經網路輸出來完整表達連吃的路徑，所以本研究設計連續走步，藉由神經網路的多次走步輸出來完整描述連吃的路徑。為了提高國際跳棋AlphaZero的訓練效率，本研究使用大贏策略來加速訓練，讓神經網路能夠往大贏的方向去訓練。經過100迭代訓練之後，使用大贏策略訓練的神經網路模型與原始AlphaZero版本訓練的神經網路模型相比，擁有較高的勝率。	zh_TW
dc.description.abstract	International draughts evolved from national draughts variants. It is said that in 1723 a Polish military officer living in France enlarged the board from 64 squares to 100, which is why the game is also called "Polish draughts". Its special rules, the flying king and multi-jump captures, make play interesting and varied, and the game is widely popular. In recent years, the AlphaZero algorithm has achieved great success in training AI for a variety of board games. This research therefore uses the AlphaZero architecture to implement an AI for international draughts. However, a capture in international draughts can chain several jumps, and a single neural network output cannot fully express such a capture path. This study therefore designs continuous moves: the network outputs several successive moves that together describe the complete capture path. To improve the training efficiency of the AlphaZero-based draughts program, we apply a Big-Win strategy to speed up training, steering the neural network toward big wins. After 100 training iterations, the network model trained with the Big-Win strategy achieves a higher win rate than the network model trained with the original AlphaZero version.	en_US
dc.description.sponsorship	資訊工程學系	zh_TW
dc.identifier	G060647069S
dc.identifier.uri	http://etds.lib.ntnu.edu.tw/cgi-bin/gs32/gsweb.cgi?o=dstdcdr&s=id=%22G060647069S%22.&%22.id.&
dc.identifier.uri	http://rportal.lib.ntnu.edu.tw:80/handle/20.500.12235/111717
dc.language	Chinese
dc.subject	電腦對局	zh_TW
dc.subject	國際跳棋	zh_TW
dc.subject	神經網路	zh_TW
dc.subject	深度學習	zh_TW
dc.subject	AlphaZero	zh_TW
dc.subject	computer game	en_US
dc.subject	draughts	en_US
dc.subject	neural network	en_US
dc.subject	deep learning	en_US
dc.subject	AlphaZero	en_US
dc.title	基於AlphaZero作法之國際跳棋程式開發及研究	zh_TW
dc.title	The development and research of a Draughts program based on AlphaZero approach	en_US

Collections

Theses and Dissertations (學位論文)
