Feasibility Study on Replacing Large Board Model with Small Board Model in Dots and Boxes

劉怡汎; Liu, Yi-Fan

dc.contributor	林順喜	zh_TW
dc.contributor	Lin, Shun-Shii	en_US
dc.contributor.author	劉怡汎	zh_TW
dc.contributor.author	Liu, Yi-Fan	en_US
dc.date.accessioned	2024-12-17T03:37:29Z
dc.date.available	2024-07-11
dc.date.issued	2024
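dc.description.abstract	Dots and Boxes is a zero-sum, perfect-information, impartial two-player game that has relatively high complexity despite its small board. This thesis studies Dots and Boxes on the 3×3 board and uses a trained small-board AlphaZero neural network model in place of a large-board AlphaZero neural network model. For the implementation, we adopt AlphaZero General, an open-source framework based on the AlphaGo Zero paper. This easy-to-understand Python project lets users implement games and train neural networks on the AlphaGo Zero architecture without the cost of developing from scratch, leaving more attention for other research. The experimental results show that, with 1, 2, and 3 days of neural network training, the 3×3-board AlphaZero General agent, which merges policies by averaging, achieves win rates of 64%, 58%, and 57%, respectively, against a 4×4-board AlphaZero General agent trained for the same amount of time. Therefore, when training time is limited to 3 days, a trained small-board AlphaZero neural network model can be used in place of a large-board AlphaZero neural network model.	zh_TW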
dc.description.abstract	Dots and Boxes is a zero-sum, perfect-information, impartial two-player game. Despite its small board size, it exhibits high game complexity. This study focuses on the 3×3 board and uses an AlphaZero neural network model trained on the small board in place of a model trained on the larger board. For implementation, we used the open-source AlphaZero General framework, which is based on the AlphaGo Zero paper. This Python-based framework makes it straightforward to implement games and train neural networks on the AlphaGo Zero architecture. By building on this existing framework, we reduced development costs and could allocate resources to other research areas. Experimental results show that, for training durations of 1 day, 2 days, and 3 days, the 3×3-board AlphaZero General agent, which merges policies by averaging, outperforms the 4×4-board AlphaZero General agent trained for the same duration, with win rates of 64%, 58%, and 57%, respectively. Therefore, when training time is limited to 3 days, a trained small-board AlphaZero neural network model can be used in place of the large-board model.	en_US
dc.description.sponsorship	Department of Computer Science and Information Engineering	zh_TW
dc.identifier	61147065S-45393
dc.identifier.uri	https://etds.lib.ntnu.edu.tw/thesis/detail/46a815b2a84d884d797ffe0b4ee8fbf5/
dc.identifier.uri	http://rportal.lib.ntnu.edu.tw/handle/20.500.12235/123733
dc.language	Chinese
dc.subject	點格棋	zh_TW
dc.subject	AlphaGo Zero	zh_TW
dc.subject	AlphaZero	zh_TW
dc.subject	AlphaZero General	zh_TW
dc.subject	Dots and Boxes	en_US
dc.subject	AlphaGo Zero	en_US
dc.subject	AlphaZero	en_US
dc.subject	AlphaZero General	en_US
dc.title	點格棋中小盤面模型取代大盤面模型訓練之可行性研究	zh_TW
dc.title	Feasibility Study on Replacing Large Board Model with Small Board Model in Dots and Boxes	en_US
dc.type	Academic thesis

Files

Original bundle

Name: 202400045393-107721.pdf
Size: 36.93 MB
Format: Adobe Portable Document Format
Description: Academic thesis

Collections

Theses and Dissertations