遺伝的プログラミングを用いたモンテカルロガイスターのプレイアウト方策生成

竹内, 聖悟; 栃川, 純平; Shogo, Takeuchi; Junpei, Tochikawa

WEKO3

インデックスツリー

RootNode

アイテム

遺伝的プログラミングを用いたモンテカルロガイスターのプレイアウト方策生成

https://doi.org/10.20729/00225268

名前 / ファイル	ライセンス	アクション
IPSJ-JNL6403015.pdf (706.6 kB)	Copyright (c) 2023 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2023-03-15

タイトル

遺伝的プログラミングを用いたモンテカルロガイスターのプレイアウト方策生成

タイトル

言語

タイトル

Generating Playout Policy of Monte Carlo Geister Using Genetic Programming

言語

jpn

キーワード

主題Scheme

Other

主題

[特集:若手研究者] 不完全情報ゲーム，ゲーム木探索，遺伝的プログラミング

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

ID登録

10.20729/00225268

ID登録タイプ

JaLC

著者所属

高知工科大学情報学群

著者所属

株式会社ゼンリン

著者所属(英)

School of Informatics, Kochi University of Technology

著者所属(英)

Zenrin Co. Ltd.

著者名

竹内, 聖悟
栃川, 純平

著者名(英)

Shogo, Takeuchi
Junpei, Tochikawa

論文抄録

内容記述タイプ

Other

内容記述

ガイスターは，相手の駒の色が分からない二人不完全情報ゲームである．ガイスターにはモンテカルロ法ベースのプレイヤがあり，プレイアウト（シミュレーション）にはランダム方策が用いられている．完全情報ゲームのプレイアウト方策としては，ゲームの知識を用いたルールベース方策や機械学習によって生成した方策がランダム方策よりも性能を向上させる．しかし相手の情報を部分的にしか知ることのできない不完全情報ゲームにおいて，その未知の情報に基づいた方策が有効であるかは不明である．また，知識の導入には人間がゲームに習熟している必要があるが，人間のゲーム習熟度によらずにプレイアウト方策を作成できることが望ましい．本研究では，このような問題の解決のため遺伝的プログラミングを用いた方策作成を提案する．ゲームへの習熟が不要となることと適切な適応度を設定できれば方策による性能改善が期待できることが利点としてあげられる．また，用いる知識として未知の情報を含む場合とそうでない場合とで実験を行い，未知の情報の利用が性能改善に貢献するかを確認する．モンテカルロ法ベースのガイスタープレイヤを対象とした実験結果から，提案手法によりランダム方策よりも強い方策が生成できること，未知の情報を用いて性能が高くなることを確認し，提案手法の有効性を示した．

論文抄録(英)

内容記述タイプ

Other

内容記述

Geister is a two-player imperfect information game in which the color of the opponent's pieces is unknown. In Geister, the Monte Carlo player use a random policy in playout. The rule-based policy using human knowledge is considered stronger than the random policy. However, it is unclear whether rule-based policy based on unknown information is effective in imperfect information game, where only partial information about the opponent is available. In addition, it is desirable to be able to create playout independent of human game proficiency, although human players must be proficient in the game to introduce knowledge. In this research, we propose a method to generate playout policy using genetic programming to solve those problems. We conduct the experiments on Monte-Carlo Geister's playout in order to confirm the effectiveness of the proposed method. Experimental results show that the proposed method can generate stronger playout policy than random policy, and that the performance can be improved by using unknown information.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 64, 号 3, p. 708-716, 発行日 2023-03-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

公開者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 12:45:10.473730

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

遺伝的プログラミングを用いたモンテカルロガイスターのプレイアウト方策生成

× 竹内, 聖悟

× 栃川, 純平

× Shogo, Takeuchi

× Junpei, Tochikawa

Versions

Share

Cite as

エクスポート