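Adjustment of Evaluation Functions Based on Relation Between Static Values and Win Ratios - Evaluation of Safety Difference Between Both Kings in Shogi -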

竹内, 聖悟; 林, 芳樹; 金子, 知適; 川合, 慧; Takeuchi, Shogo; Hayashi, Yoshiki; Kaneko, Tomoyuki; Kawai, Satoru

https://ipsj.ixsq.nii.ac.jp/records/97624

Name / File	License
IPSJ-GPWS2006008.pdf (150.3 kB)	Copyright (c) 2006 by the Information Processing Society of Japan
Open access

Item type

Symposium(1)

Publication date

2006-11-10

Title

勝率と評価値の歪みに基づく評価関数調整法―将棋における進行度差の評価―

Title (English)

Adjustment of Evaluation Functions Based on Relation Between Static Values and Win Ratios - Evaluation of Safety Difference Between Both Kings in Shogi -

Language

jpn

Resource type

conference paper

Resource type identifier

http://purl.org/coar/resource_type/c_5794

Author affiliations

Department of General Systems Studies, Graduate School of Arts and Sciences, The University of Tokyo (東京大学大学院総合文化研究科)

Google Japan Inc. (グーグル株式会社)

Department of General Systems Studies, Graduate School of Arts and Sciences, The University of Tokyo (東京大学大学院総合文化研究科)

Department of General Systems Studies, Graduate School of Arts and Sciences, The University of Tokyo (東京大学大学院総合文化研究科)

Author names

竹内, 聖悟; 林, 芳樹; 金子, 知適; 川合, 慧

Author names (English)

Takeuchi, Shogo; Hayashi, Yoshiki; Kaneko, Tomoyuki; Kawai, Satoru

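Abstract

Description type

Other

Description

This paper proposes a method for adjusting evaluation functions based on distortions in the relationship between win ratios and evaluation values, and demonstrates its effectiveness using shogi as an example. Adjusting the evaluation function is essential to building a strong program, but finding where the problems lie and assigning appropriate values requires knowledge of the game and is often difficult. This study focuses on the observation that, in positions where the evaluation function has a problem, it fails to assess the ease of winning properly, so the relationship between win ratio and evaluation value becomes distorted. We propose finding problems in an evaluation function by plotting win ratio against evaluation value separately for each condition. We applied the method to shogi positions in which the progress (進行度) of the first player and that of the second player differ, and showed that an evaluation function that does not evaluate progress for each player has a problem. To resolve this problem, we designed an evaluation function that includes the progress difference and tuned its values automatically. Self-play confirmed that the tuned program plays more strongly, which demonstrates the effectiveness of the method.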

Abstract (English)

Description type

Other

Description

In this paper, we present a new method for adjusting evaluation functions based on the relation between static values and win ratios, and show its effectiveness. Accurate evaluation functions are important for strong game programs; however, it is not easy to find problems in existing evaluation functions. An incorrect evaluation function gives incorrect predictions of the win ratio for states that satisfy a certain condition. Therefore, we focus on the relation between evaluation values and win ratios, and propose plotting this relation in a graph for each set of states. If an evaluation function works poorly for states under a certain condition, the line for those states will lie apart from the lines for other states. We applied this method to shogi and showed that conventional evaluation functions work poorly for states where the difference in king safety between the two players is large. We then constructed a new evaluation function that takes this difference into account and automatically adjusted its weights. Self-play confirmed a significant improvement in playing strength, showing the effectiveness of the method.
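The diagnostic described in the abstract (group positions by a condition, then compare win ratio against evaluation value for each group) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the bin width, the `progress_diff` feature name, the threshold of 2, and the helper names `win_ratio_curve`, `curves_by_condition`, and `max_gap` are all hypothetical, and the sample positions are made-up toy data.

```python
from collections import defaultdict


def win_ratio_curve(samples, bin_width=200):
    """Return {bin lower edge: win ratio} for (eval_value, won) samples.

    `won` is 1 if the side the evaluation is measured for went on to win
    the game, else 0. The bin width is an arbitrary, assumed choice.
    """
    wins = defaultdict(int)
    games = defaultdict(int)
    for value, won in samples:
        b = int(value // bin_width)  # floor division also handles negative values
        games[b] += 1
        wins[b] += won
    return {b * bin_width: wins[b] / games[b] for b in sorted(games)}


def curves_by_condition(positions, condition, bin_width=200):
    """Split positions by `condition(features)` and build one curve per group."""
    groups = defaultdict(list)
    for eval_value, won, features in positions:
        groups[condition(features)].append((eval_value, won))
    return {name: win_ratio_curve(s, bin_width) for name, s in groups.items()}


def max_gap(curve_a, curve_b):
    """Largest win-ratio difference over the bins that both curves share."""
    shared = curve_a.keys() & curve_b.keys()
    return max((abs(curve_a[b] - curve_b[b]) for b in shared), default=0.0)


# Toy data (hypothetical): (evaluation value, game result, position features).
positions = [
    (100, 1, {"progress_diff": 0}),
    (150, 0, {"progress_diff": 0}),
    (-50, 0, {"progress_diff": 0}),
    (120, 1, {"progress_diff": 3}),
    (180, 1, {"progress_diff": 3}),
]
curves = curves_by_condition(
    positions,
    lambda f: "large_diff" if abs(f["progress_diff"]) >= 2 else "small_diff",
)
print(curves)
print(max_gap(curves["small_diff"], curves["large_diff"]))
```

If the evaluation function captured winning chances equally well under every condition, the curves for the groups would roughly coincide; a large gap at the same evaluation value suggests that the condition carries information the evaluation function is missing. In practice each bin would need many games for the ratios to be meaningful.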

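Bibliographic information

Proceedings of the Game Programming Workshop 2006 (ゲームプログラミングワークショップ2006論文集)

Vol. 2006, pp. 56-63, issued 2006-11-10

Publisher

Information Processing Society of Japan (情報処理学会)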

Versions

Ver.1

2025-01-21 12:47:32.152111
