内部報酬を自動生成する強化学習による一人用RPGの自動攻略

加納, 由希夫; 鶴岡, 慶雅; Yukio, Kano; Yoshimasa, Tsuruoka

WEKO3

インデックスツリー

RootNode

アイテム

内部報酬を自動生成する強化学習による一人用RPGの自動攻略

https://ipsj.ixsq.nii.ac.jp/records/183861

名前 / ファイル	ライセンス	アクション
IPSJ-GPWS2017034.pdf (732.6 kB)	Copyright (c) 2017 by the Information Processing Society of Japan
オープンアクセス

Item type

Symposium(1)

公開日

2017-11-03

タイトル

内部報酬を自動生成する強化学習による一人用RPGの自動攻略

タイトル

言語

タイトル

Automatic capture of one person RPG by reinforcement learning to automatically generate internal compensation

言語

jpn

キーワード

主題Scheme

Other

主題

ゲームAI

キーワード

主題Scheme

Other

主題

強化学習

キーワード

主題Scheme

Other

主題

内部報酬

キーワード

主題Scheme

Other

主題

RPG

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者所属

東京大学工学部電子情報工学科

著者所属

東京大学大学院情報理工学系研究科電子情報学専攻

著者所属(英)

Department of Information and Communication Engineer-ing, Graduate School of Information Science and Technology, The University of Tokyo

著者所属(英)

Department of Information and Communication Engineer-ing, The University of Tokyo

著者名

加納, 由希夫
鶴岡, 慶雅

著者名(英)

Yukio, Kano
Yoshimasa, Tsuruoka

論文抄録

内容記述タイプ

Other

内容記述

AIが内部報酬を自動生成することによって，外部報酬を利用しない自律的な強化学習を実現することは，報酬設計が困難であるような現実世界の問題に人工知能を応用させる上で非常に重要な課題の一つである．内部報酬を自動で生成する手法の一つにICM(Pathak,2017) があり，A3C(Mnih,2016)の報酬にICMの内部報酬を用いた強化学習は，VizDoomやSuper Mario Bros などのゲームにおいて高い学習成果を示している．本研究では，ゲームの初期状態が毎回変化するという特徴を持つローグライクゲームに対して，ICMの手法を適用して効率的な強化学習を行えるようにすることを目指す．

論文抄録(英)

内容記述タイプ

Other

内容記述

Realization of autonomous reinforcement learning that does not use external compensation by automatically generating internal compensation from AI is extremely important in applying artiﬁcial intelli-gence to real world problems where compensation design is difficult. ICM (Pathak, 2017) is one method to automatically generate internal compensation, reinforcement learning using internal compensation of ICM for remuneration of A3C (Mnih, 2016) is used in games such as VizDoom and Super Mario Bros. It shows high learning outcome. In this research, we aim to enable efficient reinforcement learning by applying ICM method to roguelike games, which features the initial state of the game changing every time.

書誌情報

ゲームプログラミングワークショップ2017論文集

巻 2017, p. 219-225, 発行日 2017-11-03

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-20 03:31:00.759550

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

内部報酬を自動生成する強化学習による一人用RPGの自動攻略

× 加納, 由希夫

× 鶴岡, 慶雅

× Yukio, Kano

× Yoshimasa, Tsuruoka

Versions

Share

Cite as

エクスポート