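Acquisition of Adaptive Movement Strategy by Reinforcement Learning in Modular Multiple-leg Mobile Robot (強化学習を用いたモジュール型多脚ロボットにおける適応的移動法獲得)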

Kodai Shimbori (新堀 航大); Kazuyuki Hyodo (兵頭 和幸); Kyosuke Sunayama (砂山 享祐); Sadayoshi Mikami (三上 貞芳)

https://ipsj.ixsq.nii.ac.jp/records/9257

Name / File	License
IPSJ-JNL5003025.pdf (2.1 MB)	Copyright (c) 2009 by the Information Processing Society of Japan
Open access

Item type

Journal(1)

Publication date

2009-03-15

Title

強化学習を用いたモジュール型多脚ロボットにおける適応的移動法獲得

Title (English)

Acquisition of Adaptive Movement Strategy by Reinforcement Learning in Modular Multiple-leg Mobile Robot

Language

jpn

Keyword

Subject scheme

Other

Subject

Regular paper

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_6501

Resource type

journal article

Other titles

Other title

Knowledge processing

Author affiliation

Graduate School of Future University-Hakodate (公立はこだて未来大学大学院)

Author affiliation

Graduate School of Future University-Hakodate (公立はこだて未来大学大学院)

Author affiliation

Toshiba Solutions Corporation (東芝ソリューション株式会社)

Author affiliation

Future University-Hakodate (公立はこだて未来大学)

Author names

Kodai Shimbori (新堀 航大); Kazuyuki Hyodo (兵頭 和幸); Kyosuke Sunayama (砂山 享祐); Sadayoshi Mikami (三上 貞芳)

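Abstract

Description type

Other

Description

Research on using robots in unknown environments, for example in exploration tasks, is advancing, but issues such as a robot becoming unable to move because one of its parts is damaged still leave room for study. This paper assumes a system in which a robot that is damaged and difficult to retrieve reacquires a way of moving by using the actuators that remain usable, and uses reinforcement learning to acquire movement methods that can adapt, to some extent, to unanticipated situations. The proposed method presupposes a system in which actuator modules can, to some extent, be swapped for shapes that were not anticipated, for example from a leg shape to a wheel shape. Under this premise new movement procedures must be searched for broadly, but to restore the robot's mobility quickly, the emphasis must be on finding and exploiting useful actions as fast as possible. To this end, this study introduces into the reinforcement learning method a re-exploration technique called "growth of action-value", which is based on temporal reliability. Experiments with a six-legged mobile robot in a 3D physics simulator show that the proposed method acquires good candidate movement methods relatively quickly.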

Abstract (English)

Description type

Other

Description

Intensive studies are currently under way on robots that can work in unknown environments. However, methods for recovering from situations in which part of a robot has broken have received less attention. This study proposes a configuration for a mobile robot system that can acquire a new way of moving when some of its actuators are broken and replaced by alternatives whose configuration may differ from the originals. In particular, the proposed method is designed to handle replacement of a leg-type actuator with a wheel-type actuator, a change that may not have been considered at design time. The method is based on reinforcement learning, modified so that it converges rapidly over a wide search space. To this end, a “growth of action-value” method is proposed, which enables effective exploration of the action space based on the temporal reliability of each action-value. In a series of 3D simulation-based experiments, the proposed method converges rapidly to good candidate movement patterns.
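The abstract names the “growth of action-value” idea but this record does not give its definition. The following is only a minimal sketch of one possible reading, not the authors' algorithm: each action-value is treated as less reliable the longer it has gone without an update, so its value "grows" with elapsed time until the action is tried again. This is similar in spirit to elapsed-time exploration bonuses such as the one used in Dyna-Q+. The class `GrowthAgent`, the parameters `alpha` and `growth`, the single-state setting, and the reward numbers are all hypothetical choices made for illustration.

```python
class GrowthAgent:
    """Single-state action-value learner with a time-based growth term (illustrative only)."""

    def __init__(self, n_actions, alpha=0.5, growth=0.02):
        self.alpha = alpha      # learning rate (assumed value)
        self.growth = growth    # value added per step since the last update (assumed value)
        self.q = [0.0] * n_actions
        self.last_update = [0] * n_actions
        self.t = 0

    def grown_value(self, action):
        # An estimate that has not been refreshed recently is treated as less
        # reliable, so its selection value grows with the elapsed time.
        elapsed = self.t - self.last_update[action]
        return self.q[action] + self.growth * elapsed

    def select(self):
        # Greedy choice over the grown values; ties go to the lowest index.
        return max(range(len(self.q)), key=self.grown_value)

    def update(self, action, reward):
        self.t += 1
        self.q[action] += self.alpha * (reward - self.q[action])
        self.last_update[action] = self.t


def run(steps=200, break_at=100):
    # Hypothetical forward progress per step of three movement patterns,
    # before and after a simulated actuator failure at step `break_at`.
    before = [1.0, 0.2, 0.1]
    after = [0.0, 0.2, 0.6]
    agent = GrowthAgent(n_actions=3)
    choices = []
    for step in range(steps):
        rewards = before if step < break_at else after
        action = agent.select()
        agent.update(action, rewards[action])
        choices.append(action)
    return choices


if __name__ == "__main__":
    choices = run()
    print("pattern 0 share before damage:", choices[:100].count(0) / 100)
    print("pattern 2 share in last 50 steps:", choices[150:].count(2) / 50)
```

In this toy setting the agent mostly uses pattern 0 before the failure and moves to pattern 2 afterwards, because the stale values of untried patterns keep growing until they are re-examined. A larger `growth` re-explores sooner at the cost of more steps spent on poor patterns.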

Bibliographic record ID

Source identifier type

NCID

Source identifier

AN00116647

Bibliographic information

情報処理学会論文誌 (IPSJ Journal)

Vol. 50, No. 3, pp. 1170-1180, issue date 2009-03-15

ISSN

Source identifier type

ISSN

Source identifier

1882-7764
