QUICK REVIEW

[論文レビュー] TOLEBI: Learning Fault-Tolerant Bipedal Locomotion via Online Status Estimation and Fallibility Rewards

Hokyun Lee, Woo-Jeong Baek|arXiv (Cornell University)|Feb 5, 2026

Robotic Locomotion and Control被引用数 0

ひとこと要約

TOLEBIは、オンラインの状態推定と破損性ベースの報酬により関節故障と外乱を扱う、二足歩行のフォールタレント学習フレームワークを提案し、シムから実機への転送を行う humanoid ロボットへの適用を示す。

ABSTRACT

With the growing employment of learning algorithms in robotic applications, research on reinforcement learning for bipedal locomotion has become a central topic for humanoid robotics. While recently published contributions achieve high success rates in locomotion tasks, scarce attention has been devoted to the development of methods that enable to handle hardware faults that may occur during the locomotion process. However, in real-world settings, environmental disturbances or sudden occurrences of hardware faults might yield severe consequences. To address these issues, this paper presents TOLEBI (A faulT-tOlerant Learning framEwork for Bipedal locomotIon) that handles faults on the robot during operation. Specifically, joint locking, power loss and external disturbances are injected in simulation to learn fault-tolerant locomotion strategies. In addition to transferring the learned policy to the real robot via sim-to-real transfer, an online joint status module incorporated. This module enables to classify joint conditions by referring to the actual observations at runtime under real-world conditions. The validation experiments conducted both in real-world and simulation with the humanoid robot TOCABI highlight the applicability of the proposed approach. To our knowledge, this manuscript provides the first learning-based fault-tolerant framework for bipedal locomotion, thereby fostering the development of efficient learning methods in this field.

研究の動機と目的

ハードウェア故障と外乱下で実世界設定における堅牢な二足歩行の動機付け。
関節故障、電力喪失、外部外乱を許容できる学習フレームワークの開発。
ランタイムで方針を適応させるための関節状態のオンライン分類の実装。
フォールタレントな locomotion 方針のシム-ツー-リアル転送を実 humanoid ロボットへ実証。

提案手法

シミュレーション内で関節のロック、電力喪失、外部外乱などの故障を注入してフォールタレント戦略を学習。
オンラインの関節状態モジュールを組み込み、ランタイム観察から関節状態を分類。
フォールタレントベースの報酬で強化学習方針を訓練し、故障耐性を促進。
学習したポリシーをシミュレーションから実機 humanoid ロボットへシム-ツー-リアル手法で転送。
TOCABI humanoid を用いたシミュレーションと実機の両方でアプローチを検証。

実験結果

リサーチクエスチョン

RQ1関節故障と外乱下で堅牢な二足歩行を達成できる学習フレームワークは作れるか。
RQ2故障時のロボット歩行性能を維持するためのオンライン関節状態推定はどれほど有効か。
RQ3破損性ベースの報酬構造はベースライン手法と比べてフォールタレント挙動を改善するか。
RQ4学習済みポリシーはシムから実機 TOCABI へどれだけうまく転送できるか。

主な発見

TOLEBIはシミュレートされた関節故障と外乱の下でフォールタレントな locomotion 戦略を可能にする。
オンラインの関節状態モジュールは実世界条件下での関節状態のランタイム分類をサポートする。
本手法は TOCABI humanoid へのシム-ツー-リアル転送を実証する。
本稿は二足歩行の学習ベースのフォールタレントフレームワークを初めて提供すると主張している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。