QUICK REVIEW

[論文レビュー] Towards Generalizable Robotic Data Flywheel: High-Dimensional Factorization and Composition

Yuyang Xiao, Yifei Zhou|arXiv (Cornell University)|Mar 26, 2026

Domain Adaptation and Few-Shot Learning被引用数 0

ひとこと要約

Introduces F-ACIL, a factor-aware compositional iterative learning framework that factorizes robotic data into Object, Action, and Environment spaces to enable efficient generalization, achieving over 45% performance gains with 5–10× fewer demonstrations.

ABSTRACT

The lack of sufficiently diverse data, coupled with limited data efficiency, remains a major bottleneck for generalist robotic models, yet systematic strategies for collecting and curating such data are not fully explored. Task diversity arises from implicit factors that are sparsely distributed across multiple dimensions and are difficult to define explicitly. To address this challenge, we propose F-ACIL, a heuristic factor-aware compositional iterative learning framework that enables structured data factorization and promotes compositional generalization. F-ACIL decomposes the data distribution into structured factor spaces such as object, action, and environment. Based on the factorized formulation, we develop a factor-wise data collection and an iterative training paradigm that promotes compositional generalization over the high-dimensional factor space, leading to more effective utilization of real-world robotic demonstrations. With extensive real-world experiments, we show that F-ACIL can achieve more than 45% performance gains with 5-10$\times$ fewer demonstrations comparing to that of which without the strategy. The results suggest that structured factorization offers a practical pathway toward efficient compositional generalization in real-world robotic learning. We believe F-ACIL can inspire more systematic research on building generalizable robotic data flywheel strategies. More demonstrations can be found at: https://f-acil.github.io/

研究の動機と目的

多様でデータ効率の良い現実世界のロボットデータを一般化モデルのために不足させないようにする。
オブジェクト、アクション、環境の高次元変動を捉える因子分解データ表現を提案する。
因子空間全体で構成的一般化を促進するシーケンシャルでデータ効率の良いデータ収集戦略を開発する。
実世界のPick-and-PlaceおよびOpen-and-Closeタスクで経験的な改善を示す。

提案手法

ロボットデータを構造化するために因子分解状態空間 S ≈ O × A × E を定義する。
Object を (texture, geometry, size) と、Action を (x, y, z, roll, pitch, yaw) と、Environment を Macro/Micro で light、color temp、shadow、materials、clutter などの因子とともに分解する。
orbitベースの一般化分析に guided されて、O を拡張してから OA、さらに OAE の順でデータを拡張する因子ベースの構成と反復データ収集パイプラインを導入する。
反復アルゴリズム（Alg. 1 および Alg. 2）を用いて、因子空間の軌道が全空間をカバーする最小データセットを、サンプル数を削減して構築する。
固定されたバックボーンを用いて VLA ポリシーを訓練し、F-ACIL の因子認識サンプリングを介してデータ効率を向上させるデモンストレーションを収集する。

実験結果

リサーチクエスチョン

RQ1F-ACIL は因子空間の全積 O × A × E に対する一般化を実現できるか？
RQ2因子認識的反復学習は、ナイーブな全空間探索と比較してロボットデータのフライホイールをどれくらい加速するか？
RQ3デモンストレーション予算を固定した状態で、構成的一般化を活用してデータ効率を改善できるか？
RQ4因子空間の次元を増やすとポリシーの性能はどうなるか？

主な発見

F-ACIL は 45%超の性能改善をもたらす。
因子認識的アプローチなしの戦略に比べ、デモンストレーションは 5–10×少なくて済む。
因子の組み合わせと反復的拡張の慎重な設計により、高次元空間での一般化を実現できる。
効果的な構成的一般化は、タスク間で転移可能な構造を反映する因子比を選択することに依存する。
Sequential factor expansion (O → A → E) は、全空間カバーを大幅にデータを削減して実現できる。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。