Skip to main content
QUICK REVIEW

[论文解读] System Identification under Constraints and Disturbance: A Bayesian Estimation Approach

Sergi Martínez, Steve Tonneau|arXiv (Cornell University)|Feb 18, 2026
Robot Manipulation and Learning被引用 0
一句话总结

论文提出一个贝叶斯系统识别框架,在接触与回路约束下联合估计浮动基座机器人状态轨迹、扰动和物理参数,利用基于能量的观测与等式约束Riccati求解器实现可扩展性。对Unitree B1 搭载 Z1 手臂的硬件实验和基于MPC的评估显示相对于基线在精度和接触一致性上有改善。

ABSTRACT

We introduce a Bayesian system identification (SysID) framework for jointly estimating robot's state trajectories and physical parameters with high accuracy. It embeds physically consistent inverse dynamics, contact and loop-closure constraints, and fully featured joint friction models as hard, stage-wise equality constraints. It relies on energy-based regressors to enhance parameter observability, supports both equality and inequality priors on inertial and actuation parameters, enforces dynamically consistent disturbance projections, and augments proprioceptive measurements with energy observations to disambiguate nonlinear friction effects. To ensure scalability, we derive a parameterized equality-constrained Riccati recursion that preserves the banded structure of the problem, achieving linear complexity in the time horizon, and develop computationally efficient derivatives. Simulation studies on representative robotic systems, together with hardware experiments on a Unitree B1 equipped with a Z1 arm, demonstrate faster convergence, lower inertial and friction estimation errors, and improved contact consistency compared to forward-dynamics and decoupled identification baselines. When deployed within model predictive control frameworks, the resulting models yield measurable improvements in tracking performance during locomotion over challenging environments.

研究动机与目标

  • 在接触与回路约束下,联合估计浮动基座机器人状态轨迹、扰动、执行效应和物理参数。
  • 将物理一致的动力学、接触约束与摩擦模型以硬性、阶段性等式与不等式形式强制执行。
  • 通过能量基观测和无力矩观测提高摩擦可观测性。
  • 研发具有分析导数的可扩展等式约束Riccati求解器,以支撑长时间跨度的 SysID。
  • 在仿真与硬件上展示更快的收敛和更高的参数精度,同时结合MPC带来性能提升。

提出的方法

  • 将贝叶斯 SysID 与混合逆动力学模型以及等式/不等式约束相结合,以强制动力学的一致性与物理可行性(方程1a–1f)。
  • 使用带解析导数的参数化前向或逆动力学以获得优化的灵敏度。
  • 通过运动约束和Gauss原理在两种模式与冲击中引入闭环运动学、双边接触和重置映射。
  • 用 differentiable、能量感知的回归量表征摩擦(粘着、干摩、粘性与斯特里克效应),并用能量基础观测来增强观测信息。
  • 利用李代数基础的参数化与与约束一致的扰动投影在保持物理一致性的同时实现可扩展性。
  • 推导并应用等式约束的Riccati递归及其导数,以在长、分辨率不同的数据集中高效求解结构化的KKT系统。
Figure 1 : The Unitree B1 quadruped robot performing a step-up maneuver. The leg includes a reduction gear and a four-bar linkage, both explicitly modeled as constraints within our SysID framework. The highlighted cross-section reveals the internal linkage bars, whose inertial properties and actuato
Figure 1 : The Unitree B1 quadruped robot performing a step-up maneuver. The leg includes a reduction gear and a four-bar linkage, both explicitly modeled as constraints within our SysID framework. The highlighted cross-section reveals the internal linkage bars, whose inertial properties and actuato

实验结果

研究问题

  • RQ1如何在接触与回路约束下,联合估计浮动基座机器人的状态、扰动、执行效应与物理参数?
  • RQ2在扭矩传感受限或缺失时,能否通过能量基观测改善摩擦参数的可辨识性与估计?
  • RQ3等式约束的Riccati求解器是否能够在较长时间尺度上实现可扩展的贝叶斯 SysID?
  • RQ4能量一致的先验和约束感知形式是否在精度与鲁棒性上优于正向动力学或解耦 SysID 基线?
  • RQ5在挑战性环境下将识别模型用于模型预测控制时,能带来哪些性能提升?

主要发现

  • 相较于正向动力学与解耦基线,仿真与硬件实验显示参数估计更准确且收敛更快。
  • 通过显式约束嵌入与动力学建模,接触与回路闭合的一致性显著提高。
  • 能量基观测有助于区分摩擦效应,在缺乏直接扭矩传感的情况下提高可辨识性。
  • 使用所提框架识别的模型在将其应用于挑战性地形上的MPC行进时,跟踪性能有可观测的提升。
Figure 2 : Overview of the our Bayesian optimization pipeline for system identification. Proprioceptive measurements (e.g., encoders, IMU) and optional exteroceptive measurements (e.g., visual odometry/ICP) are synchronized and used to jointly estimate the state trajectory, disturbances, actuation e
Figure 2 : Overview of the our Bayesian optimization pipeline for system identification. Proprioceptive measurements (e.g., encoders, IMU) and optional exteroceptive measurements (e.g., visual odometry/ICP) are synchronized and used to jointly estimate the state trajectory, disturbances, actuation e

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。