Skip to main content
QUICK REVIEW

[Paper Review] Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification

Max Simchowitz, Horia Mania|arXiv (Cornell University)|Feb 22, 2018
Control Systems and Identification116 citations
TL;DR

The paper proves that ordinary least-squares (OLS) attains nearly minimax optimal rates for identifying linear dynamical systems from a single trajectory, without relying on mixing-time arguments, by leveraging a generalized small-ball method for dependent data.

ABSTRACT

We prove that the ordinary least-squares (OLS) estimator attains nearly minimax optimal performance for the identification of linear dynamical systems from a single observed trajectory. Our upper bound relies on a generalization of Mendelson's small-ball method to dependent data, eschewing the use of standard mixing-time arguments. Our lower bounds reveal that these upper bounds match up to logarithmic factors. In particular, we capture the correct signal-to-noise behavior of the problem, showing that more unstable linear systems are easier to estimate. This behavior is qualitatively different from arguments which rely on mixing-time calculations that suggest that unstable systems are more difficult to estimate. We generalize our technique to provide bounds for a more general class of linear response time-series.

Motivation & Objective

  • Motivate the study of sample complexity in linear system identification from a single trajectory.
  • Characterize how system dynamics, via the controllability Gramian, affect estimation rates.
  • Provide near-minimax upper bounds for OLS in marginally stable regimes (rho(A*) ≤ 1).
  • Establish lower bounds matching upper bounds up to logarithmic factors to reveal signal-to-noise behavior.
  • Extend techniques to a broader class of linear response time-series.

Proposed method

  • Model the system as X_{t+1}=A_*X_t+η_t with η_t ~ N(0, σ^2 I).
  • Analyze the OLS estimator c          | Â(T)=argmin_A ∑_{t=1}^T 1/2 ||X_{t+1}-AX_t||_2^2.
  • Introduce and bound in terms of the finite-time controllability Gramian Γ_T = ∑_{s=0}^{T-1} A_*^s (A_*^s)^T.
  • Generalize Mendelsons small-ball method to dependent data via k-block martingale small-ball (BMSB) conditions.
  • Develop high-probability bounds by linking the minimum eigenvalue λ_min(Γ_k) to estimation error scales.
  • Provide a general theorem (Theorem 2.4) for linear responses with martingale small-ball conditions.
  • Apply corollaries to specific system classes (scalar, scaled orthogonal, diagonalizable).

Experimental results

Research questions

  • RQ1How many samples (in a single trajectory) are needed to estimate A_* with high probability in operator norm?
  • RQ2How does the finite-time controllability Gramian influence the estimation rate across stable and marginally stable regimes?
  • RQ3Can OLS achieve minimax-optimal rates without mixing-time arguments in dependent data settings?
  • RQ4How do different system structures (scalar, scaled orthogonal, diagonalizable) affect the rates and constants of OLS?
  • RQ5Can the results extend to general linear time-series with linear responses beyond dynamical systems?

Key findings

  • OLS achieves estimation error bounds that scale with 1/√(T λ_min(Γ_k)) up to log factors.
  • Bounds hold for any marginally stable A_* (ρ(A_*) ≤ 1) and do not rely on mixing-time arguments.
  • For stable systems, the bounds can be stated without explicit block length dependence for large T (Corollary 2.2).
  • The estimation rate depends on the excitability of the system via the controllability Gramian; larger λ_min(Γ_k) yields faster learning.
  • Lower bounds show minimax optimality up to logarithmic factors in certain regimes (Theorem 2.3).
  • The framework extends to a general time series with linear responses through a martingale small-ball condition (Theorem 2.4).

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.