[Paper Review] Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control
The paper introduces conformal counterfactual generation (CCG) to produce reliable counterfactual reports for LLM-driven agent–environment control, outperforming naive re-execution baselines and providing reliability guarantees.
Large language model (LLM)-powered agents can translate high-level user intents into plans and actions in an environment. Yet after observing an outcome, users may wonder: What if I had phrased my intent differently? We introduce a framework that enables such counterfactual reasoning in agentic LLM-driven control scenarios, while providing formal reliability guarantees. Our approach models the closed-loop interaction between a user, an LLM-based agent, and an environment as a structural causal model (SCM), and leverages test-time scaling to generate multiple candidate counterfactual outcomes via probabilistic abduction. Through an offline calibration phase, the proposed conformal counterfactual generation (CCG) yields sets of counterfactual outcomes that are guaranteed to contain the true counterfactual outcome with high probability. We showcase the performance of CCG on a wireless network control use case, demonstrating significant advantages compared to naive re-execution baselines.
Motivation & Objective
- Enable counterfactual reasoning about alternative user intents in LLM-based autonomous control workflows.
- Model the agent–environment interaction as a structural causal model to generate counterfactual reports.
- Provide reliability guarantees via conformal calibration for sets of counterfactual outcomes.
- Demonstrate fidelity and reliability improvements over naive re-execution baselines in a wireless network (5G) control scenario.
Proposed method
- Model the agent–environment system as a structural causal model (SCM) capturing X (prompt), A (action), Z (environment feedback), and Y (report).
- Use abduction to infer environment noise U_Z from the factual episode T=(X,A,Z,Y).
- Generate counterfactual actions under X′ by replaying the LLM with the same exogenous A- and Y-noise U_A, U_Y.
- Simulate counterfactual environment feedback Ẑ_{X′}(T) using inferred U_Z and counterfactual actions.
- Produce counterfactual reports Ŷ_{X′}(T) from the report-generating LLM conditioned on (X′,Â_{X′}(T),Ẑ_{X′}(T)).
- Apply conformal calibration (CCG) to create a reliable set C_λ(T,X′) of counterfactual reports with high-probability coverage guarantees.

Experimental results
Research questions
- RQ1How to reliably generate counterfactual reports for LLM-based autonomous control when the environment responds to actions?
- RQ2Can we provide statistical reliability guarantees for counterfactual reports in closed-loop agent–environment systems?
- RQ3Does conformal calibration improve the usefulness and reliability of counterfactual reasoning compared to naive re-execution?
- RQ4What is the effect of simulator fidelity on counterfactual accuracy and calibration efficiency?
Key findings
- CG (counterfactual generation) yields higher fidelity counterfactual KPI time series than IG or SIG across multiple metrics.
- In a 5G network control case study, CG achieves MAE and cross-correlation closer to true counterfactuals compared to IG and SIG for both throughput and delay.
- An LLM-based judge prefers CG-generated counterfactual reports over IG and SIG in 92 of 100 cases.
- CCG provides reliability guarantees: the generated set C_λ(T,X′) contains a good counterfactual with probability at least 1−ε, with calibration controlling miscoverage.
- CCG achieves fewer excess samples than fixed-budget baselines across tested budgets, improving sampling efficiency.

Better researchstarts right now
From paper design to paper writing, dramatically reduce your research time.
No credit card · Free plan available
This review was created by AI and reviewed by human editors.