QUICK REVIEW

[論文レビュー] Prose2Policy (P2P): A Practical LLM Pipeline for Translating Natural-Language Access Policies into Executable Rego

Vatsal Gupta, Darshan Sreenivasamurthy|arXiv (Cornell University)|Mar 16, 2026

Natural Language Processing Techniques被引用数 0

ひとこと要約

tldr: Prose2Policy is an end-to-end LLM-based toolchain that translates natural-language access control policies into executable Rego code, including extraction, schema validation, linting, compilation, and automated tests. It emphasizes deployment reliability and auditability.

ABSTRACT

Prose2Policy (P2P) is a LLM-based practical tool that translates natural-language access control policies (NLACPs) into executable Rego code (the policy language of Open Policy Agent, OPA). It provides a modular, end-to-end pipeline that performs policy detection, component extraction, schema validation, linting, compilation, automatic test generation and execution. Prose2Policy is designed to bridge the gap between human-readable access requirements and machine-enforceable policy-as-code (PaC) while emphasizing deployment reliability and auditability. We evaluated Prose2Policy on the ACRE dataset and demonstrated a 95.3\% compile rate for accepted policies, with automated testing achieving a 82.2\% positive-test pass rate and a 98.9\% negative-test pass rate. These results indicate that Prose2Policy produces syntactically robust and behaviorally consistent Rego policies suitable for Zero Trust and compliance-driven environments.

研究の動機と目的

Objective 1: Bridge the gap between human-readable access requirements and machine-enforceable policy code (PaC).
Objective 2: Provide an end-to-end, modular pipeline from NLACP to Rego with validation, linting, and testing.
Objective 3: Enable auditability and deployment reliability through structured prompts and intermediate artifacts.
Objective 4: Support researchers with a reproducible flow to study prompting strategies and guardrails.

提案手法

Method 1: Structured prompting pipeline to identify intent, extract DSARCP components (Decision, Subject, Action, Resource, Condition, Purpose), and synthesize Rego.
Method 2: Four-module architecture: pre-processing, component extraction, schema validation, and Rego generation/refinement/testing.
Method 3: Rego as the target policy language with deny-by-default semantics and audit annotations.
Method 4: Integration of a Rego linter (Regal) and OPA compilation/testing for validation and unit test generation.
Method 5: Batch and single-policy interfaces with optional LLM or rule-based test generation modes.

実験結果

リサーチクエスチョン

RQ1Question 1: Can NLACPs be reliably detected and decomposed into executable policy elements suitable for Rego?
RQ2Question 2: How accurately can LLMs extract policy components (DSARCP) from NLACPs and convert them into Rego code?
RQ3Question 3: Does the integrated linting, compilation, and unit testing pipeline yield syntactically correct and behaviorally correct policies?
RQ4Question 4: What is the impact of RAGent-like baselines on the quality of generated policies compared to Prose2Policy’s approach?
RQ5Question 5: How does Prose2Policy perform on standard datasets (e.g., ACRE) in terms of compile rate and test pass rates?

主な発見

Finding 1: 95.3% のコンパイル率を達成: 371/389 の受理済み NLACPs が有効な Rego ポリシーを生成。
Finding 2: LLM駆動のテスト生成で正のテスト合格率 82.2%（305/371）。
Finding 3: LLM駆動のテストで負のテスト合格率 98.9%（367/371）。
Finding 4: ルールベースのテスト生成は正のテスト合格率 62.1%、負のテスト合格率 97.1%。
Finding 5: 自動生成中に構文の問題が原因で有効な Rego を生成できなかったポリシーが 18 件。
Finding 6: 高い否定テスト成功率を伴う堅牢な deny-by-default ポリシーの適用を実証。

Figure 2. Inputs and outputs of each component

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。