Skip to main content
QUICK REVIEW

[論文レビュー] Prose2Policy (P2P): A Practical LLM Pipeline for Translating Natural-Language Access Policies into Executable Rego

Vatsal Gupta, Darshan Sreenivasamurthy|arXiv (Cornell University)|Mar 16, 2026
Natural Language Processing Techniques被引用数 0
ひとこと要約

tldr: Prose2Policy is an end-to-end LLM-based toolchain that translates natural-language access control policies into executable Rego code, including extraction, schema validation, linting, compilation, and automated tests. It emphasizes deployment reliability and auditability.

ABSTRACT

Prose2Policy (P2P) is a LLM-based practical tool that translates natural-language access control policies (NLACPs) into executable Rego code (the policy language of Open Policy Agent, OPA). It provides a modular, end-to-end pipeline that performs policy detection, component extraction, schema validation, linting, compilation, automatic test generation and execution. Prose2Policy is designed to bridge the gap between human-readable access requirements and machine-enforceable policy-as-code (PaC) while emphasizing deployment reliability and auditability. We evaluated Prose2Policy on the ACRE dataset and demonstrated a 95.3\% compile rate for accepted policies, with automated testing achieving a 82.2\% positive-test pass rate and a 98.9\% negative-test pass rate. These results indicate that Prose2Policy produces syntactically robust and behaviorally consistent Rego policies suitable for Zero Trust and compliance-driven environments.

研究の動機と目的

  • Objective 1: Bridge the gap between human-readable access requirements and machine-enforceable policy code (PaC).
  • Objective 2: Provide an end-to-end, modular pipeline from NLACP to Rego with validation, linting, and testing.
  • Objective 3: Enable auditability and deployment reliability through structured prompts and intermediate artifacts.
  • Objective 4: Support researchers with a reproducible flow to study prompting strategies and guardrails.

提案手法

  • Method 1: Structured prompting pipeline to identify intent, extract DSARCP components (Decision, Subject, Action, Resource, Condition, Purpose), and synthesize Rego.
  • Method 2: Four-module architecture: pre-processing, component extraction, schema validation, and Rego generation/refinement/testing.
  • Method 3: Rego as the target policy language with deny-by-default semantics and audit annotations.
  • Method 4: Integration of a Rego linter (Regal) and OPA compilation/testing for validation and unit test generation.
  • Method 5: Batch and single-policy interfaces with optional LLM or rule-based test generation modes.
Figure 1. Prose2Policy Architecture
Figure 1. Prose2Policy Architecture

実験結果

リサーチクエスチョン

  • RQ1Question 1: Can NLACPs be reliably detected and decomposed into executable policy elements suitable for Rego?
  • RQ2Question 2: How accurately can LLMs extract policy components (DSARCP) from NLACPs and convert them into Rego code?
  • RQ3Question 3: Does the integrated linting, compilation, and unit testing pipeline yield syntactically correct and behaviorally correct policies?
  • RQ4Question 4: What is the impact of RAGent-like baselines on the quality of generated policies compared to Prose2Policy’s approach?
  • RQ5Question 5: How does Prose2Policy perform on standard datasets (e.g., ACRE) in terms of compile rate and test pass rates?

主な発見

  • Finding 1: 95.3% のコンパイル率を達成: 371/389 の受理済み NLACPs が有効な Rego ポリシーを生成。
  • Finding 2: LLM駆動のテスト生成で正のテスト合格率 82.2%(305/371)。
  • Finding 3: LLM駆動のテストで負のテスト合格率 98.9%(367/371)。
  • Finding 4: ルールベースのテスト生成は正のテスト合格率 62.1%、負のテスト合格率 97.1%。
  • Finding 5: 自動生成中に構文の問題が原因で有効な Rego を生成できなかったポリシーが 18 件。
  • Finding 6: 高い否定テスト成功率を伴う堅牢な deny-by-default ポリシーの適用を実証。
Figure 2. Inputs and outputs of each component
Figure 2. Inputs and outputs of each component

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。