QUICK REVIEW

[论文解读] A Scalable Approximate Model Counter

Supratik Chakraborty, Kuldeep S. Meel|arXiv (Cornell University)|Jun 24, 2013

Bayesian Modeling and Causal Inference参考文献 31被引用 20

一句话总结

该论文提出了 ApproxMC，这是首个针对 CNF 公式的可扩展近似模型计数器，能够在高置信度下提供 $(\varepsilon,\delta)$-保证，且容差极小。它通过多项式数量的 SAT 求解器调用来估计模型计数，使其结果在真实计数的 $(1+\varepsilon)$ 因子范围内，实测精度远低于理论界限，在包含数万个变量的基准测试中表现优异。

ABSTRACT

Propositional model counting} (#SAT), i.e., counting the number of satisfying assignments of a propositional formula, is a problem of significant theoretical and practical interest. Due to the inherent complexity of the problem, approximate model counting, which counts the number of satisfying assignments to within given tolerance and confidence level, was proposed as a practical alternative to exact model counting. Yet, approximate model counting has been studied essentially only theoretically. The only reported implementation of approximate model counting, due to Karp and Luby, worked only for DNF formulas. A few existing tools for CNF formulas are bounding model counters; they can handle realistic problem sizes, but fall short of providing counts within given tolerance and confidence, and, thus, are not approximate model counters. We present here a novel algorithm, as well as a reference implementation, that is the first scalable approximate model counter for CNF formulas. The algorithm works by issuing a polynomial number of calls to a SAT solver. Our tool, ApproxMC, scales to formulas with tens of thousands of variables. Careful experimental comparisons show that ApproxMC reports, with high confidence, bounds that are close to the exact count, and also succeeds in reporting bounds with small tolerance and high confidence in cases that are too large for computing exact model counts.

研究动机与目标

为解决精确模型计数器在实际应用中处理命题模型计数 ($\#\mathsf{SAT}$) 时的可扩展性限制。
通过设计适用于 CNF 公式的可扩展算法，弥合理论近似计数与实际实现之间的差距。
为近似计数提供 $(\varepsilon,\delta)$-风格的保证——容差与置信度，而现有工具无法实现这一点。
在保持大规模真实公式可扩展性的前提下，提升近似质量，优于现有边界计数器。

提出的方法

该算法采用一种基于哈希和随机游走的新方法，通过重复的 SAT 查询来估计满足赋值的数量。
它结合了多种边界技术（如 MiniCount、SampleCount、MBound）的混合方法，以同时改进上界和下界。
该方法采用保守的边界估计策略，以确保高置信区间，此策略已在先前工作中得到验证 [13]。
它利用多项式数量的 SAT 求解器调用，实现具有理论保证的 $(\varepsilon,\delta)$-近似。
实现中使用了 MBound 技术的混合变体，以迭代方式细化边界并缩小区间大小。
所有实验均对每个工具设置了 20 小时的超时时间，以确保与精确计数器、近似计数器和边界计数器之间的公平比较。

实验结果

研究问题

RQ1能否设计出一种针对 CNF 公式的近似模型计数器，使其在处理数万个变量时仍能保持 $(\varepsilon,\delta)$-保证？
RQ2在实际应用中，ApproxMC 的近似质量与精确计数器和现有边界计数器相比如何？
RQ3该算法能否在大型复杂公式上以高置信度维持较小的区间大小（即紧密边界）？
RQ4ApproxMC 在区间紧密度和准确性方面是否优于最先进的边界计数器？

主要发现

ApproxMC 在 Cachet 报告精确计数的 95 个基准测试中成功计算出 $(\varepsilon,\delta)$-保证的边界，所有估计值均在真实计数的指定容差范围内。
在全部 95 个基准测试中，相对误差的 $L_1$ 范数为 0.033，表明平均误差仅为 3.3%，远低于理论保证的 $\varepsilon = 0.75$。
ApproxMC 的区间大小始终小于 MiniCount、SampleCount 和 MBound 等边界计数器，证明其具有更优的近似质量。
尽管 Cachet 在更大规模问题上超时，ApproxMC 仍能持续返回高置信度且容差极小的边界，展现出超越精确计数器的可扩展性。
ApproxMC 显著改进了 MiniCount 的上界，并提升了 SampleCount 和 MBound 的下界，即使在采用保守估计策略的情况下也是如此。
该工具成功扩展到包含数万个变量的公式，能够处理对精确计数而言过大而无法处理的实例。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。