QUICK REVIEW

[论文解读] Shapley effects for sensitivity analysis with dependent inputs: bootstrap and kriging-based algorithms

Nazih Benoumechiara, Kévin Elie-Dit-Cosaque|arXiv (Cornell University)|Jan 10, 2018

Probabilistic and Robust Engineering Design参考文献 28被引用 24

一句话总结

本文提出基于自展法的置信区间与克里金化代理模型，以改进在输入依赖条件下的敏感性分析中Shapley效应的估计。通过结合蒙特卡洛采样与克里金化代理模型，该方法在降低计算成本的同时量化了抽样误差与代理模型误差，即使在昂贵模型且输入存在依赖关系的情况下，也能提供稳健且可解释的敏感性指标。

ABSTRACT

In global sensitivity analysis, the well known Sobol' sensitivity indices aim to quantify how the variance in the output of a mathematical model can be apportioned to the different variances of its input random variables. These indices are based on the functional variance decomposition and their interpretation become difficult in the presence of statistical dependence between the inputs. However, as there is dependence in many application studies, that enhances the development of interpretable sensitivity indices. Recently, the Shapley values developed in the field of cooperative games theory have been connected to global sensitivity analysis and present good properties in the presence of dependencies. Nevertheless, the available estimation methods don't always provide confidence intervals and require a large number of model evaluation. In this paper, we implement a bootstrap sampling in the existing algorithms to estimate confidence intervals of the indice estimations. We also proposed to consider a metamodel in substitution of a costly numerical model. The estimation error from the Monte-Carlo sampling is combined with the metamodel error in order to have confidence intervals on the Shapley effects. Besides, we compare for different examples with dependent random variables the results of the Shapley effects with existing extensions of the Sobol' indices.

研究动机与目标

解决在输入存在统计依赖时，可靠估计Shapley效应置信区间的挑战。
通过用克里金化代理模型替代原始模型，降低昂贵数值模型敏感性分析的计算负担。
在Shapley效应估计中分离并量化蒙特卡洛抽样误差与代理模型近似误差。
提供一种实用且可扩展的敏感性分析框架，在输入依赖条件下仍保持可解释性。
将Shapley效应与现有依赖条件下Sobol’指数的扩展形式进行比较，证明其在可解释性与鲁棒性方面的优越性。

提出的方法

在现有Shapley效应估计算法上实施自展法重采样，生成能够捕捉蒙特卡洛估计变异性的置信区间。
将基于克里金化的代理模型作为昂贵数值模型的代理，显著减少所需模型评估次数。
通过联合误差模型，将总估计误差分解为两个分量：蒙特卡洛抽样误差与克里金代理模型误差。
利用Mara等人（2015）提出的完整与条件Sobol’指数的解析表达式，计算用于比较的参考值。
通过所有输入排列的对称平均化方法应用Shapley效应公式，确保在依赖条件下公平分配贡献。
将克里金预测方差与蒙特卡洛估计器的自展方差相结合，生成总体置信区间。

实验结果

研究问题

RQ1自展法重采样是否能在不增加计算成本的前提下，有效估计Shapley效应的置信区间？
RQ2在输入依赖存在的情况下，使用克里金代理模型如何影响Shapley效应估计的准确度与不确定性量化？
RQ3在输入依赖条件下，Shapley效应在可解释性与鲁棒性方面，相较于扩展Sobol’指数的优越程度如何？
RQ4是否能可靠地将Shapley效应计算中的总估计误差分解为蒙特卡洛误差与代理模型误差两部分？
RQ5在具有相关输入的基准模型中，Shapley效应与完整及独立Sobol’指数在数量上如何比较？

主要发现

所提出的基于自展法的方法成功提供了Shapley效应估计的有效置信区间，显著提升了实际应用中的可靠性。
克里金代理模型显著减少了所需模型评估次数，同时保持了估计的准确性，使该方法在昂贵模型中具有可行性。
联合误差模型能够有效分离蒙特卡洛误差与代理模型误差，从而实现对Shapley效应估计总不确定性的更准确评估。
当输入存在依赖时，Shapley效应在可解释性与一致性方面始终优于扩展Sobol’指数，因其天然地考虑了交互作用与依赖效应。
在包含三个相关输入的解析测试案例中，Shapley效应表现出稳健且均衡的特性，$ Sh_1, Sh_2, Sh_3 $ 准确反映了来自 $ \beta_1\beta_2\beta_3 $ 和相关性参数的真实结构贡献。
该方法在不同相关结构下均表现出良好的可扩展性与稳定性，在测试案例中未出现发散或不稳定性现象。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。