QUICK REVIEW

[论文解读] Identifying Treatment Effects using Trimmed Means when Data are Missing Not at Random

Alex Ocampo, Heinz Schmidli|arXiv (Cornell University)|Aug 2, 2019

Statistical Methods in Clinical Trials参考文献 31被引用 4

一句话总结

本文提出使用截尾均值方法来估计临床试验中缺失数据非随机（MNAR）时的治疗效应，特别是当脱落原因与不良结局相关时。通过将缺失值赋为各治疗组中最差的观测值，并对每组分布的最低比例进行截尾，该方法在强MNAR和位置平移假设下可识别平均治疗效应，模拟结果显示其在MNAR下表现稳健，但在MAR下失效。

ABSTRACT

Patients often discontinue treatment in a clinical trial because their health condition is not improving. Consequently, the patients still in the study at the end of the trial have better health outcomes on average than the initial patient population would have had if every patient had completed the trial. If we only analyze the patients who complete the trial, then this missing data problem biases the estimator of a medication's efficacy because study outcomes are missing not at random (MNAR). One way to overcome this problem - the trimmed means approach for missing data - sets missing values as slightly worse than the worst observed outcome and then trims away a fraction of the distribution from each treatment arm before calculating differences in treatment efficacy (Permutt 2017, Pharmaceutical statistics 16.1:20-28). In this paper we derive sufficient and necessary conditions for when this approach can identify the average population treatment effect in the presence of MNAR data. Numerical studies show the trimmed means approach's ability to effectively estimate treatment efficacy when data are MNAR and missingness is strongly associated with an unfavorable outcome, but trimmed means fail when data are missing at random (MAR) when the better approach would be to multiply impute the missing values. If the reasons for discontinuation in a clinical trial are known analysts can improve estimates with a combination of multiple imputation (MI) and the trimmed means approach when the assumptions of each missing data mechanism hold. When the assumptions are justifiable, using trimmed means can help identify treatment effects notwithstanding MNAR data.

研究动机与目标

识别在缺失数据非随机（MNAR）条件下，截尾均值方法可一致估计治疗效应的条件。
通过模拟研究评估截尾均值在各种MNAR和MAR机制下的表现。
提出一种混合方法，结合多重插补与截尾均值，当两种方法的假设均满足时使用。
在真实世界临床试验数据中展示该方法的实用性，特别是在因疗效不足导致高脱落率的慢性疼痛试验中。
强调临床判断在区分MNAR与MAR脱落原因中的重要性，以实现有效推断。

提出的方法

截尾均值方法在每组中将缺失结果赋为最差的观测值，随后对分布下端预先指定比例（α）进行截尾。
该方法将治疗效应估计为两组截尾均值之差，实质上仅分析表现最佳的患者子集。
该方法依赖于两个关键假设：强MNAR（所有缺失值均劣于截尾分位数）和位置平移（未截尾分布为彼此的平移版本）。
该方法通过ANCOVA扩展以纳入协变量，并应用于重复测量的混合模型（MMRM）。
提出一种组合方法：对MAR脱落使用多重插补，对MNAR脱落使用截尾均值，假设MNAR脱落先于MAR脱落发生。
建议进行敏感性分析，使用Kolmogorov-Smirnov检验评估在观测数据上位置平移假设的合理性。

实验结果

研究问题

RQ1在何种条件下，当数据为MNAR时，截尾均值方法可一致估计平均治疗效应？
RQ2在MNAR与MAR缺失机制下，截尾均值方法的表现与标准方法相比如何？
RQ3当两种方法的假设均满足时，结合多重插补与截尾均值能否改善估计效果？
RQ4截尾均值估计量对强MNAR假设的偏离有多稳健？
RQ5在MNAR数据存在时，截尾操作对偏差、方差和检验效能有何影响？

主要发现

在强MNAR和位置平移假设下，截尾均值方法可一致估计治疗效应，模拟结果显示偏差低且置信区间覆盖良好。
在脱落原因与不良结局相关的MNAR情景下，即使强MNAR假设略有违反，该方法仍表现良好。
在MAR机制下，截尾均值方法失效，导致显著偏差，凸显了正确识别缺失机制的重要性。
将MAR脱落的多重插补与MNAR脱落的截尾均值结合，可保持估计准确性，并相比全局使用截尾均值方法降低标准误。
在慢性疼痛临床试验示例中，组合方法的治疗估计值与完整分析结果相近，但标准误低于单独使用截尾均值方法。
偏差-方差权衡表明，当强MNAR假设合理时，应采用较高截尾比例，因为截尾可增强该假设的合理性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。