QUICK REVIEW

[论文解读] A Hybrid Latent-Class Item Response Model for Detecting Measurement Non-Invariance in Ordinal Scales

Gabriel Wallin, Qi Huang|arXiv (Cornell University)|Jan 24, 2026

Psychometric Methodologies and Testing被引用 0

一句话总结

论文开发了一种正则化的比例-优势潜类IRT模型，用于在没有已知分组标签的情况下检测有序量表的 DIF，使用 ℓ1 惩罚的边际似然和 EM 算法。

ABSTRACT

Measurement non-invariance arises when the psychometric properties of a scale differ across subgroups, undermining the validity of group comparisons. At the item level, such non-invariance manifests as differential item functioning (DIF), which occurs when the conditional distribution of an item response differs across groups after controlling for the latent trait. This paper introduces a statistical framework for detecting DIF in ordinal scales without requiring known group labels or anchor items. We propose a hybrid latent-class item response model to ordinal data using a proportional-odds formulation, assigning individuals probabilistically to latent classes. DIF is captured through class-specific shifts in item intercepts and slopes, allowing for both uniform and non-uniform DIF. The identification of DIF effects is achieved via an $L_1$-penalised marginal likelihood function under a sparsity assumption, and model estimation is implemented using a tailored EM algorithm. Simulation studies demonstrate strong recovery of item parameters and both uniform and non-uniform types of DIF. An empirical application to a personality test reveals latent subgroups with distinct response patterns and identifies items that may bias group comparisons. The proposed framework provides a flexible approach to assessing measurement invariance in ordinal scales when comparison groups are unobserved or poorly defined.

研究动机与目标

在有序量表中推动需要在子组之间检测测量不变性的需求。
开发一个不需要已知分组标签或锚项的框架。
提出一个带概率类分配的比例-优势潜类项反应模型。
结合 ℓ1 惩罚的边际似然以在稀疏假设下识别 DIF。
提供适用于该模型的估计算法并展示其有效性。

提出的方法

建立一个概率地将个体分配到潜在类别的比例-优势潜类IRT模型。
通过类特定截距和斜率的变化来建模 DIF，以捕捉统一 DIF 和非统一 DIF。
在稀疏假设下利用 ℓ1 惩罚的边际似然来确定参数。
使用定制的期望最大化（EM）算法来估计参数。
通过稀疏性对潜在尺度进行锚定，以解决由类特异斜率引起的识别问题。
通过仿真研究和一个经验应用来评估性能。

实验结果

研究问题

RQ1在没有已知分组标签或锚项的情况下，能否在有序量表中检测 DIF？
RQ2类特定截距和斜率是否足以捕捉统一 DIF 与非统一 DIF？
RQ3ℓ1 惩罚的边际似然是否能够有效恢复项目参数和 DIF 结构？
RQ4所提出的 EM 算法是否能够准确估计正则化的潜在类别IRT模型？

主要发现

仿真实验显示能够准确恢复项目参数及两种类型的 DIF。
在经验数据中可以识别出具有不同反应模式的潜在子组。
在经验应用中检测到显示潜在类特异测量非不变性的项目。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。