QUICK REVIEW

[论文解读] Classification of Autistic and Non-Autistic Children's Speech: A Cross-Linguistic Study in Finnish, French, and Slovak

Sofoklis Kakouros, Ida-Lotta Myllylä|arXiv (Cornell University)|Mar 6, 2026

Autism Spectrum Disorder Research被引用 0

一句话总结

该研究训练简单的声学-韵律分类器（88个 openSMILE 特征）以区分芬兰语、法语和斯洛伐克语中的ASD与TD语音，评估同语言内表现和跨语言泛化，采用聚合和 LOCO 设置进行评估。

ABSTRACT

We present a cross-linguistic study of speech in autistic and non-autistic children speaking Finnish, French, and Slovak. We combine supervised classification with within-language and cross-corpus transfer experiments to evaluate classification performance within and across languages and to probe which acoustic cues are language-specific versus language-general. Using a large set of acoustic-prosodic features, we implement speaker-level classification benchmarks as an analytical tool rather than to seek state-of-the-art performance. Within-language models, evaluated with speaker-level cross-validation, yielded heterogeneous results. The Finnish model performed best (Accuracy 0.84, F1 0.88), followed by Slovak (Accuracy 0.63, F1 0.68) and French (Accuracy 0.68, F1 0.56). We then tested cross-language generalization. A model trained on all pooled corpora reached an overall Accuracy of 0.61 and F1 0.68. Leave-one-corpus-out experiments, which test transfer to an unseen language, showed moderate success when testing on Slovak (F1 0.70) and Finnish (F1 0.78), but poor transfer to French (F1 0.42). Feature-importance analyses across languages highlighted partially shared, but not fully language-invariant, acoustic markers of autism. These findings suggest that some autism-related speech cues generalize across typologically distinct languages, but robust cross-linguistic classifiers will likely require language-aware modeling and more homogeneous recording conditions.

研究动机与目标

在三种语言中使用韵律特征评估同语言内的 ASD 与 TD 区分能力。
评估在一个或多种语言上训练的分类器的跨语言泛化能力。
通过特征重要性分析识别语言通用与语言特定的自闭症声学标记。
使用说话者级交叉验证以获得现实的泛化估计。
探讨记录条件和语言如何影响跨语言鲁棒性。

提出的方法

对每个 IPU 提取 88 维声学-韵律特征（使用 openSMILE eGeMAPS 的发声单元层函数）。
训练 XGBoost 和随机森林分类器，在说话者层级区分 ASD 与 TD。
进行同语言内的交叉验证以评估语言特异线索。
在跨语言分析中进行聚合的多语言训练并进行留一语料库（LOCO）评估。
使用基于树的重要性度量、TreeSHAP 和置换重要性分析特征重要性。
对比语言特异结果与聚合结果以识别语言通用线索。

实验结果

研究问题

RQ1在每种语言中能否使用韵律特征区分 ASD 与 TD？
RQ2在聚合和 LOCO 设置中，用一语言训练的模型对其他语言的泛化能力如何？
RQ3哪些声学-韵律特征是语言通用的自闭症标记，哪些是语言特异的？

主要发现

芬兰语同语言内准确率 0.84、F1 0.88；斯洛伐克语准确率 0.63、F1 0.68；法语准确率 0.68、F1 0.56。
跨语言聚合模型平均准确率 0.61，F1 0.68。
LOCO 传递显示芬兰 F1 0.78、斯洛伐克 F1 0.70 相对成功，但法语 F1 0.42 仍较差。
跨语言，F0（音高）分布持续有助于区分 ASD 与 TD；语言特定线索涉及谱倾斜、全局谱形、动态、共振峰结构与强度。
聚合的多语言分析揭示一组共享的、语言通用的线索，包括音高和谱形特征，尽管语言特异的强调仍然存在。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。