QUICK REVIEW

[論文レビュー] Actions Speak Louder Than Chats: Investigating AI Chatbot Age Gating

Olivia Figueira, Pranathi Chamarthi|arXiv (Cornell University)|Feb 10, 2026

AI in Service Interactions被引用数 0

ひとこと要約

要約: 本論文は人気のある消費者向けチャットボットを調査し、会話からユーザーの年齢を推定できるか、子どもが識別された場合に対処するかを検証し、ポリシーと実践のギャップを明らかにする。

ABSTRACT

AI chatbots are widely used by children and teens today, but they pose significant risks to youth's privacy and safety due to both increasingly personal conversations and potential exposure to unsafe content. While children under 13 are protected by the Children's Online Privacy Protection Act (COPPA), chatbot providers' own privacy policies may also provide protections, since they typically prohibit children from accessing their platforms. Age gating is often employed to restrict children online, but chatbot age gating in particular has not been studied. In this paper, we investigate whether popular consumer chatbots are (i) able to estimate users' ages based solely on their conversations, and (ii) whether they take action upon identifying children. To that end, we develop an auditing framework in which we programmatically interact with chatbots and conduct 1050 experiments using our comprehensive library of age-indicative prompts, including implicit and explicit age disclosures, to analyze the chatbots' responses and actions. We find that while chatbots are capable of estimating age, they do not take any action when children are identified, contradicting their own policies. Our methodology and findings provide insights for platform design, demonstrated by our proof-of-concept chatbot age gating implementation, and regulation to protect children online.

研究の動機と目的

人気のあるチャットボットが会話だけからユーザーの年齢を推定できるかを評価する。
子どもが識別された場合にチャットボットがアクセスをブロックする等の対処を取るかを評価する。
自動監査手法と年齢推定指示ライブラリを開発・検証し、系統的なテストを可能にする。
チャットボットの年齢ゲーティング対話のラベル付きデータセットを作成し、ベンチマーキングと将来の監査を可能にする。
実務における年齢ゲーティングを改善する設計と規制上の推奨を提供する。

提案手法

消費者向けチャットボットとの現実的な対話を自動化する新規の監査手法を開発する。
子ども・ティーン・大人グループを横断する明示的・暗示的年齢指示プロンプトを含む年齢ゲーティング指示ライブラリを構築する。
private ブラウザセッションを用いた Python ベースの対話自動化ツールを用いて実験を行う AI チャットボット監査ツールを実装する。
年齢推定、対処、応答スタイル（L1–L6 ラベル）を抽出するために、マニュアルおよび自動ラベリングでチャットボットの応答を後処理する。
5つの人気チャットボットを対象に 1050 件の実験（合計 4890 回のやり取り、86 時間）を実施し、ラベル付き AI チャットボット年齢ゲーティングデータセットを構築する。
年齢推定の精度、取られた対処、およびプラットフォームのポリシーとの整合性を分析する。

実験結果

リサーチクエスチョン

RQ1人気のある消費者向けチャットボットは、チャットのやり取りだけでユーザーの年齢を推定できるか。
RQ2子どもが識別された場合、チャットボットは年齢ゲーティングの対処を行い、それらの対処はプライバシーポリシーおよび COPPA に沿っているか。
RQ3明示的および暗示的な年齢指示プロンプトは年齢推定の性能にどのような影響を与えるか。
RQ4チャット内の知識蓄積が年齢推定の精度に与える影響はどの程度か。
RQ5監査成果物は若年層のオンライン安全性の設計と規制にどう活かせるか。

主な発見

明示的な年齢表示プロンプトを用いるとチャットボットは高い年齢推定精度を達成する（93–99%）。
暗示的プロンプトを用いた年齢推定はチャットセッション中に改善する（19%から42%へ）。
識別された子どもユーザーをブロックするなどの対処を、チャットボットはポリシーに沿って実行しなかった。
チャットボットはしばしば意図的な無知を示し、子どもの可能性を認めつつ保護的な対処をとらない。
子ども年齢グループはより幼い表現を示す傾向があったが、年齢信号を認識しても年齢ゲートを実施しなかった。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。