QUICK REVIEW

[論文レビュー] Opinion Consensus Formation Among Networked Large Language Models

Iris Yazici, Mert Kayaalp|arXiv (Cornell University)|Jan 29, 2026

Opinion Dynamics and Social Influence被引用数 0

ひとこと要約

本論文は、DeGrootフレームワークの下でLLMベースのエージェントがさまざまなグラフトポロジーにおいてどのように合意へ収束するかを調査し、最終的な意見がDeGrootの予測と異なること、収束速度がグラフの固有値に結びつくことを発見した。

ABSTRACT

Can classical consensus models predict the group behavior of large language models (LLMs)? We examine multi-round interactions among LLM agents through the DeGroot framework, where agents exchange text-based messages over diverse communication graphs. To track opinion evolution, we map each message to an opinion score via sentiment analysis. We find that agents typically reach consensus and the disagreement between the agents decays exponentially. However, the limiting opinion departs from DeGroot's network-centrality-weighted forecast. The consensus between LLM agents turns out to be largely insensitive to initial conditions and instead depends strongly on the discussion subject and inherent biases. Nevertheless, transient dynamics align with classical graph theory and the convergence rate of opinions is closely related to the second-largest eigenvalue of the graph's combination matrix. Together, these findings can be useful for LLM-driven social-network simulations and the design of resource-efficient multi-agent LLM applications.

研究の動機と目的

DeGrootの合意が、多回の相互作用におけるLLMエージェントの集団挙動を予測するかを評価する。
ネットワークトポロジーとプロンプトがLLMの意見収束に与える影響を調査する。
最終意見のトピック依存の固有バイアスを探索する。
収束速度を定量化し、相互作用グラフのスペクトル特性と関連づける。
研究のさらなる発展のためのLLMネットワーク実験のオープンで大規模なデータセットを提供する。

提案手法

LLMエージェントをDeGroot参加者として、 Directed, weighted graphs上でテキストベースのメッセージを交換させる。
ネットワーク重みとエージェントの個性をシステムプロンプトで強制し、実験中は組み合わせ行列Aを固定する。
各メッセージを別の感情分析LLMにより意見スコアへ対応づけ、0〜1の正規化スコアを生成する。
トポロジー効果を調べるために、 Erdős–Rényiグラフ（結合確率p）、完全に結合されたグラフ、リングトポロジーを使用する。
意見の対立を論争的・中立・賛成と初期化し、トピックを変化させてバイアスと収束を研究する。
764実験と120万超のLLM応答を含むデータセットをHugging Faceでオープンソース化する。

Figure 1: The average standard deviation of agents’ opinions with respect to the number of iterations across $50$ bitcoin-related experiments. The shaded region indicates the standard error of the mean (SEM) across $50$ simulations.

実験結果

リサーチクエスチョン

RQ1ネットワーク上での多回のLLM相互作用は、DeGroot様の更新で合意へ収束するか。
RQ2グラフトポロジーと相互作用重みによって収束速度と最終意見はどう変化するか。
RQ3最終意見はDeGrootのネットワーク中心性加重予測と一致するか、それともトピックや事前学習の整合性により偏っているか。
RQ4収束率と組合せ行列の二次最大固有値との経験的関係は何か。
RQ5重みをシステムプロンプトで強制することは、合意形成にどのような影響を与えるか。

主な発見

Experiment Type	Average STD ± SEM
Weighted Experiments	0.083 ± 0.004
Weightless Experiments	0.165 ± 0.008

エージェントは通常、合意へ収束し、意見の不一致は指数関数的に減衰する。
最終的な合意は、DeGrootが予測する初期意見のネットワーク中心性加重平均とはしばしば異なる。
最終意見にはトピック依存のバイアスが見られ、事前学習と整合性のバイアスに影響される。
収束速度は古典的グラフ理論と一致し、組合せ行列の二次最大固有値の大きさと関連する。
システムプロンプトで重みを課すと、合意形成の達成可能性が高まる。
不一致の半減時間は、第二固有値λ2を考慮したとき理論式 t1/2 = ln(2)/(-ln|λ2|) と一致する。

Figure 2: Left : Initial opinion distributions, Right : Final opinion distributions. Each row belongs to a set of experiments with a different topic and initial opinion distribution. For example, the first row denotes a set of experiments where the initial opinion distribution is highly skewed towar

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。