QUICK REVIEW

[论文解读] Multi-Agent Consensus Seeking via Large Language Models

Huaben Chen, Wenkang Ji|arXiv (Cornell University)|Oct 31, 2023

Multi-Agent Systems and Negotiation被引用 12

一句话总结

本论文研究多位由LLM驱动的代理如何通过协商就数值状态达成共识，分析性格与拓扑等因素，并展示一个多机器人聚集应用。

ABSTRACT

Multi-agent systems driven by large language models (LLMs) have shown promising abilities for solving complex tasks in a collaborative manner. This work considers a fundamental problem in multi-agent collaboration: consensus seeking. When multiple agents work together, we are interested in how they can reach a consensus through inter-agent negotiation. To that end, this work studies a consensus-seeking task where the state of each agent is a numerical value and they negotiate with each other to reach a consensus value. It is revealed that when not explicitly directed on which strategy should be adopted, the LLM-driven agents primarily use the average strategy for consensus seeking although they may occasionally use some other strategies. Moreover, this work analyzes the impact of the agent number, agent personality, and network topology on the negotiation process. The findings reported in this work can potentially lay the foundations for understanding the behaviors of LLM-driven multi-agent systems for solving more complex tasks. Furthermore, LLM-driven consensus seeking is applied to a multi-robot aggregation task. This application demonstrates the potential of LLM-driven agents to achieve zero-shot autonomous planning for multi-robot collaboration tasks. Project website: windylab.github.io/ConsensusLLM/.

研究动机与目标

调查通过代理间协商LLM驱动的代理是否能够达成共识。
确定在共识过程中使用的常见策略（如平均）及其偏差。
检视代理性格与网络拓扑如何影响协商结果。
评估代理数量对共识稳定性和变异性的影响。
展示其作为零样本规划应用于多机器人聚集任务的可行性。

提出的方法

建模 n 个由 GPT-3.5-turbo-0613 驱动的代理，初始数值状态为随机。
允许代理在每轮输出 Reasoning 和 Position 以确定下一个状态。
避免显式策略指令，以观察涌现的共识行为。
调整性格提示以创建固执或易受暗示的代理，并观察影响。
测试不同的网络拓扑（全连通、部分、定向）以研究信息流。
进行蒙特卡洛模拟，分析代理数量和温度如何影响最终共识。

实验结果

研究问题

RQ1多个LLM驱动的代理是否能够通过协商自主达到一个共同的共识值？
RQ2代理收敛到哪些策略（如平均值），在何种条件下会出现振荡或聚类？
RQ3性格特征（固执 vs 易受暗示）与网络拓扑如何影响共识结果？
RQ4增加代理数量是否会降低最终共识的方差和幻觉？
RQ5如何将基于LLM的共识寻求应用于现实世界中的多机器人聚集任务？

主要发现

当没有明确指示时，代理通常采用平均策略达成共识，偶有偏差。
性格影响包括固执代理导致的领袖-跟随者动态，以及易受暗示代理在情境不同下可能引起振荡或更快对齐。
拓扑影响收敛速度与结构：全连通网络收敛更快；有向图造成领袖-跟随者模式，可能收敛较慢或出现死锁。
增加代理数量通常降低最终共识的方差，并可稳定群体决策，而较低温度（0）使初始平均值周围的聚簇更紧密。
某些组合的“完全易受暗示”代理可能引发振荡，而多名固执代理可能在不同状态下产生聚簇。
该框架适用于欧几里得空间中的多机器人聚集，显示基于LLM的规划可以影响自主协调。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。