QUICK REVIEW

[論文レビュー] Generative Agents: Interactive Simulacra of Human Behavior

Joon-Sung Park, Joseph C. O'Brien|arXiv (Cornell University)|Apr 7, 2023

Artificial Intelligence in Games被引用数 163

ひとこと要約

本論文は Generative Agents を紹介します。記憶ストリーム、反省、計画を用いて大規模言語モデルを拡張する記憶主導のアーキテクチャで、インタラクティブな環境における信じられる長期的な人間挙動をシミュレートします。25エージェントの Smallville サンドボックスを用い、出現的な社会ダイナミクスを示し、アブレーションとエンドツーエンドの研究を通じてアーキテクチャを評価します。

ABSTRACT

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent's experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors: for example, starting with only a single user-specified notion that one agent wants to throw a Valentine's Day party, the agents autonomously spread invitations to the party over the next two days, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time. We demonstrate through ablation that the components of our agent architecture--observation, planning, and reflection--each contribute critically to the believability of agent behavior. By fusing large language models with computational, interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.

研究の動機と目的

対話型システムや仮想環境のための、信じられる人間挙動の代理体の創出を動機づける。
長期記憶と推論を用いて一貫性のある、進化する挙動を実現する、大規模言語モデルを拡張するアーキテクチャを提案する。
小さな町のサンドボックスで出現的な社会ダイナミクスと日常生活のルーチンを展示する。
信頼性に対する記憶、反省、計画の各要素の因果的影響を評価する。

提案手法

エージェント体験の総合的かつ自然言語による長期記憶としてのメモリストリームを導入する。
挙動を情報するために関連性、最近性、重要性のバランスを取るメモリ検索モデルを開発する。
自己と他者についての高次の推論へと記憶を統合するために反省を追加する。
反省と現在の文脈を高レベルの計画と詳細な行動へ翻訳する計画を実装する。
信じられる挙動を推進するために、記憶、反省、計画を大規模言語モデルと統合する。
25エージェントのサンドボックス（Smallville）で、エージェントが自然言語を通じて記憶し、計画し、行動し、相互作用する様子を示す。

実験結果

リサーチクエスチョン

RQ1オープンワールドシミュレーションにおいて、エージェントアーキテクチャは長期的な整合性と記憶をどう維持できるか？
RQ2記憶、反省、計画の要素が信頼性と挙動品質に与える因果的影響は何か？
RQ3小規模言語モデルを動力源とするエージェント集団は、時間とともに出現的な社会ダイナミクスと協調活動を生み出せるか？
RQ4対話的環境で信じられるエージェント挙動のためにLLMを使用する際の限界と失敗モードは何か？

主な発見

Generative agents は、オープンワールドのサンドボックスで信じられる個別の挙動と出現的な社会ダイナミクスを生み出す。
記憶検索、反省、計画の各要素が信頼性に重要な寄与をすることは、アブレーション実験で示されている。
エージェントは、最小のシードプロンプト（例: パーティーの計画）から複雑なタスクを手作業で全挙動をコーディングすることなく、調整できる。
エンドツーエンドの評価は、情報拡散や関係性の記憶といったエージェント社会内の出現現象を明らかにする。
アーキテクチャは、記憶検索エラーや過度に形式的な言語傾向といった失敗モードを示し、今後の改良の指針となる。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。