QUICK REVIEW

[논문 리뷰] Generative Agents: Interactive Simulacra of Human Behavior

Joon-Sung Park, Joseph C. O'Brien|arXiv (Cornell University)|2023. 04. 07.

Artificial Intelligence in Games인용 수 163

한 줄 요약

논문은 Generative Agents를 소개한다. 메모리 스트림, 반성, 및 계획으로 대형 언어 모델을 보강하여 인터랙티브한 환경에서 그럴듯한 장기적 인간 행동을 시뮬레이션하는 기억 기반 아키텍처를 제안한다. Emergent social dynamics가 있는 25-agent Smallville 샌드박스를 시연하고 차단(ablation) 및 엔드-투-엔드(end-to-end) 연구를 통해 아키텍처를 평가한다.

ABSTRACT

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent's experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors: for example, starting with only a single user-specified notion that one agent wants to throw a Valentine's Day party, the agents autonomously spread invitations to the party over the next two days, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time. We demonstrate through ablation that the components of our agent architecture--observation, planning, and reflection--each contribute critically to the believability of agent behavior. By fusing large language models with computational, interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.

연구 동기 및 목표

인터랙티브 시스템과 가상 환경에서 인간 행동의 그럴듯한 프록시를 만들기 위한 동기를 부여한다.
대형 언어 모델에 장기 기억과 추론을 확장하여 일관되고 진전하는 행동을 보장하는 아키텍처를 제안한다.
작은 마을 샌드박스에서 출현하는 사회 역동성과 일상 루틴을 선보인다.
메모리, 반성, 계획 구성 요소가 그럴듯함에 미치는 인과적 영향을 평가한다.

제안 방법

에이전트 경험의 장기 기억으로서 포괄적이고 자연어 기반의 메모리 스트림을 도입한다.
행동에 정보를 제공하기 위해 관련성, 최신성, 중요도의 균형을 맞춘 메모리 검색 모델을 개발한다.
자기 및 타인에 대한 더 높은 수준의 추론을 합성하기 위해 기억을 반성에 추가한다.
반성을 현재 맥락과 고수준의 계획으로 번역하기 위한 계획 수립을 구현한다.
메모리, 반성, 계획을 대형 언어 모델과 통합하여 그럴듯한 행동을 이끈다.
25-agent 샌드박스(Smallville)에서 에이전트가 기억하고, 계획하고, 행동하며, 자연어를 통해 상호 작용하는 방식을 시연한다.

실험 결과

연구 질문

RQ1오픈 월드 시뮬레이션에서 에이전트 아키텍처가 장기적 일관성과 기억을 어떻게 유지할 수 있는가?
RQ2메모리, 반성, 계획 구성 요소가 그럴듯함과 행동 품질에 미치는 인과적 영향은 무엇인가?
RQ3소형 언어 모델 기반의 에이전트 인구가 시간에 따라 Emergent 사회 역학 및 조정된 활동을 생성할 수 있는가?
RQ4인터랙티브 환경에서 그럴듯한 에이전트 행동을 위한 LLM 사용의 한계와 실패 모드는 무엇인가?

주요 결과

Generative agents는 개방형 월드 샌드박스에서 그럴듯한 개별 행동과 emergent 사회 역학을 생성한다.
메모리 검색, 반성, 계획 각각이 그럴듯함에 결정적으로 기여한다는 점은 차단 실험에서 입증되었다.
에이전트는 시작 포인트가 최소한의 시드 프롬프트일 때도(예: 파티를 계획) 모든 행동을 수작업으로 코딩하지 않고도 복잡한 작업을 조정할 수 있다.
엔드투엔드 평가에서 정보 확산 및 관계 기억과 같은 emergent 현상이 에이전트 사회 내에서 나타난다.
아키텍처는 기억 검색 오류 및 과도한 형식적 언어 경향과 같은 실패 모드를 드러내며, 향후 개선을 알린다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.