QUICK REVIEW

[논문 리뷰] RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

Wangchunshu Zhou, Yuchen Eleanor Jiang|arXiv (Cornell University)|2023. 05. 22.

Topic Modeling인용 수 10

한 줄 요약

RecurrentGPT은 자연어로 저장된 장단기 기억을 이용한 언어 기반 재귀 메커니즘을 활용하여 임의의 길이의 텍스트 생성과 ChatGPT와 같은 LLM과의 상호 작용형 글쓰기를 가능하게 한다. 대화형 소설 및 소비자 대상 콘텐츠 사례를 시연한다.

ABSTRACT

The fixed-size context of Transformer makes GPT models incapable of generating arbitrarily long text. In this paper, we introduce RecurrentGPT, a language-based simulacrum of the recurrence mechanism in RNNs. RecurrentGPT is built upon a large language model (LLM) such as ChatGPT and uses natural language to simulate the Long Short-Term Memory mechanism in an LSTM. At each timestep, RecurrentGPT generates a paragraph of text and updates its language-based long-short term memory stored on the hard drive and the prompt, respectively. This recurrence mechanism enables RecurrentGPT to generate texts of arbitrary length without forgetting. Since human users can easily observe and edit the natural language memories, RecurrentGPT is interpretable and enables interactive generation of long text. RecurrentGPT is an initial step towards next-generation computer-assisted writing systems beyond local editing suggestions. In addition to producing AI-generated content (AIGC), we also demonstrate the possibility of using RecurrentGPT as an interactive fiction that directly interacts with consumers. We call this usage of generative models by ``AI As Contents'' (AIAC), which we believe is the next form of conventional AIGC. We further demonstrate the possibility of using RecurrentGPT to create personalized interactive fiction that directly interacts with readers instead of interacting with writers. More broadly, RecurrentGPT demonstrates the utility of borrowing ideas from popular model designs in cognitive science and deep learning for prompting LLMs. Our code is available at https://github.com/aiwaves-cn/RecurrentGPT and an online demo is available at https://www.aiwaves.org/recurrentgpt.

연구 동기 및 목표

Transformer 기반 LLM의 고정 크기 컨텍스트 한계를 극복하여 장문 텍스트 생성을 다루다.
자연어 기억과 프롬프트를 사용하는 언어 기반 재귀의 대리를 도입한다.
백본 LLM의 구조를 바꾸지 않고도 인터랙티브하고 해석 가능하며 맞춤형 장문 텍스트 생성을 가능하게 한다.
인터랙티브한 글쓰기, 개인화된 인터랙티브 픽션, 그리고 콘텐츠로서의 AI 상호 작용 등의 응용을 시연한다.

제안 방법

LSTM 유사 구성요소(콘텐츠, 계획, 단기 기억, 장기 기억)를 자연어 단락과 요약으로 표현한다.
장기 기억을 디스크에 임베디드 콘텐츠의 VectorDB로 저장하여 GPU 메모리 한계를 초과한다.
프롬프트 템플릿을 사용해 LSTM의 재귀 계산 그래프를 시뮬레이션하고 새 콘텐츠, 다음 섹션에 대한 계획, 업데이트된 기억을 생성한다.
타임스텝당 여러 생성 계획을 제공하여 사용자의 선택과 상호 작용을 지원한다.
자율적 장문 텍스트 생성을 위해 계획을 선택하고 수정하는 인간-루프 시뮬레이터를 포함한다.
백본으로서 ChatGPT/GPT-4와의 통합을 시연하고 Rolling-ChatGPT, RE3, DOC 등의 기준선과 비교한다.

Figure 1: Illustration of the RecurrentGPT framework. RecurrentGPT enables recurrent prompting with LLMs by simulating an RNN using natural language building blocks and defines the recurrent computation graph with prompts.

실험 결과

연구 질문

RQ1언어 기반 재귀 메커니즘이 고정 컨텍스트 창을 넘는 진정한 장문 텍스트 생성을 가능하게 할 수 있는가?
RQ2의미론적 검색을 통한 디스크 저장 장기 기억이 수천 토큰에 걸친 일관성을 유지하는가?
RQ3계획과 기억을 통한 인터랙티브 생성이 해석 가능성과 작가 생산성을 향상시키는가?
RQ4일관성과 흥미 측면에서 RecurrentGPT가 계층적 장문 시스템 및 롤링-context 기반 기준선과 어떻게 비교되는가?

주요 결과

사람 평가에서 RecurrentGPT는 롤링 컨텍스트 기준선 및 기존 장문 방법에 비해 훨씬 더 길고 일관된 텍스트를 산출한다.
장기 및 단기 기억은 일관성에 상당히 기여하며, GPT-4 백본이 ChatGPT 또는 GPT-3.5보다 더 강한 성능을 보인다.
이 방식은 자율적 장문 텍스트 생성과 인터랙티브한 글쓰기를 가능하게 하며, 다수의 생성 계획이 사용자 선택과 편집을 지원한다.
인간 평가에서 RecurrentGPT는 장르(공상과학, 로맨스, 판타지, 공포, 미스터리, 스릴러) 전반과 더 긴 길이(약 6000단어)에서 우수함이 입증되었다.
더 강력한 백본 LLM(GPT-4)은 소거 연구에서 경량 백본 대비 성능을 크게 향상시킨다.
실험은 또한 인터랙티브 픽션 형식과 소비자 대상 AI를 콘텐츠로 활용하는 방식을 탐구한다.

Figure 2: Qualitative analysis of using RecurrentGPT as an interactive writing assistant and an interactive fiction. Highlighted plans or choices are that selected by human users.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.