QUICK REVIEW

[論文レビュー] Augmenting LLMs with Knowledge: A survey on hallucination prevention

Κωνσταντίνος Ανδριόπουλος, Johan Pouwelse|arXiv (Cornell University)|Sep 28, 2023

Topic Modeling被引用数 9

ひとこと要約

知識を強化した言語モデルが外部知識を検索・融合して幻覚を減らす方法を網羅した調査で、検索・融合・検索ベースの増強アプローチを扱う。

ABSTRACT

Large pre-trained language models have demonstrated their proficiency in storing factual knowledge within their parameters and achieving remarkable results when fine-tuned for downstream natural language processing tasks. Nonetheless, their capacity to access and manipulate knowledge with precision remains constrained, resulting in performance disparities on knowledge-intensive tasks when compared to task-specific architectures. Additionally, the challenges of providing provenance for model decisions and maintaining up-to-date world knowledge persist as open research frontiers. To address these limitations, the integration of pre-trained models with differentiable access mechanisms to explicit non-parametric memory emerges as a promising solution. This survey delves into the realm of language models (LMs) augmented with the ability to tap into external knowledge sources, including external knowledge bases and search engines. While adhering to the standard objective of predicting missing tokens, these augmented LMs leverage diverse, possibly non-parametric external modules to augment their contextual processing capabilities, departing from the conventional language modeling paradigm. Through an exploration of current advancements in augmenting large language models with knowledge, this work concludes that this emerging research direction holds the potential to address prevalent issues in traditional LMs, such as hallucinations, un-grounded responses, and scalability challenges.

研究の動機と目的

幻覚とLLMの知識制約に対処するための外部知識統合の必要性を動機づける。
外部知識増強技術を、非構造化および構造化ソースを含めて調査・分類する。
取得、メモリ拡張、取得内容と生成の融合を可能にするコアアーキテクチャを説明する。
メモリの更新、効率性、安全性など、知識拡張LLMの課題を強調する。

提案手法

Retrieval-augmented generation (RAG) と fusion-in-decoder (FiD) アーキテクチャを記述・比較する。
REALMスタイルのジョイントリトリーバー-ジェネレータ前学習とその学習ダイナミクスを説明する。
Atlas、RETRO、GRAFT-Net、PullNet を、構造化知識統合とマルチホップQAの例として詳述する。
ライブWeb情報を活用するアプローチとして検索エンジン拡張生成と SeeKeR を論じる。
構造化知識グラフのグラフベースおよびトリプルストア手法（GCN/Relational GCN）を要約する。

Figure 1: Overview of knowledge augmentation of language models from the paper by Izacard et al. [ 7 ] . The input query (light yellow), along with a number of retrieved relevant documents (light blue), passes through the generative seq2seq model to produce an output response.

実験結果

リサーチクエスチョン

RQ1外部知識でLLMを拡張する主なアーキテクチャ的パラダイムは何か？
RQ2取得と統合の仕組みは生成内容の正確性と根拠付けにどのような影響を与えるか？
RQ3パラメトリック知識保存とノンパラメトリック外部メモリのトレードオフは何か？
RQ4構造化知識グラフ、トリプルストア、テキストコーパラのマルチホップ推論とQAへの寄与は？
RQ5知識拡張LLMの更新性、スケーラビリティ、安全性、出所管理の課題は何か？

主な発見

非構造化テキストから構造化グラフまでの知識拡張戦略のスペクトラムを特定する。
取得強化モデルはパラメトリックメモリへの依存を減らし、根拠付けを改善できることを示す。
取得器とジェネレータを jointly train して取得品質とエンドタスク性能を向上させる方法を強調する。
RAGスタイル、FiD、FiD風の連結など、取得内容を組み込む際の異なる統合戦略の利点と制限を論じる。
検索エンジンなどの外部知識源を使用する際のスケーラビリティ、リアルタイム更新、安全性の重要性に言及する。

Figure 2: Overview of the Fusion-in-Decoder (FiD) [ 7 ] technique. The input question gets concatenated with each relevant passage and all concatenations get encoded in parallel. The embeddings that are produced are concatenated together (fusion) and are passed as input to the decoder.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。