QUICK REVIEW

[論文レビュー] Tracking the World State with Recurrent Entity Networks

Mikael Henaff, Jason Weston|arXiv (Cornell University)|Dec 12, 2016

Topic Modeling被引用数 157

ひとこと要約

The paper introduces the Recurrent Entity Network (EntNet), a memory-augmented model with parallel dynamic memory slots for tracking world state, achieving state-of-the-art results on the bAbI tasks and strong performance on CBT with single-pass reading.

ABSTRACT

We introduce a new model, the Recurrent Entity Network (EntNet). It is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond as is the case for a Memory Network (Sukhbaatar et al., 2015). Like a Neural Turing Machine or Differentiable Neural Computer (Graves et al., 2014; 2016) it maintains a fixed size memory and can learn to perform location and content-based read and write operations. However, unlike those models it has a simple parallel architecture in which several memory locations can be updated simultaneously. The EntNet sets a new state-of-the-art on the bAbI tasks, and is the first method to solve all the tasks in the 10k training examples setting. We also demonstrate that it can solve a reasoning task which requires a large number of supporting facts, which other methods are not able to solve, and can generalize past its training horizon. It can also be practically used on large scale datasets such as Children's Book Test, where it obtains competitive performance, reading the story in a single pass.

研究の動機と目的

動的な世界状態表現を処理する際に保持する必要性を動機づける。
エンティティ固有の表現を更新する並列でゲート付きメモリスロットを備えたメモリ拡張ニューラルネットワークを提案する。
EntNet がすべての bAbI タスクを解き、訓練の枠を超えた長いシーケンスにも一般化することを示す。
単一パスの読み取りで Children’s Book Test (CBT) で競争力のある結果を示す。

提案手法

固定数のメモリスロットを備え、それぞれにキー w_j と内容 h_j を持ち、入力条件付きのゲーティング機構によって更新されるEntNetを提案する。
概念-エンティティのダイナミクスをモデル化するため、共有パラメータを用いた並列のゲート付きRNN（メモリブロック）を使用する。
内容ベースおよび位置ベースのゲーティング関数 g_j = sigmoid(s_t^T h_j + s_t^T w_j) を定義し、スロットごとの更新を決定する。
学習可能なマスクと総和により入力トークンを固定長ベクトル s_t に集約する入力エンコーダを提供する。
Memory Network のワンホップに類似した出力モジュールを実装し、 memories を q^T h_j でウェイト付けして組み合わせ、解答を予測する。
全体を時間展開誤差逆伝播法で訓練し、出力を要する時間ステップから勾配を伝搬させる。

実験結果

リサーチクエスチョン

RQ1固定サイズの並列メモリ拡張ネットワークが、逐次的テキストを処理する際に内部世界モデルを維持・更新できるか。
RQ2EntNet はより長い推論シーケンスへ拡張でき、訓練の枠を超えた一般化を示すか。
RQ3EntNet は標準的推論ベンチマーク（bAbI）と現実世界に近いデータ（CBT）で、 prior memory アーキテクチャと比較してどうか。

主な発見

Task	NTM	D-NTM	MemN2N	DNC	DMN+	EntNet
1: 1 supporting fact	31.5	4.4	0	0	0	0
2: 2 supporting facts	54.5	27.5	0.3	0.4	0.3	0.1
3: 3 supporting facts	43.9	71.3	2.1	1.8	1.1	4.1
4: 2 argument relations	0	0	0	0	0	0
5: 3 argument relations	0.8	1.7	0.8	0.8	0.5	0.3
6: yes/no questions	17.1	1.5	0.1	0	0	0.2
7: counting	17.8	6.0	2.0	0.6	2.4	0
8: lists/sets	13.8	1.7	0.9	0.3	0.0	0.5
9: simple negation	16.4	0.6	0.3	0.2	0.0	0.1
10: indefinite knowledge	16.6	19.8	0	0.2	0	0.6
11: basic coreference	15.2	0	0.0	0	0.0	0.3
12: conjunction	8.9	6.2	0	0	0.2	0
13: compound coreference	7.4	7.5	0	0	0	1.3
14: time reasoning	24.2	17.5	0.2	0.4	0.2	0
15: basic deduction	47.0	0	0	0	0	0
16: basic induction	53.6	49.6	51.8	55.1	45.3	0.2
17: positional reasoning	25.5	1.2	18.6	12.0	4.2	0.5
18: size reasoning	2.2	0.2	5.3	0.8	2.1	0.3
19: path finding	4.3	39.5	2.3	3.9	0.0	2.3
20: agent’s motivation	1.5	0	0	0	0	0

EntNet は 10k の訓練サンプルで全20タスクの bAbI を解き、最先端を達成した。
モデルは訓練中に見られたシーケンスより長い長さへ一般化し、世界のダイナミクスを学習していることを示す。
合成 World Model タスクでは、EntNet はシーケンス長が成長するにつれて MemN2N および LSTM を上回り、訓練 horizon を超えて一般化する。
EntNet は CBT で競争力のある結果を達成し、単一パスモデルの中で Named Entities および Common Nouns タスクで最も良い性能を示す簡易変種が最良。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。