Skip to main content
QUICK REVIEW

[論文レビュー] Prompt-Learning for Fine-Grained Entity Typing

Ning Ding, Yulin Chen|arXiv (Cornell University)|Aug 24, 2021
Topic Modeling被引用数 44
ひとこと要約

この論文は、プロンプト学習を用いて完全監視、少数-shot、ゼロ-shotの設定で粒度の高い固有表現の型付けを行う方法を研究し、データが限られている場合に vanilla fine-tuning より改善を示す。

ABSTRACT

As an effective approach to tune pre-trained language models (PLMs) for specific tasks, prompt-learning has recently attracted much attention from researchers. By using \textit{cloze}-style language prompts to stimulate the versatile knowledge of PLMs, prompt-learning can achieve promising results on a series of NLP tasks, such as natural language inference, sentiment classification, and knowledge probing. In this work, we investigate the application of prompt-learning on fine-grained entity typing in fully supervised, few-shot and zero-shot scenarios. We first develop a simple and effective prompt-learning pipeline by constructing entity-oriented verbalizers and templates and conducting masked language modeling. Further, to tackle the zero-shot regime, we propose a self-supervised strategy that carries out distribution-level optimization in prompt-learning to automatically summarize the information of entity types. Extensive experiments on three fine-grained entity typing benchmarks (with up to 86 classes) under fully supervised, few-shot and zero-shot settings show that prompt-learning methods significantly outperform fine-tuning baselines, especially when the training data is insufficient.

研究の動機と目的

  • Motivate the use of prompt-learning to stimulate PLMs for fine-grained entity typing.
  • Develop a simple prompt-learning pipeline with entity-oriented labels and templates.
  • Investigate both supervised and self-supervised prompt-learning to handle data-scarce regimes.
  • Evaluate performance on three benchmarks under fully supervised, few-shot, and zero-shot settings.

提案手法

  • Formulate entity typing as a cloze-style task using prompt templates and label words.
  • Construct entity-oriented label word sets V* and compute P(y|x) as an average over P([MASK] = w|T(x)) for w in V_y.
  • Explore hard-encoding T1–T3 and soft-encoding T4 prompts to instantiate prompts for PLMs.
  • Train M with cross-entropy loss L = -log P(y|x; θ, φ) and optimize prompt parameters φ jointly with M.
  • Propose a self-supervised prompt-learning method for zero-shot typing by contrasting distributions over V* using positive and negative sentence pairs.
  • Use Jensen-Shannon divergence to measure distribution similarity and optimize with a contrastive-like objective.
  • Employ a dataset of about 1M positive and negative pairs for self-supervised learning from an entity-linked corpus.

実験結果

リサーチクエスチョン

  • RQ1Can prompt-learning outperform vanilla fine-tuning for fine-grained entity typing in fully supervised regimes?
  • RQ2How do hard and soft prompt encodings compare in effectiveness across datasets?
  • RQ3Does prompt-learning provide advantages in few-shot settings compared to traditional fine-tuning?
  • RQ4Is zero-shot entity typing feasible with self-supervised prompt-learning using unlabeled data?
  • RQ5What are the characteristics and limitations of prompt-learning for entity attribute detection on hierarchical type sets?

主な発見

データセットAcc (FT)Acc (Plet H)Acc (Plet S)MiF (FT)MiF (Plet H)MiF (Plet S)MaF (FT)MaF (Plet H)MaF (Plet S)
Few-NERD79.7579.9079.8685.7485.8485.7685.7485.8485.76
OntoNotes59.7160.3765.6870.4770.7874.5376.5776.4279.77
BBN62.3965.9263.1168.8871.5568.6867.3770.8267.81
  • Prompt-based methods outperform vanilla fine-tuning on the three benchmarks in many settings, with varying gains depending on data availability.
  • Hard-encoding templates (T3) and soft-encoding templates (T4) both improve results over fine-tuning, with dataset-dependent preferences.
  • In fully supervised settings, Plet (hard) and Plet (soft) yield higher accuracies and macro/micro F1 scores across Few-NERD, OntoNotes, and BBN.
  • Zero-shot and few-shot scenarios benefit notably from prompt-learning, particularly OntoNotes and Few-NERD where gains are substantial.
  • Self-supervised prompt-learning enables improvement in zero-shot typing by aligning distributions over label words for similar entities without explicit labels.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。