Skip to main content
QUICK REVIEW

[Paper Review] Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL

Lukas Stappen, Fabian Brunn|arXiv (Cornell University)|Apr 13, 2020
Hate Speech and Cyberbullying Detection49 references52 citations
TL;DR

The paper proposes a frozen-transformer feature-extraction approach with a novel AXEL attention-based classification block to tackle uni-, cross-lingual zero-/few-shot hate speech detection on HatEval English and Spanish data, achieving competitive results with far fewer trainable parameters.

ABSTRACT

Detecting hate speech, especially in low-resource languages, is a non-trivial challenge. To tackle this, we developed a tailored architecture based on frozen, pre-trained Transformers to examine cross-lingual zero-shot and few-shot learning, in addition to uni-lingual learning, on the HatEval challenge data set. With our novel attention-based classification block AXEL, we demonstrate highly competitive results on the English and Spanish subsets. We also re-sample the English subset, enabling additional, meaningful comparisons in the future.

Motivation & Objective

  • Motivate cross-lingual hate speech detection in low-resource languages using frozen transformer features.
  • Evaluate uni-lingual and cross-lingual (zero-/few-shot) learning on English and Spanish HatEval data.
  • Develop and assess a lightweight classification block (AXEL) to maximize information from frozen transformer features.
  • Propose a stratified English data partition to mitigate out-of-domain sampling and better assess generalization.

Proposed method

  • Use frozen Transformer Language Models (BERT base multilingual and XLM) as feature extractors rather than fine-tuning.
  • Extract representations from selected layers and feed them into trainable classification blocks.
  • Introduce AXEL, an attention-based block inspired by vision modules to compress and enhance sequential text features before classification.
  • Compare multiple classification blocks (including RCAB, CBAM, CSAR, RAM) and demonstrate AXEL’s superior performance.
  • Evaluate zero-shot and few-shot cross-lingual transfer by training on one language and testing on another, and by injecting small portions of target-language data.

Experimental results

Research questions

  • RQ1How effective are frozen Transformer features for uni-lingual hate speech detection compared to fine-tuned models?
  • RQ2Can AXEL improve classification performance when using frozen Transformer representations for hate speech detection?
  • RQ3What is the impact of cross-lingual zero-shot and few-shot learning on English and Spanish HatEval data?
  • RQ4Does stratified English data partitioning reduce out-of-domain sampling effects and improve generalization?
  • RQ5How do cross-lingual representations (XLM-based) compare to BERT-based features for cross-language hate speech detection?

Key findings

  • Frozen Transformer features with AXEL achieve competitive hate speech detection results with far fewer trainable parameters than fine-tuned models.
  • AXEL substantially outperforms other adapted blocks, with EN-S achieving 71.16 F1 and ES achieving 69.70 F1 in Table 4 results.
  • Cross-lingual zero-shot performance is generally weaker than monolingual, but AXEL remains the best among XLM-based options on most settings; zero-shot can improve with translation-augmented evaluation.
  • Few-shot learning shows substantial gains; adding as little as 1% target-language data dramatically boosts F1 and can even surpass monolingual performance in some setups (e.g., EN-S).
  • A new English data partition (EN-S) reduces out-of-domain sampling effects and yields more balanced, comparable performance across EN and ES subsets.
  • XLM-based models are strong in zero-shot settings when using a simple dense/AXEL classifier, while BERT-based sequential encoders benefit more from full-sequence encoding.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.