QUICK REVIEW

[Paper Review] An Updated Duet Model for Passage Re-ranking

Bhaskar Mitra, Nick Craswell|arXiv (Cornell University)|Mar 18, 2019

Topic Modeling22 references34 citations

TL;DR

This paper presents Duet v2, an updated neural passage re-ranking model that integrates simple modifications (IDF-weighted interactions, word embeddings, ReLU activations, and an MLP fusion with bagging) and demonstrates improved MS MARCO performance through ablations.

ABSTRACT

We propose several small modifications to Duet---a deep neural ranking model---and evaluate the updated model on the MS MARCO passage ranking task. We report significant improvements from the proposed changes based on an ablation study.

Motivation & Objective

Motivate improvements to the Duet neural ranking model for MS MARCO passage ranking.
Propose simple architectural and input representation changes to enhance performance and training efficiency.
Quantify the impact of each modification via ablation studies and compare to state-of-the-art non-BERT baselines.

Proposed method

Replace character-level n-graph encoding with word embeddings in the distributed sub-model to speed up training.
Incorporate IDF weighting into the local interaction matrix to emphasize discriminative query terms.
Replace Tanh with ReLU activations across the model for faster training and potential performance gains.
Use a multi-layer perceptron to jointly fuse vector outputs from local and distributed sub-models (instead of a single scalar combination).
Apply bagging by training multiple Duet v2 models with different seeds and data samples to ensemble predictions.
Train with cross-entropy loss over triplets (q, p+, p−) using Adam optimizer and fixed hyperparameters; trim inputs; limit vocabulary; fixed hidden sizes.

Experimental results

Research questions

RQ1Does IDF weighting of the query-document interaction improve ranking performance on MS MARCO?
RQ2Do non-linear activations (ReLU) and an MLP-based fusion of sub-model outputs outperform the original Duet design?
RQ3Does bagging multiple Duet v2 models yield additional gains in MS MARCO passage ranking?
RQ4How does the updated Duet v2 compare to non-BERT baselines and to BERT-based approaches on MS MARCO?

Key findings

Duet v2 achieves MRR@10 of 0.243 on the dev set and 0.245 on the eval set.
Ensemble of eight Duet v2 models yields MRR@10 of 0.252 (dev) and 0.253 (eval).
An ablation removing IDF weighting degrades MRR by about 33%.
Replacing Tanh with ReLU caused about a 26% degradation in MRR when disabled.
Using a linear combination of local and distributed scores (instead of an MLP) degrades MRR by about 14%.
Bagging yields an additional ~3% improvement in MRR.
Duet v2 approaches comparable performance to other non-BERT top methods on MS MARCO and trains much faster (1.5 hours on a Tesla K40).

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.