QUICK REVIEW
[Paper Review] An Updated Duet Model for Passage Re-ranking
Bhaskar Mitra, Nick Craswell|arXiv (Cornell University)|Mar 18, 2019
Topic Modeling22 references34 citations
TL;DR
This paper presents Duet v2, an updated neural passage re-ranking model that integrates simple modifications (IDF-weighted interactions, word embeddings, ReLU activations, and an MLP fusion with bagging) and demonstrates improved MS MARCO performance through ablations.
ABSTRACT
We propose several small modifications to Duet---a deep neural ranking model---and evaluate the updated model on the MS MARCO passage ranking task. We report significant improvements from the proposed changes based on an ablation study.
Motivation & Objective
- Motivate improvements to the Duet neural ranking model for MS MARCO passage ranking.
- Propose simple architectural and input representation changes to enhance performance and training efficiency.
- Quantify the impact of each modification via ablation studies and compare to state-of-the-art non-BERT baselines.
Proposed method
- Replace character-level n-graph encoding with word embeddings in the distributed sub-model to speed up training.
- Incorporate IDF weighting into the local interaction matrix to emphasize discriminative query terms.
- Replace Tanh with ReLU activations across the model for faster training and potential performance gains.
- Use a multi-layer perceptron to jointly fuse vector outputs from local and distributed sub-models (instead of a single scalar combination).
- Apply bagging by training multiple Duet v2 models with different seeds and data samples to ensemble predictions.
- Train with cross-entropy loss over triplets (q, p+, p−) using Adam optimizer and fixed hyperparameters; trim inputs; limit vocabulary; fixed hidden sizes.
Experimental results
Research questions
- RQ1Does IDF weighting of the query-document interaction improve ranking performance on MS MARCO?
- RQ2Do non-linear activations (ReLU) and an MLP-based fusion of sub-model outputs outperform the original Duet design?
- RQ3Does bagging multiple Duet v2 models yield additional gains in MS MARCO passage ranking?
- RQ4How does the updated Duet v2 compare to non-BERT baselines and to BERT-based approaches on MS MARCO?
Key findings
- Duet v2 achieves MRR@10 of 0.243 on the dev set and 0.245 on the eval set.
- Ensemble of eight Duet v2 models yields MRR@10 of 0.252 (dev) and 0.253 (eval).
- An ablation removing IDF weighting degrades MRR by about 33%.
- Replacing Tanh with ReLU caused about a 26% degradation in MRR when disabled.
- Using a linear combination of local and distributed scores (instead of an MLP) degrades MRR by about 14%.
- Bagging yields an additional ~3% improvement in MRR.
- Duet v2 approaches comparable performance to other non-BERT top methods on MS MARCO and trains much faster (1.5 hours on a Tesla K40).
Better researchstarts right now
From paper design to paper writing, dramatically reduce your research time.
No credit card · Free plan available
This review was created by AI and reviewed by human editors.