Multistage BiCross Encoder: Team GATE Entry for MLIA Multilingual Semantic Search Task 2

01/08/2021
by Iknoor Singh, et al.

The Coronavirus (COVID-19) pandemic has led to a rapidly growing "infodemic" online. Accurate retrieval of reliable, relevant data from millions of documents about COVID-19 is therefore urgently needed by the general public as well as by other stakeholders. The COVID-19 Multilingual Information Access (MLIA) initiative is a joint effort to improve the exchange of COVID-19 related information by developing applications and services through research and community participation. In this work, we present a search system called Multistage BiCross Encoder, developed by team GATE for MLIA Task 2, Multilingual Semantic Search. Multistage BiCross Encoder is a sequential three-stage pipeline that uses the Okapi BM25 algorithm, a transformer-based bi-encoder, and a transformer-based cross-encoder to rank documents with respect to the query. The results of round 1 show that our models achieve state-of-the-art performance on all ranking metrics for both monolingual and bilingual runs.
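The three-stage pipeline described in the abstract can be illustrated with a minimal sketch. It is one plausible reading of "sequential", not the authors' implementation: BM25 selects candidates, a bi-encoder reranks them by cosine similarity, and a cross-encoder scores the surviving query-document pairs. The function names, the whitespace tokenizer, the BM25 parameters `k1` and `b`, and the cut-offs `bm25_keep` and `rerank_keep` are all assumptions made for illustration. The `encode` and `cross_score` arguments are hypothetical stand-ins for the transformer models, passed in as plain callables.

```python
import math
from collections import Counter
from typing import Callable, List, Sequence


def tokenize(text: str) -> List[str]:
    # Assumption: lowercase whitespace tokenization, for illustration only.
    return text.lower().split()


def bm25_scores(query: str, docs: Sequence[str],
                k1: float = 1.5, b: float = 0.75) -> List[float]:
    """Okapi BM25 score of every document for the query."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    avgdl = sum(len(t) for t in tokenized) / n if n else 0.0
    df = Counter()  # document frequency of each term
    for toks in tokenized:
        df.update(set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    dot = sum(a * c for a, c in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(c * c for c in v))
    return dot / (nu * nv) if nu and nv else 0.0


def multistage_rank(query: str, docs: Sequence[str],
                    encode: Callable[[str], Sequence[float]],
                    cross_score: Callable[[str, str], float],
                    bm25_keep: int = 100, rerank_keep: int = 10) -> List[int]:
    """Return document indices, best first, after three sequential stages."""
    # Stage 1: lexical candidate selection with BM25.
    s1 = bm25_scores(query, docs)
    cand = sorted(range(len(docs)), key=lambda i: s1[i], reverse=True)[:bm25_keep]
    # Stage 2: bi-encoder, query and documents embedded independently.
    q_vec = encode(query)
    s2 = {i: cosine(q_vec, encode(docs[i])) for i in cand}
    cand = sorted(cand, key=lambda i: s2[i], reverse=True)[:rerank_keep]
    # Stage 3: cross-encoder, each (query, document) pair scored jointly.
    s3 = {i: cross_score(query, docs[i]) for i in cand}
    return sorted(cand, key=lambda i: s3[i], reverse=True)
```

In such a design the cheaper stages run first so that the most expensive model, the cross-encoder, only sees a short candidate list. In practice `encode` and `cross_score` would wrap pretrained multilingual transformer models; which models the authors used is not stated in the abstract.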


research
05/03/2021

Looking for COVID-19 misinformation in multilingual social media texts

This paper presents the Multilingual COVID-19 Analysis Method (CMTA) for...
research
04/09/2021

The Burden of Being a Bridge: Understanding the Role of Multilingual Users during the COVID-19 Pandemic

The outbreak of the COVID-19 pandemic triggers infodemic over online soc...
research
07/09/2019

Multilingual Universal Sentence Encoder for Semantic Retrieval

We introduce two pre-trained retrieval focused multilingual sentence enc...
research
10/12/2020

SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search

With worldwide concerns surrounding the Severe Acute Respiratory Syndrom...
research
06/17/2020

CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization

The COVID-19 global pandemic has resulted in international efforts to un...
research
07/14/2020

Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset

We present Covidex, a search engine that exploits the latest neural rank...
