Medical Question Summarization with Entity-driven Contrastive Learning

04/15/2023
by Sibo Wei, et al.

By summarizing long consumer health questions into shorter, essential ones, medical question answering (MQA) systems can more accurately understand consumer intentions and retrieve suitable answers. However, medical question summarization is challenging because patients and doctors describe health problems in markedly different ways. Although existing works have attempted to use Seq2Seq, reinforcement learning, or contrastive learning to solve the problem, two challenges remain: how to correctly capture the question focus to model its semantic intention, and how to obtain reliable datasets to fairly evaluate performance. To address these challenges, this paper proposes a novel medical question summarization framework using entity-driven contrastive learning (ECL). ECL treats medical entities in frequently asked questions (FAQs) as focuses and devises an effective mechanism to generate hard negative samples. This approach forces models to attend to the crucial focus information and generate better question summaries. Additionally, we find that some MQA datasets suffer from serious data leakage problems, such as the iCliniq dataset's 33% duplicate rate. To evaluate fairly, this paper carefully checks leaked samples and reorganizes the data into more reasonable datasets. Extensive experiments demonstrate that our ECL method outperforms state-of-the-art methods by accurately capturing question focus and generating medical question summaries. The code and datasets are available at https://github.com/yrbobo/MQS-ECL.
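The abstract does not spell out how hard negative samples are generated. As a rough illustration only, one common way to build a hard negative for contrastive learning is to keep a reference summary intact and swap its focus entity for a different medical entity, so that the negative differs from the positive only in the focus. The sketch below assumes that approach; the function name `make_hard_negative`, the entity pool, and the example question are hypothetical and are not taken from the paper or its repository.

```python
import random
import re


def make_hard_negative(summary, focus_entities, entity_pool, rng):
    """Return a copy of `summary` with each focus entity swapped for another entity.

    Hypothetical helper: an assumed entity-swap strategy, not the authors' method.
    Matching is case-insensitive and done in a single regex pass, so text that
    was just inserted is never replaced a second time.
    """
    focus = {entity.lower() for entity in focus_entities}
    # Only entities that are not themselves focuses may serve as replacements.
    candidates = [entity for entity in entity_pool if entity.lower() not in focus]
    if not focus or not candidates:
        return summary  # nothing to swap, or nothing to swap in

    # Pick one replacement per focus entity (sorted for reproducibility).
    swap = {entity: rng.choice(candidates) for entity in sorted(focus)}

    # Longest entities first so that multi-word entities win over their substrings.
    alternatives = sorted(focus, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(e) for e in alternatives), re.IGNORECASE)
    return pattern.sub(lambda m: swap.get(m.group(0).lower(), m.group(0)), summary)


if __name__ == "__main__":
    positive = "What are the side effects of metformin?"
    negative = make_hard_negative(
        positive,
        focus_entities=["metformin"],
        entity_pool=["metformin", "ibuprofen"],
        rng=random.Random(0),
    )
    print(negative)
```

In a contrastive setup, the original summary would act as the positive and the swapped version as a negative that is lexically close but has the wrong focus, which is what makes it "hard".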

research
09/01/2022

Focus-Driven Contrastive Learning for Medical Question Summarization

Automatic medical question summarization can significantly help the syst...
research
06/01/2021

Question-aware Transformer Models for Consumer Health Question Summarization

Searching for health information online is becoming customary for more a...
research
05/18/2020

Question-Driven Summarization of Answers to Consumer Health Questions

Automatic summarization of natural language is a widely studied area in ...
research
07/01/2021

Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards

The growth of online consumer health questions has led to the necessity ...
research
06/14/2022

CHQ-Summ: A Dataset for Consumer Healthcare Question Summarization

The quest for seeking health information has swamped the web with consum...
research
06/15/2023

Learning by Analogy: Diverse Questions Generation in Math Word Problem

Solving math word problem (MWP) with AI techniques has recently made gre...
research
11/19/2021

DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions

Automated question quality rating (AQQR) aims to evaluate question quali...
