Domain Representative Keywords Selection: A Probabilistic Approach

03/19/2022
by Pritom Saha Akash, et al.

We propose a probabilistic approach for selecting, from a candidate set, a subset of keywords that represent a target domain in contrast to a context domain. This task is crucial for many downstream tasks in natural language processing. To contrast the target domain with the context domain, we adapt the two-component mixture model to generate a distribution over candidate keywords. This distribution gives more weight to keywords that are distinctive of the target domain than to keywords it shares with the context domain. To ensure that the selected keywords are representative of the target domain, we introduce an optimization algorithm that selects the subset from the generated candidate distribution. We show that the optimization algorithm can be implemented efficiently with a near-optimal approximation guarantee. Finally, extensive experiments on multiple domains demonstrate that our approach outperforms the baselines on keyword summary generation and trending keyword selection.
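
To make the contrast step concrete, here is a minimal sketch of a generic two-component mixture model fitted with EM. It is not the authors' implementation: the function name `contrastive_keyword_distribution`, the fixed mixing weight `lam`, the add-one smoothing of the context counts, and the toy counts in the usage note are all assumptions made for illustration. The sketch treats the target-domain keyword counts as a mix of a fixed context distribution and an unknown target-specific distribution, and estimates the latter.

```python
def contrastive_keyword_distribution(target_counts, context_counts, lam=0.5, iters=50):
    """Estimate a target-specific keyword distribution with EM.

    Assumption: each keyword occurrence in the target domain is drawn from the
    context distribution with probability `lam`, and from the unknown
    target-specific distribution with probability 1 - `lam`.
    """
    vocab = list(target_counts)

    # Fixed context model; add-one smoothing keeps every probability non-zero.
    ctx_total = sum(context_counts.get(w, 0) + 1 for w in vocab)
    p_ctx = {w: (context_counts.get(w, 0) + 1) / ctx_total for w in vocab}

    # Start the target-specific model at the plain relative frequencies.
    tgt_total = sum(target_counts.values())
    p_tgt = {w: target_counts[w] / tgt_total for w in vocab}

    for _ in range(iters):
        # E-step: probability that an occurrence of w came from the
        # target-specific component rather than the context component.
        z = {
            w: (1 - lam) * p_tgt[w] / ((1 - lam) * p_tgt[w] + lam * p_ctx[w])
            for w in vocab
        }
        # M-step: re-estimate the target-specific model from the counts
        # attributed to it, then renormalize.
        norm = sum(target_counts[w] * z[w] for w in vocab)
        p_tgt = {w: target_counts[w] * z[w] / norm for w in vocab}

    return p_tgt
```

With hypothetical counts such as `{"the": 50, "model": 30, "keyword": 20}` for the target and `{"the": 500, "model": 20, "keyword": 1}` for the context, the returned distribution moves probability mass away from "the", which is frequent in both domains, and toward "keyword", which is rare in the context. The subset-selection step described in the abstract is not sketched here, because the abstract does not state its objective.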

research
10/20/2022

Automatic Document Selection for Efficient Encoder Pretraining

Building pretrained language models is considered expensive and data-int...
research
08/09/2023

SelectNAdapt: Support Set Selection for Few-Shot Domain Adaptation

Generalisation of deep neural networks becomes vulnerable when distribut...
research
07/14/2023

Do not Mask Randomly: Effective Domain-adaptive Pre-training by Masking In-domain Keywords

We propose a novel task-agnostic in-domain pre-training method that sits...
research
03/11/2022

Supporting Schema References in Keyword Queries over Relational Databases

Relational Keyword Search (R-KwS) systems enable naive/informal users to...
research
10/23/2022

Unsupervised Non-transferable Text Classification

Training a good deep learning model requires substantial data and comput...
research
11/30/2022

Automated Generating Natural Language Requirements based on Domain Ontology

Software requirements specification is undoubtedly critical for the whol...
research
09/25/2017

"Let me convince you to buy my product ... ": A Case Study of an Automated Persuasive System for Fashion Products

Persuasiveness is a creative art aimed at making people believe in certai...
