Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach

05/27/2021
by Jie Huang, et al.

We propose to measure fine-grained domain relevance: the degree to which a term is relevant to a broad (e.g., computer science) or narrow (e.g., deep learning) domain. Such measurement is crucial for many downstream tasks in natural language processing. To handle long-tail terms, we build a core-anchored semantic graph, which uses core terms with rich description information to semantically bridge the vast number of remaining fringe terms. To support a fine-grained domain without relying on a matching corpus for supervision, we develop hierarchical core-fringe learning, which learns core and fringe terms jointly in a semi-supervised manner, contextualized in the hierarchy of the domain. To reduce expensive human effort, we employ automatic annotation and hierarchical positive-unlabeled learning. Our approach applies to big or small domains, covers head or tail terms, and requires little human effort. Extensive experiments demonstrate that our methods outperform strong baselines and even surpass professional human performance.
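To make the core-fringe idea concrete, here is a minimal sketch and not the authors' implementation. It assumes toy two-dimensional term vectors, made-up relevance labels for the core terms, and plain label propagation as a stand-in for the learning model described above. It also leaves out the domain hierarchy and the positive-unlabeled learning step. All names (`build_core_anchored_graph`, `propagate`, the example terms and vectors) are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def build_core_anchored_graph(vectors, core_terms, k=2):
    """Link every term to its k most similar core terms (undirected edges)."""
    neighbors = {term: set() for term in vectors}
    for term, vec in vectors.items():
        ranked = sorted(
            (c for c in core_terms if c != term),
            key=lambda c: cosine(vec, vectors[c]),
            reverse=True,
        )
        for core in ranked[:k]:
            neighbors[term].add(core)
            neighbors[core].add(term)
    return neighbors


def propagate(neighbors, core_labels, rounds=20):
    """Spread core-term labels to fringe terms by neighbor averaging.

    Core terms stay clamped to their labels; fringe terms start at 0.5.
    """
    scores = {t: core_labels.get(t, 0.5) for t in neighbors}
    for _ in range(rounds):
        updated = {}
        for term, nbrs in neighbors.items():
            if term in core_labels:
                updated[term] = core_labels[term]
            elif nbrs:
                updated[term] = sum(scores[n] for n in nbrs) / len(nbrs)
            else:
                updated[term] = scores[term]
        scores = updated
    return scores


# Toy data (assumed): 1.0 = relevant to "deep learning", 0.0 = not relevant.
vectors = {
    "neural network": [1.0, 0.1],
    "backpropagation": [0.9, 0.2],
    "photosynthesis": [0.1, 1.0],
    "chlorophyll": [0.2, 0.9],
    "dropout regularization": [0.95, 0.15],  # fringe term, no label
    "thylakoid membrane": [0.15, 0.95],      # fringe term, no label
}
core_labels = {
    "neural network": 1.0,
    "backpropagation": 1.0,
    "photosynthesis": 0.0,
    "chlorophyll": 0.0,
}

graph = build_core_anchored_graph(vectors, list(core_labels), k=2)
scores = propagate(graph, core_labels)
for term in ("dropout regularization", "thylakoid membrane"):
    print(term, round(scores[term], 3))
```

In this sketch the fringe terms carry no labels of their own. They receive a relevance score only through the core terms they are linked to, which is the bridging role the abstract assigns to core terms.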

