Learning Concept Embeddings for Efficient Bag-of-Concepts Densification

02/10/2017
by   Walid Shalaby, et al.
0

Explicit concept space models have proven efficacy for text representation in many natural language and text mining applications. The idea is to embed textual structures into a semantic space of concepts which captures the main topics of these structures. That so called bag-of-concepts representation suffers from data sparsity causing low similarity scores between similar texts due to low concept overlap. In this paper we propose two neural embedding models in order to learn continuous concept vectors. Once learned, we propose an efficient vector aggregation method to generate fully dense bag-of-concepts representations. Empirical results on a benchmark dataset for measuring entity semantic relatedness show superior performance over other concept embedding models. In addition, by utilizing our efficient aggregation method, we demonstrate the effectiveness of the densified vector representation over the typical sparse representations for dataless classification where we can achieve at least same or better accuracy with much less dimensions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2015

Measuring Semantic Relatedness using Mined Semantic Analysis

Mined Semantic Analysis (MSA) is a novel concept space model which emplo...
research
09/30/2017

Bag-of-Vector Embeddings of Dependency Graphs for Semantic Induction

Vector-space models, from word embeddings to neural network parsers, hav...
research
01/01/2018

Beyond Word Embeddings: Learning Entity and Concept Representations from Large Scale Knowledge Bases

Text representation using neural word embeddings has proven efficacy in ...
research
06/07/2020

Medical Concept Normalization in User Generated Texts by Learning Target Concept Embeddings

Medical concept normalization helps in discovering standard concepts in ...
research
05/02/2022

VICE: Variational Interpretable Concept Embeddings

A central goal in the cognitive sciences is the development of numerical...
research
03/07/2023

ELODIN: Naming Concepts in Embedding Spaces

Despite recent advancements, the field of text-to-image synthesis still ...
research
03/01/2023

Succinct Representations for Concepts

Foundation models like chatGPT have demonstrated remarkable performance ...

Please sign up or login with your details

Forgot password? Click here to reset