A Survey of Text Representation Methods and Their Genealogy

11/26/2022
by Philipp Siebers, et al.

In recent years, with the advent of highly scalable artificial-neural-network-based text representation methods, the field of natural language processing has seen unprecedented growth and sophistication. Drawing on the distributional hypothesis, it has become possible to distill complex linguistic information from text into multidimensional dense numeric vectors. As a consequence, text representation methods have been evolving at such a quick pace that the research community is struggling to retain knowledge of the methods and their interrelations. We address this lack of compilation, composition, and systematization with three contributions: a survey of current approaches, an arrangement of these approaches in a genealogy, and a taxonomy of text representation methods that examines and explains the state of the art. Our research is a valuable guide and reference for artificial intelligence researchers and practitioners interested in natural language processing applications such as recommender systems, chatbots, and sentiment analysis.


