A Geometric Method to Obtain the Generation Probability of a Sentence

by Chen Lijiang, et al.

"How to generate a sentence" is the most critical and difficult problem in all natural language processing technologies. In this paper, we present a new approach that explains the generation process of a sentence from a mathematical perspective. Our method is based on the premise that, in the brain, a sentence is part of a word network formed by many word nodes. Experiments show that the probability of an entire sentence can be obtained from the probabilities of single words and the co-occurrence probabilities of word pairs, which indicates that humans use a synthesis method to generate a sentence.





Learning a Word-Level Language Model with Sentence-Level Noise Contrastive Estimation for Contextual Sentence Probability Estimation

Inferring the probability distribution of sentences or word sequences is...

The role of grammar in transition-probabilities of subsequent words in English text

Sentence formation is a highly structured, history-dependent, and sample...

From Algebraic Word Problem to Program: A Formalized Approach

In this paper, we propose a pipeline to convert grade school level algeb...

Expect the unexpected: Harnessing Sentence Completion for Sarcasm Detection

The trigram `I love being' is expected to be followed by positive words ...

Evaluating Sentence Segmentation and Word Tokenization Systems on Estonian Web Texts

Texts obtained from web are noisy and do not necessarily follow the orth...

Sentence level estimation of psycholinguistic norms using joint multidimensional annotations

Psycholinguistic normatives represent various affective and mental const...

Local word statistics affect reading times independently of surprisal

Surprisal theory has provided a unifying framework for understanding man...