A Statistical Exploration of Text Partition Into Constituents: The Case of the Priestly Source in the Books of Genesis and Exodus

05/03/2023
by Gideon Yoffe, et al.

We present a pipeline for statistical textual exploration that offers a stylometry-based explanation and statistical validation of a hypothesized partition of a text. Given a parameterization of the text, our pipeline: (1) detects the literary features that yield the optimal overlap between the hypothesized and unsupervised partitions; (2) performs a hypothesis-testing analysis to quantify the statistical significance of the optimal overlap, while conserving implicit correlations between units of text that are more likely to be grouped; and (3) extracts and quantifies the importance of the features most responsible for the classification, and estimates their statistical stability and cluster-wise abundance. We apply our pipeline to the first two books of the Bible, where one stylistic component, the Priestly component, stands out in the eyes of biblical scholars. We identify and explore statistically significant stylistic differences between the Priestly and non-Priestly components.
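
Step (2) can be illustrated with a small sketch. It is not the authors' procedure: it assumes only two clusters, measures overlap as plain label agreement (up to swapping the two labels), and uses a circular shift of the hypothesized labels as a stand-in null model that keeps runs of adjacent text units together. The function names `overlap` and `circular_shift_test` are hypothetical.

```python
import random


def overlap(hypothesis, clusters):
    """Fraction of units on which two binary labelings agree, up to label swap."""
    if len(hypothesis) != len(clusters):
        raise ValueError("labelings must have the same length")
    n = len(hypothesis)
    agree = sum(h == c for h, c in zip(hypothesis, clusters))
    # Cluster labels are arbitrary, so also consider the swapped assignment.
    return max(agree, n - agree) / n


def circular_shift_test(hypothesis, clusters, n_draws=2000, seed=0):
    """Estimate a p-value for the observed overlap.

    Null model (an assumption of this sketch): the hypothesized labels are
    rotated by a random offset, which keeps runs of adjacent units intact
    instead of shuffling every unit independently.
    """
    hypothesis = list(hypothesis)
    rng = random.Random(seed)
    n = len(hypothesis)
    observed = overlap(hypothesis, clusters)
    hits = 0
    for _ in range(n_draws):
        k = rng.randrange(1, n)  # random non-zero rotation offset
        shifted = hypothesis[k:] + hypothesis[:k]
        if overlap(shifted, clusters) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return observed, (hits + 1) / (n_draws + 1)


# Toy data: 1 marks units assigned to the hypothesized component.
hypothesis = [1] * 10 + [0] * 25 + [1] * 5 + [0] * 20
clusters = list(hypothesis)
clusters[0] = 0   # two units where the unsupervised partition disagrees
clusters[15] = 1
observed, p_value = circular_shift_test(hypothesis, clusters)
print(f"overlap={observed:.3f}, p={p_value:.4f}")
```

A rotation-based null is only one way to respect dependence between neighboring units; a block-wise permutation would serve the same purpose.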

research
03/08/2020

ASAP-SML: An Antibody Sequence Analysis Pipeline Using Statistical Testing and Machine Learning

Antibodies are capable of potently and specifically binding individual a...
research
09/22/2022

Characterizing Uncertainty in the Visual Text Analysis Pipeline

Current visual text analysis approaches rely on sophisticated processing...
research
09/05/2023

Superclustering by finding statistically significant separable groups of optimal gaussian clusters

The paper presents the algorithm for clustering a dataset by grouping th...
research
04/09/2020

Two halves of a meaningful text are statistically different

Which statistical features distinguish a meaningful text (possibly writt...
research
09/16/2019

Distance Assessment and Hypothesis Testing of High-Dimensional Samples using Variational Autoencoders

Given two distinct datasets, an important question is if they have arise...
research
10/23/2017

Amorphous Dynamic Partial Reconfiguration with Flexible Boundaries to Remove Fragmentation

Dynamic partial reconfiguration (DPR) allows one region of a field-prog...