Simulation studies on Python using sstudy package with SQL databases as storage

04/27/2020
by   Marco A H Inácio, et al.
0

Performance assessment is a key issue in the process of proposing new machine learning/statistical estimators. A possible method to complete such task is by using simulation studies, which can be defined as the procedure of estimating and comparing properties (such as predictive power) of estimators (and other statistics) by averaging over many replications given a true distribution; i.e.: generating a dataset, fitting the estimator, calculating and storing the predictive power, and then repeating the procedure many times and finally averaging over the stored predictive powers. Given that, in this paper, we present sstudy: a Python package designed to simplify the preparation of simulation studies using SQL database engines as the storage system; more specifically, we present its basic features, usage examples and references to the its documentation. We also present a short statistical description of the simulation study procedure with a simplified explanation of what is being estimated by it, as well as some examples of applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/08/2017

Using simulation studies to evaluate statistical methods

Simulation studies are computer experiments which involve creating data ...
research
09/09/2019

INTEREST: INteractive Tool for Exploring REsults from Simulation sTudies

Simulation studies allow us to explore the properties of statistical met...
research
01/05/2023

TextDescriptives: A Python package for calculating a large variety of statistics from text

TextDescriptives is a Python package for calculating a large variety of ...
research
07/02/2020

A New ECDF Two-Sample Test Statistic

Empirical cumulative distribution functions (ECDFs) have been used to te...
research
07/08/2019

A Versatile Estimation Procedure without Estimating the Nonignorable Missingness Mechanism

We consider the estimation problem in a regression setting where the out...
research
07/05/2023

Replicability of Simulation Studies for the Investigation of Statistical Methods: The RepliSims Project

Results of simulation studies evaluating the performance of statistical ...
research
10/23/2017

bridgesampling: An R Package for Estimating Normalizing Constants

Statistical procedures such as Bayes factor model selection and Bayesian...

Please sign up or login with your details

Forgot password? Click here to reset