Machine Learning using Stata/Python

03/03/2021
by   Giovanni Cerulli, et al.
0

We present two related Stata modules, r_ml_stata and c_ml_stata, for fitting popular Machine Learning (ML) methods both in regression and classification settings. Using the recent Stata/Python integration platform (sfi) of Stata 16, these commands provide hyper-parameters' optimal tuning via K-fold cross-validation using greed search. More specifically, they make use of the Python Scikit-learn API to carry out both cross-validation and outcome/label prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2021

A New Approach to Multilabel Stratified Cross Validation with Application to Large and Sparse Gene Ontology Datasets

Multilabel learning is an important topic in machine learning research. ...
research
07/25/2022

Machine Learning to Predict the Antimicrobial Activity of Cold Atmospheric Plasma-Activated Liquids

Plasma is defined as the fourth state of matter and non-thermal plasma c...
research
05/09/2022

Methodology to Create Analysis-Naive Holdout Records as well as Train and Test Records for Machine Learning Analyses in Healthcare

It is common for researchers to holdout data from a study pool to be use...
research
05/15/2023

Transactional Python for Durable Machine Learning: Vision, Challenges, and Feasibility

In machine learning (ML), Python serves as a convenient abstraction for ...
research
05/27/2014

Futility Analysis in the Cross-Validation of Machine Learning Models

Many machine learning models have important structural tuning parameters...
research
05/28/2020

Machine Learning for recognition of minerals from multispectral data

Machine Learning (ML) has found several applications in spectroscopy, in...
research
06/19/2018

Using J-K fold Cross Validation to Reduce Variance When Tuning NLP Models

K-fold cross validation (CV) is a popular method for estimating the true...

Please sign up or login with your details

Forgot password? Click here to reset