Introducing Variational Inference in Statistics and Data Science Curriculum

01/03/2023
by   Vojtech Kejzlar, et al.
0

Probabilistic models such as logistic regression, Bayesian classification, neural networks, and models for natural language processing, are increasingly more present in both undergraduate and graduate statistics and data science curricula due to their wide range of applications. In this paper, we present a one-week course module for studnets in advanced undergraduate and applied graduate courses on variational inference, a popular optimization-based approach for approximate inference with probabilistic models. Our proposed module is guided by active learning principles: In addition to lecture materials on variational inference, we provide an accompanying class activity, an app, and guided labs based on real data applications of logistic regression and clustering documents using Latent Dirichlet Allocation with code. The main goal of our module is to expose students to a method that facilitates statistical modeling and inference with large datasets. Using our proposed module as a foundation, instructors can adopt and adapt it to introduce more realistic case studies and applications in data science, Bayesian statistics, multivariate analysis, and statistical machine learning courses.

READ FULL TEXT
research
09/19/2012

Variational Inference in Nonconjugate Models

Mean-field variational methods are widely used for approximate posterior...
research
01/12/2023

Open Case Studies: Statistics and Data Science Education through Real-World Applications

With unprecedented and growing interest in data science education, there...
research
08/01/2020

A fresh look at introductory data science

The proliferation of vast quantities of available datasets that are larg...
research
10/30/2022

Changes from Classical Statistics to Modern Statistics and Data Science

A coordinate system is a foundation for every quantitative science, engi...
research
03/17/2022

Kan Extensions in Data Science and Machine Learning

A common problem in data science is "use this function defined over this...
research
10/25/2020

Statistical optimality and stability of tangent transform algorithms in logit models

A systematic approach to finding variational approximation in an otherwi...
research
01/23/2019

Three principles of data science: predictability, computability, and stability (PCS)

We propose the predictability, computability, and stability (PCS) framew...

Please sign up or login with your details

Forgot password? Click here to reset