
Feature Importance in Gradient Boosting Trees with CrossValidation Feature Selection
Gradient Boosting Machines (GBM) are among the goto algorithms on tabul...
read it

Neural Joint Entropy Estimation
Estimating the entropy of a discrete random variable is a fundamental pr...
read it

Innovation Representation of Stochastic Processes with Application to Causal Inference
Typically, realworld stochastic processes are not easy to analyze. In t...
read it

An InformationTheoretic Framework for Nonlinear Canonical Correlation Analysis
Canonical Correlation Analysis (CCA) is a linear representation learning...
read it

Lossless (and Lossy) Compression of Random Forests
Ensemble methods are among the stateoftheart predictive modeling appr...
read it

Bregman Divergence Bounds and the Universality of the Logarithmic Loss
A loss function measures the discrepancy between the true values and the...
read it

Linear Independent Component Analysis over Finite Fields: Algorithms and Bounds
Independent Component Analysis (ICA) is a statistical tool that decompos...
read it

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach
The availability of large microarray data has led to a growing interest ...
read it

Generalized Independent Components Analysis Over Finite Alphabets
Independent component analysis (ICA) is a statistical method for transfo...
read it

PhD Dissertation: Generalized Independent Components Analysis Over Finite Alphabets
Independent component analysis (ICA) is a statistical method for transfo...
read it

Outperforming GoodTuring: Preliminary Report
Estimating a large alphabet probability distribution from a limited numb...
read it

On the Universality of the Logistic Loss Function
A loss function measures the discrepancy between the true values (observ...
read it

Optimal Procedures for Multiple Testing Problems
Multiple testing problems are a staple of modern statistical analysis. T...
read it

Gaussian Lower Bound for the Information Bottleneck Limit
The Information Bottleneck (IB) is a conceptual method for extracting th...
read it

CrossValidated Variable Selection in TreeBased Methods Improves Predictive Performance
Recursive partitioning approaches producing treelike models are a long ...
read it
Amichai Painsky
is this you? claim profile