
Multiclass Non-Adversarial Image Synthesis, with Application to Classification from Very Small Sample
The generation of synthetic images is currently being dominated by Gener...
Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization
Domain generalization is the problem of machine learning when the traini...
An Information-Theoretic Framework for Nonlinear Canonical Correlation Analysis
Canonical Correlation Analysis (CCA) is a linear representation learning...
Learning to Prune: Speeding up Repeated Computations
It is common to encounter situations where one must solve a sequence of ...
Causal Feature Discovery through Strategic Modification
We consider an online regression setting in which individuals adapt to t...
On the Complexity of Minimizing Convex Finite Sums Without Using the Indices of the Individual Functions
Recent advances in randomized incremental methods for minimizing L-smoot...
Why do deep convolutional networks generalize so poorly to small image transformations?
Deep convolutional network architectures are often assumed to guarantee ...
Equal Opportunity in Online Classification with Partial Feedback
We study an online classification problem with partial feedback in which...
On self-play computation of equilibrium in poker
We compare performance of the genetic algorithm and the counterfactual r...
On GANs and GMMs
A longstanding problem in machine learning is to find unsupervised metho...
Learning Parities with Neural Networks
In recent years we see a rapidly growing line of research which shows le...
Ballpark Crowdsourcing: The Wisdom of Rough Group Comparisons
Crowdsourcing has become a popular method for collecting labeled trainin...
On evolutionary selection of blackjack strategies
We apply the approach of evolutionary programming to the problem of opti...
Benefits of Depth for Long-Term Memory of Recurrent Networks
The key attribute that drives the unprecedented success of modern Recurr...
Sum-Product-Quotient Networks
We present a novel tractable generative model that extends Sum-Product N...
Neuron-level Selective Context Aggregation for Scene Segmentation
Contextual information provides important cues for disambiguating visual...
Gaussian Lower Bound for the Information Bottleneck Limit
The Information Bottleneck (IB) is a conceptual method for extracting th...
Analysis and Design of Convolutional Networks via Hierarchical Tensor Decompositions
The driving force behind convolutional networks - the most successful de...
Deep Learning and Quantum Entanglement: Fundamental Connections with Implications to Network Design
Deep convolutional networks have witnessed unprecedented success in vari...
Optimal Shrinkage of Singular Values Under Random Data Contamination
A low rank matrix X has been contaminated by uniformly distributed noise...
On the Expressive Power of Overlapping Architectures of Deep Learning
Expressive efficiency refers to the relation between two architectures A...
Tractable Generative Convolutional Arithmetic Circuits
Casting neural networks in generative frameworks is a highly sought-afte...
Accelerating Innovation Through Analogy Mining
The availability of large idea repositories (e.g., the U.S. patent datab...
Inductive Bias of Deep Convolutional Networks through Pooling Geometry
Our formal understanding of the inductive bias that drives the success o...
Convolutional Rectifier Networks as Generalized Tensor Decompositions
Convolutional rectifier networks, i.e. convolutional neural networks wit...
Linear Readout of Object Manifolds
Objects are represented in sensory systems by continuous manifolds due t...
On the Expressive Power of Deep Learning: A Tensor Analysis
It has long been conjectured that hypotheses spaces suitable for data th...
Deep SimNets
We present a deep layered architecture that generalizes convolutional ne...
Learning Data Manifolds with a Cutting Plane Method
We consider the problem of classifying data manifolds where each manifol...
An Axiomatic Approach to Routing
Information delivery in a network of agents is a key issue for large, co...
Strategyproof Peer Selection using Randomization, Partitioning, and Apportionment
Peer review, evaluation, and selection is a fundamental aspect of modern...
Covariance Plasticity and Regulated Criticality
We propose that a regulation mechanism based on Hebbian covariance plast...
Information-Theoretic Bounded Rationality
Bounded rationality, that is, decision-making and planning under resourc...
A Tight Convex Upper Bound on the Likelihood of a Finite Mixture
The likelihood function of a finite mixture model is a nonconvex functi...
Online Trajectory Segmentation and Summary With Applications to Visualization and Retrieval
Trajectory segmentation is the process of subdividing a trajectory into ...
Ballpark Learning: Estimating Labels from Rough Group Comparisons
We are interested in estimating individual labels given only coarse, agg...
Estimating mutual information in high dimensions via classification error
Multivariate pattern analyses approaches in neuroimaging are fundamental...
How many faces can be recognized? Performance extrapolation for multiclass classification
The difficulty of multiclass classification generally increases with th...
Memory shapes time perception and intertemporal choices
There is a consensus that human and nonhuman subjects experience tempor...
Optimized Linear Imputation
Often in real-world datasets, especially in high dimensional data, some ...
An Algorithm for Training Polynomial Networks
We consider deep neural networks, in which the output of each node is a ...
Low-Rank Matrix Recovery from Row-and-Column Affine Measurements
We propose and study a row-and-column affine measurement scheme for low...
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
We consider the task of predicting various traits of a person given an i...
Learning Fine-grained Features via a CNN Tree for Large-scale Classification
We propose a novel approach to enhance the discriminability of Convoluti...
Efficient coordinate-descent for orthogonal matrices through Givens rotations
Optimizing over the set of orthogonal matrices is a central component in...
A Quantitative Version of the Gibbard-Satterthwaite Theorem for Three Alternatives
The Gibbard-Satterthwaite theorem states that every non-dictatorial elec...
Typical models: minimizing false beliefs
A knowledge system S describing a part of real world does in general not...
General Deformations of Point Configurations Viewed By a Pinhole Model Camera
This paper is a theoretical study of the following Non-Rigid Structure f...
Marginal Likelihoods for Distributed Parameter Estimation of Gaussian Graphical Models
We consider distributed estimation of the inverse covariance matrix, als...
Learning Sparse Low-Threshold Linear Classifiers
We consider the problem of learning a nonnegative linear classifier wit...
