
-
Can Shallow Neural Networks Beat the Curse of Dimensionality? A mean field training perspective
We prove that the gradient descent training of a two-layer neural networ...
read it
-
A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth
Training deep neural networks with stochastic gradient descent (SGD) can...
read it
-
Machine Intelligence at the Edge with Learning Centric Power Allocation
While machine-type communication (MTC) devices generate considerable amo...
read it
-
A Theoretical Analysis of Contrastive Unsupervised Representation Learning
Recent empirical works have successfully used unlabeled data to learn fe...
read it
-
Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering
This paper presents a general approach for open-domain question answerin...
read it
-
OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Single-view 3D is the task of recovering 3D properties such as depth and...
read it
-
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
We introduce DeepInversion, a new method for synthesizing images from th...
read it
-
SketchGraphs: A Large-Scale Dataset for Modeling Relational Geometry in Computer-Aided Design
Parametric computer-aided design (CAD) is the dominant paradigm in mecha...
read it
-
A Comparative Analysis of the Optimization and Generalization Property of Two-layer Neural Network and Random Feature Models Under Gradient Descent Dynamics
A fairly comprehensive analysis is presented for the gradient descent dy...
read it
-
Kolmogorov Width Decay and Poor Approximators in Machine Learning: Shallow Neural Networks, Random Feature Models and Neural Tangent Kernels
We establish a scale separation of Kolmogorov width type between subspac...
read it
-
Theoretical Analysis of Auto Rate-Tuning by Batch Normalization
Batch Normalization (BN) has become a cornerstone of deep learning acros...
read it
-
DeepV2D: Video to Depth with Differentiable Structure from Motion
We propose DeepV2D, an end-to-end differentiable deep learning architect...
read it
-
Better the Devil you Know: An Analysis of Evasion Attacks using Out-of-Distribution Adversarial Examples
A large body of recent work has investigated the phenomenon of evasion a...
read it
-
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks
Recent works have cast some light on the mystery of why deep nets fit an...
read it
-
Sequential Gaussian Processes for Online Learning of Nonstationary Functions
Many machine learning problems can be framed in the context of estimatin...
read it
-
Second Order Optimization Made Practical
Optimization in machine learning, both theoretical and applied, is prese...
read it
-
Analysis of the Gradient Descent Algorithm for a Deep Neural Network Model with Skip-connections
The behavior of the gradient descent (GD) algorithm is analyzed for a de...
read it
-
The Nonstochastic Control Problem
We consider the problem of controlling an unknown linear dynamical syste...
read it
-
Tracking and Improving Information in the Service of Fairness
As algorithmic prediction systems have become widespread, fears that the...
read it
-
Physically Realizable Adversarial Examples for LiDAR Object Detection
Modern autonomous driving systems rely heavily on deep learning models t...
read it
-
Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation
Federated learning (FL), as a distributed machine learning paradigm, pro...
read it
-
Machine Learning for Mechanical Ventilation Control
We consider the problem of controlling an invasive mechanical ventilator...
read it
-
Poisoning Attacks with Generative Adversarial Nets
Machine learning algorithms are vulnerable to poisoning attacks: An adve...
read it
-
HaarPooling: Graph Pooling with Compressive Haar Basis
Deep Graph Neural Networks (GNNs) are instrumental in graph classificati...
read it
-
DiabDeep: Pervasive Diabetes Diagnosis based on Wearable Medical Sensors and Efficient Neural Networks
Diabetes impacts the quality of life of millions of people. However, dia...
read it
-
Nonparametric Deconvolution Models
We describe nonparametric deconvolution models (NDMs), a family of Bayes...
read it
-
RDP-GAN: A Rényi-Differential Privacy based Generative Adversarial Network
Generative adversarial network (GAN) has attracted increasing attention ...
read it
-
A Mathematical Model for Linguistic Universals
Inspired by chemical kinetics and neurobiology, we propose a mathematica...
read it
-
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
In this paper, we settle the sampling complexity of solving discounted t...
read it
-
Machine learning based non-Newtonian fluid model with molecular fidelity
We introduce a machine-learning-based framework for constructing continu...
read it
-
Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation
The ability to perform effective planning is crucial for building an ins...
read it
-
D3D: Distilled 3D Networks for Video Action Recognition
State-of-the-art methods for video action recognition commonly use an en...
read it
-
Annealing for Distributed Global Optimization
The paper proves convergence to global optima for a class of distributed...
read it
-
Bio-Inspired Hashing for Unsupervised Similarity Search
The fruit fly Drosophila's olfactory circuit has inspired a new locality...
read it
-
Path Integral Based Convolution and Pooling for Graph Neural Networks
Graph neural networks (GNNs) extends the functionality of traditional ne...
read it
-
Neural Dynamics Discovery via Gaussian Process Recurrent Neural Networks
Latent dynamics discovery is challenging in extracting complex dynamics ...
read it
-
Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy
Computer vision technology is being used by many but remains representat...
read it
-
LagNetViP: A Lagrangian Neural Network for Video Prediction
The dominant paradigms for video prediction rely on opaque transition mo...
read it
-
A Novel Multi-Stage Training Approach for Human Activity Recognition from Multimodal Wearable Sensor Data Using Deep Neural Network
Deep neural network is an effective choice to automatically recognize hu...
read it
-
Online Control with Adversarial Disturbances
We study the control of a linear dynamical system with adversarial distu...
read it
-
On the Utility of Learning about Humans for Human-AI Coordination
While we would like agents that can coordinate with humans, current algo...
read it
-
SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models
Standard variational lower bounds used to train latent variable models p...
read it
-
Distributed Learning: Sequential Decision Making in Resource-Constrained Environments
We study cost-effective communication strategies that can be used to imp...
read it
-
Inherent Noise in Gradient Based Methods
Previous work has examined the ability of larger capacity neural network...
read it
-
Modeling the Gaia Color-Magnitude Diagram with Bayesian Neural Flows to Constrain Distance Estimates
We demonstrate an algorithm for learning a flexible color-magnitude diag...
read it
-
The Efficiency of Human Cognition Reflects Planned Information Processing
Planning is useful. It lets people take actions that have desirable long...
read it
-
CornerNet: Detecting Objects as Paired Keypoints
We propose CornerNet, a new approach to object detection where we detect...
read it
-
On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization
Adaptive gradient methods are workhorses in deep learning. However, the ...
read it
-
Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks
Bayesian neural networks (BNNs) hold great promise as a flexible and pri...
read it
-
Steerable ePCA
In photon-limited imaging, the pixel intensities are affected by photon ...
read it