
-
PBoS: Probabilistic Bag-of-Subwords for Generalizing Word Embedding
We look into the task of generalizing word embeddings: given a set of pr...
read it
-
Functional Regularization for Representation Learning: A Unified Theoretical Perspective
Unsupervised and self-supervised learning approaches have become a cruci...
read it
-
Can Adversarial Weight Perturbations Inject Neural Backdoors?
Adversarial machine learning has exposed several security hazards of neu...
read it
-
Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model
In the setting of entangled single-sample distributions, the goal is to ...
read it
-
Learning Entangled Single-Sample Distributions via Iterative Trimming
In the setting of entangled single-sample distributions, the goal is to ...
read it
-
Gradients as Features for Deep Representation Learning
We address the challenging problem of deep representation learning–the e...
read it
-
SimpleTran: Transferring Pre-Trained Sentence Embeddings for Low Resource Text Classification
Fine-tuning pre-trained sentence embedding models like BERT has become t...
read it
-
Sketching Transformed Matrices with Applications to Natural Language Processing
Suppose we are given a large matrix A=(a_i,j) that cannot be stored in m...
read it
-
Learning Relationships between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis
Multimodal language analysis often considers relationships between featu...
read it
-
Shallow Domain Adaptive Embeddings for Sentiment Analysis
This paper proposes a way to improve the performance of existing algorit...
read it
-
Robust Attribution Regularization
An emerging problem in trustworthy machine learning is to train models t...
read it
-
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
Neural networks have great success in many machine learning applications...
read it
-
Recovery Guarantees for Quadratic Tensors with Limited Observations
We consider the tensor completion problem of predicting the missing entr...
read it
-
Generalizing Word Embeddings using Bag of Subwords
We approach the problem of generalizing pre-trained word embeddings beyo...
read it
-
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
Neural networks have many successful applications, while much less theor...
read it
-
N-Gram Graph, A Novel Molecule Representation
Virtual high-throughput screening provides a strategy for prioritizing c...
read it
-
Improving Adversarial Robustness by Data-Specific Discretization
A recent line of research proposed (either implicitly or explicitly) gra...
read it
-
A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors
Motivations like domain adaptation, transfer learning, and feature learn...
read it
-
Domain Adapted Word Embeddings for Improved Sentiment Classification
Generic word embeddings are trained on large-scale generic corpora; Doma...
read it
-
Learning Mixtures of Linear Regressions with Nearly Optimal Complexity
Mixtures of Linear Regressions (MLR) is an important mixture model with ...
read it
-
Provable Alternating Gradient Descent for Non-negative Matrix Factorization with Strong Correlations
Non-negative matrix factorization is a basic tool for decomposing data i...
read it
-
Optimal Sample Complexity for Matrix Completion and Related Problems via ℓ_2-Regularization
We study the strong duality of non-convex matrix factorization: we show ...
read it
-
Generalization and Equilibrium in Generative Adversarial Nets (GANs)
We show that training of generative adversarial network (GAN) may not ha...
read it
-
Recovery Guarantee of Non-negative Matrix Factorization via Alternating Updates
Non-negative matrix factorization is a popular tool for decomposing data...
read it
-
Diverse Neural Network Learns True Target Functions
Neural networks are a powerful class of functions that can be trained wi...
read it
-
Mapping Between fMRI Responses to Movies and their Natural Language Annotations
Several research groups have shown how to correlate fMRI responses to th...
read it
-
Recovery guarantee of weighted low-rank approximation via alternating minimization
Many applications require recovering a ground truth low-rank matrix from...
read it
-
Linear Algebraic Structure of Word Senses, with Applications to Polysemy
Word embeddings are ubiquitous in NLP and information retrieval, but it'...
read it
-
RAND-WALK: A Latent Variable Model Approach to Word Embeddings
Semantic word embeddings represent the meaning of a word via a vector, a...
read it
-
Scalable Kernel Methods via Doubly Stochastic Gradients
The general perception is that kernel methods are not scalable, and neur...
read it
-
A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning
Learning sparse combinations is a frequent theme in machine learning. In...
read it
-
Budgeted Influence Maximization for Multiple Products
The typical algorithmic problem in viral marketing aims to identify a se...
read it
-
Distributed k-Means and k-Median Clustering on General Topologies
This paper provides new algorithms for distributed clustering for two po...
read it