Alex Smola

research

∙ 04/10/2023

A Cheaper and Better Diffusion Language Model with Soft-Masked Noise

Diffusion models that are based on iterative denoising have been recentl...

5 Jiaao Chen, et al. ∙

research

∙ 04/10/2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

This work proposes POMP, a prompt pre-training method for vision-languag...

5 Shuhuai Ren, et al. ∙

research

∙ 02/06/2023

RLSbench: Domain Adaptation Under Relaxed Label Shift

Despite the emergence of principled methods for domain adaptation under ...

0 Saurabh Garg, et al. ∙

research

∙ 02/02/2023

Multimodal Chain-of-Thought Reasoning in Language Models

Large language models (LLMs) have shown impressive performance on comple...

0 Zhuosheng Zhang, et al. ∙

research

∙ 01/04/2023

Parameter-Efficient Fine-Tuning Design Spaces

Parameter-efficient fine-tuning aims to achieve performance comparable t...

1 Jiaao Chen, et al. ∙

research

∙ 10/07/2022

Automatic Chain of Thought Prompting in Large Language Models

Large language models (LLMs) can perform complex reasoning by generating...

0 Zhuosheng Zhang, et al. ∙

research

∙ 07/04/2022

Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition

Existing out-of-distribution (OOD) detection methods are typically bench...

7 Haotao Wang, et al. ∙

research

∙ 11/01/2021

Mixture Proportion Estimation and PU Learning: A Modern Approach

Given only positive examples and unlabeled examples (from both positive ...

0 Saurabh Garg, et al. ∙

research

∙ 04/07/2021

Graph Reordering for Cache-Efficient Near Neighbor Search

Graph search is one of the most successful algorithmic trends in near ne...

0 Benjamin Coleman, et al. ∙

research

∙ 05/16/2020

Tiering as a Stochastic Submodular Optimization Problem

Tiering is an essential technique for building large-scale information r...

0 Hyokun Yun, et al. ∙

research

∙ 09/11/2019

Recognizing Variables from their Data via Deep Embeddings of Distributions

A key obstacle in automated analytics and meta-learning is the inability...

13 Jonas Mueller, et al. ∙

research

∙ 05/28/2019

Deep Factors for Forecasting

Producing probabilistic forecasts for large collections of similar and/o...

6 Yuyang Wang, et al. ∙

research

∙ 03/29/2019

SysML: The New Frontier of Machine Learning Systems

Machine learning (ML) techniques are enjoying rapidly increasing adoptio...

0 Alexander Ratner, et al. ∙

research

∙ 11/30/2018

Deep Factors with Gaussian Processes for Forecasting

A large collection of time series poses significant challenges for class...

0 Danielle C. Maddix, et al. ∙

research

∙ 06/04/2018

Deep Graphs

We propose an algorithm for deep learning on networks and graphs. It rel...

0 Emmanouil Antonios Platanios, et al. ∙

research

∙ 02/12/2018

Detecting and Correcting for Label Shift with Black Box Predictors

Faced with distribution shift between training and test set, we wish to ...

0 Zachary C Lipton, et al. ∙

research

∙ 11/15/2017

Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning

Knowledge bases (KB), both automatically and manually constructed, are o...

0 Rajarshi Das, et al. ∙

research

∙ 02/14/2017

Efficient Multi-task Feature and Relationship Learning

In this paper we propose a multi-convex framework for multi-task learnin...

0 Han Zhao, et al. ∙

research

∙ 11/14/2016

Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy

We propose a method to optimize the representation and distinguishabilit...

0 Dougal J. Sutherland, et al. ∙

research

∙ 08/24/2016

AIDE: Fast and Communication Efficient Distributed Optimization

In this paper, we present two new communication-efficient methods for di...

0 Sashank J Reddi, et al. ∙

research

∙ 07/27/2016

Stochastic Frank-Wolfe Methods for Nonconvex Optimization

We study Frank-Wolfe methods for nonconvex stochastic and finite-sum opt...

0 Sashank J Reddi, et al. ∙

research

∙ 07/18/2016

Neural Machine Translation with Recurrent Attention Modeling

Knowing which words have been attended to in previous time steps while g...

0 Zichao Yang, et al. ∙

research

∙ 05/23/2016

Fast Stochastic Methods for Nonsmooth Nonconvex Optimization

We analyze stochastic algorithms for optimizing nonconvex, nonsmooth fin...

0 Sashank J Reddi, et al. ∙

research

∙ 03/19/2016

Stochastic Variance Reduction for Nonconvex Optimization

We study nonconvex finite-sum problems and analyze stochastic variance r...

0 Sashank J Reddi, et al. ∙

research

∙ 03/19/2016

Fast Incremental Method for Nonconvex Optimization

We analyze a fast incremental aggregated gradient method for optimizing ...

0 Sashank J Reddi, et al. ∙

research

∙ 12/15/2015

Data Driven Resource Allocation for Distributed Learning

In distributed machine learning, data is dispatched to multiple machines...

0 Travis Dick, et al. ∙

research

∙ 11/07/2015

Stacked Attention Networks for Image Question Answering

This paper presents stacked attention networks (SANs) that learn to answ...

0 Zichao Yang, et al. ∙

research

∙ 06/23/2015

On Variance Reduction in Stochastic Gradient Descent and its Asynchronous Variants

We study optimization algorithms based on variance reduction for stochas...

0 Sashank J Reddi, et al. ∙

research

∙ 02/26/2015

Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo

We consider the problem of Bayesian learning on sensitive datasets and p...

0 Yu-Xiang Wang, et al. ∙

research

∙ 12/22/2014

Deep Fried Convnets

The fully connected layers of a deep convolutional neural network typica...

0 Zichao Yang, et al. ∙

research

∙ 10/28/2014

Trend Filtering on Graphs

We introduce a family of adaptive estimators on graphs, based on penaliz...

0 Yu-Xiang Wang, et al. ∙

research

∙ 05/03/2014

The Falling Factorial Basis and Its Statistical Applications

We study a novel spline-like basis, which we name the "falling factorial...

0 Yu-Xiang Wang, et al. ∙

research

∙ 02/01/2014

Randomized Nonlinear Component Analysis

Classical methods such as Principal Component Analysis (PCA) and Canonic...

0 David Lopez-Paz, et al. ∙

research

∙ 07/11/2012

Exponential Families for Conditional Random Fields

In this paper we de ne conditional random elds in reproducing kernel Hil...

0 Yasemin Altun, et al. ∙

research

∙ 06/27/2012

Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations

This paper analyzes the problem of Gaussian process (GP) bandits with de...

0 Nando de Freitas, et al. ∙

research

∙ 03/15/2012

Super-Samples from Kernel Herding

We extend the herding algorithm to continuous spaces by using the kernel...

0 Yutian Chen, et al. ∙

research

∙ 03/09/2012

Regret Bounds for Deterministic Gaussian Process Bandits

This paper analyses the problem of Gaussian process (GP) bandits with de...

0 Nando de Freitas, et al. ∙

Alex Smola

Featured Co-authors

Sign in with Google

Consider DeepAI Pro