
Bayesian Optimization in AlphaGo
During the development of AlphaGo, its many hyperparameters were tuned ...

Large-Scale Visual Speech Recognition
This work presents a scalable solution to open-vocabulary visual speech ...

Task-Relevant Adversarial Imitation Learning
We show that a critical problem in adversarial imitation from high-dimen...

Critic Regularized Regression
Offline reinforcement learning (RL), also known as batch RL, offers the ...

Acme: A Research Framework for Distributed Reinforcement Learning
Deep reinforcement learning has led to many recent, and groundbreaking, ad...

Meta-learning of Sequential Strategies
In this report we review memory-based meta-learning as a tool for buildi...

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
This paper introduces R2D3, an agent that makes efficient use of demonst...

RL Unplugged: Benchmarks for Offline Reinforcement Learning
Offline methods for reinforcement learning have the potential to help br...

Learning Compositional Neural Programs with Recursive Tree Search and Planning
We propose a novel reinforcement learning algorithm, AlphaNPI, that inco...

Modular Meta-Learning with Shrinkage
Most gradient-based approaches to meta-learning do not explicitly accoun...

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Humans are experts at high-fidelity imitation, closely mimicking a dem...

Playing hard exploration games by watching YouTube
Deep reinforcement learning methods traditionally struggle with tasks wh...

Sample Efficient Adaptive Text-to-Speech
We present a meta-learning approach for adaptive text-to-speech (TTS) wi...

The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
This paper introduces the Intentional Unintentional (IU) agent. This age...

Learning to learn by gradient descent by gradient descent
The move from hand-designed features to learned features in machine lear...

Neural Programmer-Interpreters
We propose the neural programmer-interpreter (NPI): a recurrent and comp...

ACDC: A Structured Efficient Linear Layer
The linear layer is one of the most pervasive modules in deep learning r...

Programmable Agents
We build deep RL agents that execute declarative programs expressed in f...

Learned Optimizers that Scale and Generalize
Learning to learn has emerged as an important direction for achieving ar...

Learning to Learn without Gradient Descent by Gradient Descent
We learn recurrent neural network optimizers trained on simple synthetic...

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks
We propose deep distributed recurrent Q-networks (DDRQN), which enable t...

Parallel Multiscale Autoregressive Density Estimation
PixelCNN achieves state-of-the-art results in density estimation for nat...

Learning to Perform Physics Experiments via Deep Reinforcement Learning
When encountering novel objects, humans are able to infer a wide range o...

Unbounded Bayesian Optimization via Regularization
Bayesian optimization has recently emerged as a popular and efficient to...

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (2012)
This is the Proceedings of the Twenty-Eighth Conference on Uncertainty i...

Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks
Particle filters (PFs) are powerful sampling-based inference/learning al...

Deep Fried Convnets
The fully connected layers of a deep convolutional neural network typica...

Deep Multi-Instance Transfer Learning
We present a new approach for transferring knowledge from groups to indi...

Heteroscedastic Treed Bayesian Optimisation
Optimising black-box functions is important in many disciplines, such as...

Nonparametric Bayesian Logic
The Bayesian Logic (BLOG) language was recently developed for defining f...

Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper-Parameters
Bayesian optimisation has gained great popularity as a tool for optimisi...

Intracluster Moves for Constrained Discrete-Space MCMC
This paper addresses the problem of sampling from binary distributions w...

Bayesian Multi-Scale Optimistic Optimization
Bayesian optimization is a powerful global optimization technique for ex...

Learning where to Attend with Deep Architectures for Image Tracking
We discuss an attentional model for simultaneous object tracking and rec...

Narrowing the Gap: Random Forests In Theory and In Practice
Despite widespread interest and practical use, the theoretical propertie...

Linear and Parallel Learning of Markov Random Fields
We introduce a new embarrassingly parallel parameter learning algorithm ...

Predicting Parameters in Deep Learning
We demonstrate that there is significant redundancy in the parameterizat...

Exploiting correlation and budget constraints in Bayesian multi-armed bandit optimization
We address the problem of finding the maximizer of a nonlinear smooth fu...

Consistency of Online Random Forests
As a testament to their success, the theory of random forests has long b...

Herded Gibbs Sampling
The Gibbs sampler is one of the most popular algorithms for inference in...

Reversible Jump MCMC Simulated Annealing for Neural Networks
We propose a novel reversible jump Markov chain Monte Carlo (MCMC) simul...

Variational MCMC
We propose a new class of learning algorithms that combines variational ...

Bayesian Optimization in a Billion Dimensions via Random Embeddings
Bayesian optimization techniques have been successfully applied to robot...

Toward Practical N^2 Monte Carlo: the Marginal Particle Filter
Sequential Monte Carlo techniques are useful for state estimation in non...

Learning about individuals from group statistics
We propose a new problem formulation which is similar to, but more infor...

Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations
This paper analyzes the problem of Gaussian process (GP) bandits with de...

New inference strategies for solving Markov Decision Processes using reversible jump MCMC
In this paper we build on previous work which uses inferences techniques...

Decentralized, Adaptive, Look-Ahead Particle Filtering
The decentralized particle filter (DPF) was proposed recently to increas...

Regret Bounds for Deterministic Gaussian Process Bandits
This paper analyses the problem of Gaussian process (GP) bandits with de...

Asymptotic Efficiency of Deterministic Estimators for Discrete Energy-Based Models: Ratio Matching and Pseudolikelihood
Standard maximum likelihood estimation cannot be applied to discrete ene...
Nando de Freitas
Nando de Freitas is a computer science professor at Oxford University and a Fellow of Linacre College, Oxford. He is known as an authority in machine learning, especially in the subfields of neural networks, Bayesian inference, Bayesian optimization, and deep learning.
De Freitas was born in Zimbabwe. He completed his bachelor's and MSc degrees at the University of the Witwatersrand and a PhD at Trinity College, Cambridge. From 2001 he was a professor at the University of British Columbia. In 2013 he joined the Department of Computer Science at Oxford University, and he has also worked for Google's DeepMind.
De Freitas has received the following awards for his contributions to machine learning:
Best Paper Award at the International Conference on Machine Learning
Best Paper Award at the International Conference on Learning Representations
Google Faculty Research Award
Distinguished Paper Award at the International Joint Conference on Artificial Intelligence
Charles A. McDowell Award for Excellence in Research
MITACS (Mathematics of Information Technology and Complex Systems) Young Researcher Award