
Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting
We consider off-policy evaluation in the contextual bandit setting for t...

Non-Stationary Bandits with Intermediate Observations
Online recommender systems often face long delays in receiving feedback,...

Minimal Assumptions Refinement for GR(1) Specifications
Reactive synthesis is concerned with finding a correct-by-construction c...

Meta-learning of Sequential Strategies
In this report we review memory-based meta-learning as a tool for buildi...

Detecting Overfitting via Adversarial Examples
The repeated reuse of test sets in popular benchmark problems raises dou...

Communication without Interception: Defense against Deep-Learning-based Modulation Detection
We consider a communication scenario, in which an intruder, employing a ...

Reinforcement Learning to Minimize Age of Information with an Energy Harvesting Sensor with HARQ and Sensing Cost
The time average expected age of information (AoI) is studied for status...

Learning from Delayed Outcomes with Intermediate Observations
Optimizing for long term value is desirable in many practical applicatio...

LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
We consider the problem of configuring general-purpose solvers to run ef...

Adaptive MCMC via Combining Local Samplers
Markov chain Monte Carlo (MCMC) methods are widely used in machine learn...

A Reinforcement Learning Approach to Age of Information in Multi-User Networks
Scheduling the transmission of time-sensitive data to multiple users ove...

A Weakness Measure for GR(1) Formulae
In spite of the theoretical and algorithmic developments for system synt...

Detection of Adversarial Training Examples in Poisoning Attacks through Anomaly Detection
Machine learning has become an important component for many systems and ...

A Reinforcement-Learning Approach to Proactive Caching in Wireless Networks
We consider a mobile user accessing contents in a dynamic environment, w...

A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds
Recently, much work has been done on extending the scope of online learn...

(Bandit) Convex Optimization with Biased Noisy Gradient Oracles
Algorithms for bandit convex optimization and online learning often rely...

Chaining Bounds for Empirical Risk Minimization
This paper extends the standard chaining technique to prove excess risk ...

Online Learning with Gaussian Payoffs and Side Observations
We consider a sequential learning problem with Gaussian payoffs and side...

Fast Cross-Validation for Incremental Learning
Cross-validation (CV) is one of the main tools for performance estimatio...

Adaptive Monte Carlo via Bandit Allocation
We consider the problem of sequentially choosing between a set of unbias...

Efficient Multi-Start Strategies for Local Search Algorithms
Local search algorithms applied to optimization problems often suffer fr...

Online Learning under Delayed Feedback
Online learning with delayed feedback has received increasing attention ...

Partition Tree Weighting
This paper introduces the Partition Tree Weighting technique, an efficie...

A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning
We consider the problem of simultaneously learning to linearly combine a...
András György
Ph.D. degree in technical informatics from the Budapest University of Technology and Economics in 2003; Visiting Research Scholar in the Department of Electrical and Computer Engineering, University of California, San Diego, USA, in spring 1998; Computer and Automation Research Institute of the Hungarian Academy of Sciences, 2002-2011, Senior Researcher and Head of the Machine Learning Research Group in 2006; NATO Science Fellow in the Department of Mathematics and Statistics, Queen's University, 2003-2004; part-time research position at GusGus Capital Llc., Budapest, Hungary, 2006-2011; researcher in the Department of Computing Science, University of Alberta, Edmonton, AB, Canada, 2012-2015; Senior Lecturer in the Department of Electrical and Electronic Engineering of Imperial College London since 2006.