
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
QualityDiversity (QD) is a concept from Neuroevolution with some intrig...
read it

OffPolicy Interval Estimation with Lipschitz Value Iteration
Offpolicy evaluation provides an essential tool for evaluating the effe...
read it

Learning Guidance Rewards with Trajectoryspace Smoothing
Longterm temporal credit assignment is an important challenge in deep r...
read it

Efficient Competitive SelfPlay Policy Optimization
Reinforcement learning from selfplay has recently reported many success...
read it

Pretraining of Graph Neural Network for Modeling Effects of Mutations on ProteinProtein Binding Affinity
Modeling the effects of mutations on the binding affinity plays a crucia...
read it

Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
In this paper, we propose an effective knowledge transfer framework to b...
read it

Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Langevin diffusion is a powerful method for nonconvex optimization, whic...
read it

Mutual Information Based Knowledge Transfer Under StateAction Dimension Mismatch
Deep reinforcement learning (RL) algorithms have achieved great success ...
read it

DASNet: Dual attentive fully convolutional siamese networks for change detection of high resolution satellite images
Change detection is a basic task of remote sensing image processing. The...
read it

Stein Variational Inference for Discrete Distributions
Gradientbased approximate inference methods, such as Stein variational ...
read it

Stateonly Imitation with Transition Dynamics Mismatch
Imitation Learning (IL) is a popular paradigm for training agents to ach...
read it

Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
In many visionbased reinforcement learning (RL) problems, the agent con...
read it

Convolution Neural Network Architecture Learning for Remote Sensing Scene Classification
Remote sensing image scene classification is a fundamental but challengi...
read it

DeepMask: an algorithm for cloud and cloud shadow detection in optical satellite remote sensing images using deep residual network
Detecting and masking cloud and cloud shadow from satellite remote sensi...
read it

HeteSpaceyWalk: A Heterogeneous Spacey Random Walk for Heterogeneous Information Network Embedding
Heterogeneous information network (HIN) embedding has gained increasing ...
read it

√(n)Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank
In this paper, we consider the problem of online learning of Markov deci...
read it

Characterizing Attacks on Deep Reinforcement Learning
Deep reinforcement learning (DRL) has achieved great success in various ...
read it

Learning Belief Representations for Imitation Learning in POMDPs
We consider the problem of imitation learning from expert demonstrations...
read it

Exploration via Hindsight Goal Generation
Goaloriented reinforcement learning has recently been a practical frame...
read it

A gradual, semidiscrete approach to generative network training via explicit wasserstein minimization
This paper provides a simple procedure to fit generative networks to tar...
read it

Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning
Recent advances in deep reinforcement learning algorithms have shown gre...
read it

Thresholding Bandit with Optimal Aggregate Regret
We consider the thresholding bandit problem, whose goal is to find arms ...
read it

Stochastic Variance Reduction for Deep Qlearning
Recent advances in deep reinforcement learning have achieved humanlevel...
read it

Knowledge Flow: Improve Upon Your Teachers
A zoo of deep nets is available these days for almost any given task, an...
read it

Overcoming Catastrophic Forgetting by Soft Parameter Pruning
Catastrophic forgetting is a challenge issue in continual learning when ...
read it

Anchor Box Optimization for Object Detection
In this paper, we propose a general approach to optimize anchor boxes fo...
read it

Understanding the Importance of Single Directions via Representative Substitution
Understanding the internal representations of deep neural networks (DNNs...
read it

emrQA: A Large Corpus for Question Answering on Electronic Medical Records
We propose a novel methodology to generate domainspecific largescale q...
read it

OffPolicy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy
When learning from a batch of logged bandit feedback, the discrepancy be...
read it

LargeMargin Classification in Hyperbolic Space
Representing data in hyperbolic space can effectively capture latent hie...
read it

Learning SelfImitating Diverse Policies
Deep reinforcement learning algorithms, including policy gradient method...
read it

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Many efforts have been made to facilitate natural language processing ta...
read it

Learning to Explore with MetaPolicy Gradient
The performance of offpolicy learning, including deep Qlearning and de...
read it

LowNorm Graph Embedding
Learning distributed representations for nodes in graphs has become an i...
read it

Genetic Policy Optimization
Genetic algorithms have been widely used in many practical optimization ...
read it

Sampleefficient Policy Optimization with Stein Control Variate
Policy gradient methods have achieved remarkable successes in solving ch...
read it

Efficient Localized Inference for Large Graphical Models
We propose a new localized inference algorithm for answering marginaliza...
read it

Stochastic Variance Reduction for Policy Gradient Estimation
Recent advances in policy gradient methods and deep learning have demons...
read it

Empower Sequence Labeling with TaskAware Neural Language Model
Linguistic sequence labeling is a general modeling approach that encompa...
read it

On the Selective and Invariant Representation of DCNN for HighResolution Remote Sensing Image Recognition
Human vision possesses strong invariance in image recognition. The cogni...
read it

What do We Learn by Semantic Scene Understanding for Remote Sensing imagery in CNN framework?
Recently, deep convolutional neural network (DCNN) achieved increasingly...
read it

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
We propose a novel training algorithm for reinforcement learning which c...
read it

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns
In the literature, two series of models have been proposed to address pr...
read it

Diffusion Component Analysis: Unraveling Functional Topology in Biological Networks
Complex biological systems have been successfully modeled by biochemical...
read it

Exact Hybrid Covariance Thresholding for Joint Graphical Lasso
This paper considers the problem of estimating multiple related Gaussian...
read it

Tightening Fractional Covering Upper Bounds on the Partition Function for HighOrder Region Graphs
In this paper we present a new approach for tightening upper bounds on t...
read it
Jian Peng
is this you? claim profile
Assistant Professor in the Department of Computer Science at the University of Illinois at UrbanaChampaign