
-
Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space
We consider off-policy evaluation (OPE) in continuous action domains, su...
read it
-
Knowledge-guided Open Attribute Value Extraction with Reinforcement Learning
Open attribute value extraction for emerging entities is an important bu...
read it
-
Statistical Inference for Online Decision Making via Stochastic Gradient Descent
Online decision making aims to learn the optimal decision rule by making...
read it
-
Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting
Online decision-making problem requires us to make a sequence of decisio...
read it
-
Multi-Objective Reinforcement Learning for Infectious Disease Control with Application to COVID-19 Spread
Severe infectious diseases such as the novel coronavirus (COVID-19) pose...
read it
-
Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health
In this article, we propose novel structural nested models to estimate c...
read it
-
Kernel Assisted Learning for Personalized Dose Finding
An individualized dose rule recommends a dose level within a continuous ...
read it
-
AdaptiveWeighted Attention Network with Camera Spectral Sensitivity Prior for Spectral Reconstruction from RGB Images
Recent promising effort for spectral reconstruction (SR) focuses on lear...
read it
-
A New Framework for Online Testing of Heterogeneous Treatment Effect
We propose a new framework for online testing of heterogeneous treatment...
read it
-
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
The Markov assumption (MA) is fundamental to the empirical validity of r...
read it
-
A Reinforcement Learning Framework for Time-Dependent Causal Effects Evaluation in A/B Testing
A/B testing, or online experiment is a standard business strategy to com...
read it
-
Doubly Robust Inference when Combining Probability and Non-probability Samples with High-dimensional Data
Non-probability samples become increasingly popular in survey statistics...
read it
-
The USTC-NEL Speech Translation system at IWSLT 2018
This paper describes the USTC-NEL system to the speech translation task ...
read it
-
Neural network-based arithmetic coding of intra prediction modes in HEVC
In both H.264 and HEVC, context-adaptive binary arithmetic coding (CABAC...
read it
-
A ParaBoost Stereoscopic Image Quality Assessment (PBSIQA) System
The problem of stereoscopic image quality assessment, which finds applic...
read it
-
Robust Learning for Optimal Treatment Decision with NP-Dimensionality
In order to identify important variables that are involved in making opt...
read it
-
Sure Screening for Gaussian Graphical Models
We propose graphical sure screening, or GRASS, a very simple and computa...
read it
-
Sequential Advantage Selection for Optimal Treatment Regimes
Variable selection for optimal treatment regime in a clinical trial or a...
read it
-
Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models
A variable screening procedure via correlation learning was proposed Fan...
read it