Training on web-scale data can take months. But most computation and tim...
We introduce Goldilocks Selection, a technique for faster model training...
We study the problem of learning conditional average treatment effects (...
There remains much uncertainty about the relative effectiveness of diffe...
Recommending the best course of action for an individual is a major appl...
Reward design, the problem of selecting an appropriate reward function f...
Inverse reinforcement learning (IRL) attempts to infer human rewards or ...