Simple regret minimization is a critical problem in learning optimal
tre...
We consider estimation and inference with data collected from episodic
r...
The wide popularity of short videos on social media poses new opportunit...
Watch-time prediction remains to be a key factor in reinforcing user
eng...
Long-term engagement is preferred over immediate engagement in sequentia...
The wide popularity of short videos on social media poses new opportunit...
It has become increasingly common for data to be collected adaptively, f...
Learning optimal policies from historical data enables the gains from
pe...
Watermarking is the process of embedding information into an image that ...
Adaptive experiments can result in considerable cost savings in multi-ar...