Extracting expressive visual features is crucial for accurate
Click-Thro...
Multimodal supervision has achieved promising results in many visual lan...
Transfer learning with pre-training on large-scale datasets has played a...
Pedestrian detection in crowd scenes poses a challenging problem due to ...
Training with more data has always been the most stable and effective wa...
This report details our solution to the Google AI Open Images Challenge ...
This report demonstrates our solution for the Open Images 2018 Challenge...
Recent studies have shown that aggregating convolutional features of a
p...