-
Learning the Best Pooling Strategy for Visual Semantic Embedding
Visual Semantic Embedding (VSE) is a dominant approach for vision-langua...
read it
-
BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps
Learning to follow instructions is of fundamental importance to autonomo...
read it
-
Floor-SP: Inverse CAD for Floorplans by Sequential Room-wise Shortest Path
This paper proposes a new approach for automated floorplan reconstructio...
read it
-
Learning to Forecast Videos of Human Activity with Multi-granularity Models and Adaptive Rendering
We propose an approach for forecasting video of complex human activity i...
read it

Jiacheng Chen
is this you? claim profile