In a Human-in-the-Loop paradigm, a robotic agent is able to act mostly
a...
Household robots operate in the same space for years. Such robots
increm...
Household environments are visually diverse. Embodied agents performing
...
In Video Question Answering (VideoQA), answering general questions about...
Physically rearranging objects is an important capability for embodied
a...
Robots operating in human spaces must be able to engage in natural langu...
Shopping is difficult for people with motor impairments. This includes o...
As any savvy online shopper knows, second-hand peer-to-peer marketplaces...
While lots of people may think branding begins and ends with a logo, fas...
In this paper, we introduce an attribute-based interactive image search ...
Fine-grained image search is still a challenging problem due to the
diff...
Understanding clothes from a single image has strong commercial and cult...
This paper presents an approach for grounding phrases in images which jo...
In this work, we propose an efficient and effective approach for
unconst...
In this paper, we propose a novel end-to-end approach for scalable visua...
Discovering visual knowledge from weakly labeled data is crucial to scal...
In this work, we propose and address a new computer vision task, which w...
Text is ubiquitous in the artificial world and easily attainable when it...
Recent advances in consumer depth sensors have created many opportunitie...
In image classification, visual separability between different object
ca...
We describe a completely automated large scale visual recommendation sys...