Zero-shot referring image segmentation is a challenging task because it ...
Avoiding synthesizing specific visual concepts is an essential challenge...
3D photography renders a static image into a video with appealing 3D vis...
Without the demand of training in reality, humans can easily detect a kn...
Language guided image inpainting aims to fill in the defective regions o...
In a dialog system, dialog act recognition and sentiment classification ...
In dialog system, dialog act recognition and sentiment classification ar...
Multi-lingual contextualized embeddings, such as multilingual-BERT (mBER...
Spoken language understanding has been addressed as a supervised learnin...