Audio visual segmentation (AVS) aims to segment the sounding objects for...
Monocular 3D lane detection is a challenging task due to its lack of dep...
Video language pre-training methods have mainly adopted sparse sampling ...
Contrastive learning has been extensively studied in sentence embedding ...
Referring video object segmentation aims to predict foreground labels fo...
Contrastive learning has been attracting much attention for learning uns...
Contrastive learning has been gradually applied to learn high-quality un...
Language-queried video actor segmentation aims to predict the pixel-leve...
Face reenactment is a challenging task, as it is difficult to maintain a...
Learning to capture dependencies between spatial positions is essential ...
The dissemination of fake news significantly affects personal reputation...
Referring image segmentation aims to predict the foreground mask of the ...
Referring image segmentation aims at segmenting the foreground masks of ...
Document-level Sentiment Analysis (DSA) is more challenging due to vague...
While several state-of-the-art approaches to dialogue state tracking (DS...
Automatically labeling multiple styles for every song is a comprehensive...
The development of social media has revolutionized the way people commun...
Opinion spam has become a widespread problem in social media, where hire...
This paper focuses on the task of generating long structured sentences w...
This paper focuses on the task of sentiment transfer on non-parallel tex...
Imbalanced data commonly exists in the real world, especially in sentiment-r...
Typically, an ultra-deep neural network (UDNN) tends to yield high-quality m...
We propose a novel data augmentation method for labeled sentences called...
Human parsing has been extensively studied recently due to its wide appl...