Bin Sun

research

∙ 05/09/2023

Large Language Models Need Holistically Thought in Medical Conversational QA

The medical conversational question answering (CQA) system aims at provi...

0 Yixuan Weng, et al. ∙

research

∙ 05/05/2023

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

Previous methods for dynamic facial expression recognition (DFER) in the...

0 Fuyan Ma, et al. ∙

research

∙ 03/21/2023

Heterogeneous-Branch Collaborative Learning for Dialogue Generation

With the development of deep learning, advanced dialogue generation meth...

0 Yiwei Li, et al. ∙

research

∙ 03/02/2023

Image as Set of Points

What is an image and how to extract latent features? Convolutional Netwo...

0 Xu Ma, et al. ∙

research

∙ 12/02/2022

Towards Diverse, Relevant and Coherent Open-Domain Dialogue Generation via Hybrid Latent Variables

Conditional variational models, using either continuous or discrete late...

0 Bin Sun, et al. ∙

research

∙ 12/01/2022

Modeling Complex Dialogue Mappings via Sentence Semantic Segmentation Guided Conditional Variational Auto-Encoder

Complex dialogue mappings (CDM), including one-to-many and many-to-one m...

0 Bin Sun, et al. ∙

research

∙ 10/11/2022

Learning to Locate Visual Answer in Video Corpus Using Question

We introduce a new task, named video corpus visual answer localization (...

0 Bin Li, et al. ∙

research

∙ 07/26/2022

TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking

Multiple object tracking (MOT) is the task containing detection and asso...

0 Bin Sun, et al. ∙

research

∙ 07/05/2022

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation

This paper introduces the schemes of Team LingJing's experiments in NLPC...

0 Bin Li, et al. ∙

research

∙ 06/20/2022

Explicit and implicit models in infrared and visible image fusion

Infrared and visible images, as multi-modal image pairs, show significan...

0 Zixuan Wang, et al. ∙

research

∙ 06/09/2022

Towards Layer-wise Image Vectorization

Image rasterization is a mature technique in computer graphics, while im...

64 Xu Ma, et al. ∙

research

∙ 05/23/2022

Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning

There is a growing interest in improving the conversational ability of m...

0 Yiwei Li, et al. ∙

research

∙ 05/10/2022

Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild

Previous methods for dynamic facial expression in the wild are mainly ba...

0 Fuyan Ma, et al. ∙

research

∙ 05/05/2022

Diversifying Neural Dialogue Generation via Negative Distillation

Generative dialogue models suffer badly from the generic response proble...

0 Yiwei Li, et al. ∙

research

∙ 04/20/2022

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs

The medical conversational system can relieve the burden of doctors and ...

0 Fei Xia, et al. ∙

research

∙ 03/23/2022

Prompt-based Pre-trained Model for Personality and Interpersonal Reactivity Prediction

This paper describes the LingJing team's method to the Workshop on Compu...

0 Bin Li, et al. ∙

research

∙ 03/16/2022

Hybrid Pixel-Unshuffled Network for Lightweight Image Super-Resolution

Convolutional neural network (CNN) has achieved great success on image s...

0 Bin Sun, et al. ∙

research

∙ 03/13/2022

Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video

The temporal answering grounding in the video (TAGV) is a new task natur...

0 Bin Li, et al. ∙

research

∙ 11/29/2021

SimCLAD: A Simple Framework for Contrastive Learning of Acronym Disambiguation

Acronym disambiguation means finding the correct meaning of an ambiguous...

0 Bin Li, et al. ∙

research

∙ 11/29/2021

PSG: Prompt-based Sequence Generation for Acronym Extraction

Acronym extraction aims to find acronyms (i.e., short-forms) and their m...

0 Bin Li, et al. ∙

research

∙ 10/16/2021

Hybrid Mutimodal Fusion for Dimensional Emotion Recognition

In this paper, we extensively present our solutions for the MuSe-Stress ...

0 Ziyu Ma, et al. ∙

research

∙ 10/12/2021

Sign Language Recognition via Skeleton-Aware Multi-Model Ensemble

Sign language is commonly used by deaf or mute people to communicate but...

0 Songyao Jiang, et al. ∙

research

∙ 09/07/2021

Grassmannian Graph-attentional Landmark Selection for Domain Adaptation

Domain adaptation aims to leverage information from the source domain to...

0 Bin Sun, et al. ∙

research

∙ 08/03/2021

More but Correct: Generating Diversified and Entity-revised Medical Response

Medical Dialogue Generation (MDG) is intended to build a medical dialogu...

0 Bin Li, et al. ∙

research

∙ 06/15/2021

Bilateral Personalized Dialogue Generation with Dynamic Persona-Aware Fusion

Generating personalized responses is one of the major challenges in natu...

0 Bin Li, et al. ∙

research

∙ 06/07/2021

Generating Relevant and Coherent Dialogue Responses using Self-separated Conditional Variational AutoEncoders

Conditional Variational AutoEncoder (CVAE) effectively increases the div...

0 Bin Sun, et al. ∙

research

∙ 05/28/2021

THINK: A Novel Conversation Model for Generating Grammatically Correct and Coherent Responses

Many existing conversation models that are based on the encoder-decoder ...

0 Bin Sun, et al. ∙

research

∙ 05/25/2021

GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot Action Recognition

Zero-shot action recognition can recognize samples of unseen classes tha...

0 Bin Sun, et al. ∙

research

∙ 05/24/2021

Real-time Human Action Recognition Using Locally Aggregated Kinematic-Guided Skeletonlet and Supervised Hashing-by-Analysis Model

3D action recognition is referred to as the classification of action seq...

0 Bin Sun, et al. ∙

research

∙ 03/31/2021

Robust Facial Expression Recognition with Convolutional Visual Transformers

Facial Expression Recognition (FER) in the wild is extremely challenging...

0 Fuyan Ma, et al. ∙

research

∙ 03/16/2021

Skeleton Based Sign Language Recognition Using Whole-body Keypoints

Sign language is a visual language that is used by deaf or speech impair...

0 Songyao Jiang, et al. ∙

research

∙ 10/05/2020

Regularizing Dialogue Generation by Imitating Implicit Scenarios

Human dialogues are scenario-based and appropriate responses generally r...

0 Shaoxiong Feng, et al. ∙

research

∙ 08/08/2020

Recent Advances and New Guidelines on Hyperspectral and Multispectral Image Fusion

Hyperspectral image (HSI) with high spectral resolution often suffers fr...

0 Renwei Dian, et al. ∙

research

∙ 12/23/2019

Fully Automated Multi-Organ Segmentation in Abdominal Magnetic Resonance Imaging with Deep Neural Networks

Segmentation of multiple organs-at-risk (OARs) is essential for radiatio...

66 Yuhua Chen, et al. ∙

research

∙ 11/19/2019

FollowMeUp Sports: New Benchmark for 2D Human Keypoint Recognition

Human pose estimation has made significant advancement in recent years. ...

0 Ying Huang, et al. ∙

research

∙ 10/25/2019

LPRNet: Lightweight Deep Network by Low-rank Pointwise Residual Convolution

Deep learning has become popular in recent years primarily due to the po...

7 Bin Sun, et al. ∙

research

∙ 10/25/2019

Real-time Memory Efficient Large-pose Face Alignment via Deep Evolutionary Network

There is an urgent need to apply face alignment in a memory-efficient an...

11 Bin Sun, et al. ∙

research

∙ 04/20/2019

EV-Action: Electromyography-Vision Multi-Modal Action Dataset

Multi-modal human motion analysis is a critical and attractive research ...

0 Lichen Wang, et al. ∙

research

∙ 04/12/2019

GeoCapsNet: Aerial to Ground view Image Geo-localization using Capsule Network

The task of cross-view image geo-localization aims to determine the geo-...

0 Bin Sun, et al. ∙

Bin Sun

Featured Co-authors

Sign in with Google

Consider DeepAI Pro