Chain-of-Thought (CoT) plays a crucial role in reasoning for math proble...
Learning on Graphs has attracted immense attention due to its wide real-...
We investigate the sequential manipulation planning problem for unmanned...
Integrated visible light positioning and communication (VLPC), capable o...
Off-policy learning, referring to the procedure of policy optimization w...
Semantic communications, as one of the potential key technologies of the...
Diffusion models, as a novel generative paradigm, have achieved remarkab...
Accuracy and diversity have long been considered to be two conflicting g...
Foundation models or pre-trained models have substantially improved the
...
In this paper, we propose a semi-discrete first-order low regularity
exp...
A major goal of multimodal research is to improve machine understanding ...
Decisions in agriculture are increasingly data-driven; however, valuable...
Boolean query construction is often critical for medical systematic revi...
Vision language pre-training aims to learn alignments between vision and...
In this paper, we investigate the optimal probabilistic constellation sh...
Users of a recommender system may want part of their data being deleted,...
We propose covert beamforming design frameworks for integrated radar sen...
The human brain can be considered to be a graphical structure comprising...
Graphs can model real-world, complex systems by representing entities an...
Pseudo-Relevance Feedback (PRF) assumes that the top results retrieved b...
Current pre-trained language model approaches to information retrieval c...
Multimodal pre-training for audio-and-text has recently been proved to b...
In this paper we study how to effectively exploit implicit feedback in D...
Deep neural networks, empowered by pre-trained language models, have ach...
Modern recommender systems face an increasing need to explain their
reco...
The bound of the information transmission rate of direct current biased
...
To enhance research on multimodal knowledge base and multimodal informat...
Pseudo-Relevance Feedback (PRF) utilises the relevance signals from the ...
High-quality medical systematic reviews require comprehensive literature...
Most existing methods in vision language pre-training rely on object-cen...
In this paper, we investigate the performance of a practical aggregated
...
We present TWIST, a novel self-supervised representation learning method...
We present a unified computational theory of perception and memory. In o...
We study a new problem setting of information extraction (IE), referred ...
In this paper, we consider covert beamforming design for intelligent
ref...
Existing audio-language task-specific predictive approaches focus on bui...
This paper presents Self-correcting Encoding (Secoco), a framework that
...
Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of...
The quality of vocal delivery is one of the key indicators for evaluatin...
In this paper, we propose a simple yet effective solution to build pract...
In this work, we study computational approaches to detect online dialogi...
Visual Question Answering (VQA) is concerned with answering free-form
qu...
The amount of medical images for training deep classification models is
...
In this paper, we consider a common unicast beamforming network where Al...
While pretrained language models ("LM") have driven impressive gains ove...
Metasurfaces have provided a novel and promising platform for the realiz...
This paper is concerned with dialogue state tracking (DST) in a task-ori...
Speech emotion recognition is a challenging task because the emotion
exp...
Pre-trained language models such as BERT have exhibited remarkable
perfo...
Segmentation of objects of interest is one of the central tasks in medic...