Large Vision-Language Models (LVLMs) such as MiniGPT-4 and LLaVA have
de...
Deep Neural Networks (DNNs) for 3D point cloud recognition are vulnerabl...
The growing adoption of voice-enabled devices (e.g., smart speakers),
pa...
Eavesdropping from the user's smartphone is a well-known threat to the u...
Faced with the threat of identity leakage during voice data publishing, ...
Deep neural networks (DNNs) have been widely adopted in brain lesion
det...
Self-supervised learning (SSL) holds promise in leveraging large amounts...
Recently, the vulnerability of DNN-based audio systems to adversarial at...
As the popularity of voice user interface (VUI) exploded in recent years...
In this paper, we build a speech privacy attack that exploits speech
rev...
Scene text recognition has received increased attention in the research
...
Datasets drive vision progress and autonomous driving is a critical visi...
Reading text in the wild is a challenging task in the field of computer
...
Image matting plays an important role in image and video editing. Howeve...
Foreground segmentation in video sequences is a classic topic in compute...