Denoising Diffusion models have shown remarkable performance in generati...
We present StyleBabel, a unique open access dataset of natural language
...
There has been a recent spike in interest in multi-modal Language and Vi...
There has been a recent spike in interest in multi-modal Language and Vi...
Most real world applications of image retrieval such as Adobe Stock, whi...
Text-visual (or called semantic-visual) embedding is a central problem i...
A drone monitoring system that integrates deep-learning-based detection ...