-
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
Despite the evolution of deep-learning-based visual-textual processing s...
-
Transformer Reasoning Network for Image-Text Matching and Retrieval
Image-text matching is an interesting and fascinating task in modern AI ...
-
Virtual to Real adaptation of Pedestrian Detectors for Smart Cities
Pedestrian detection through computer vision is a building block for a m...

Nicola Messina