In the field of autonomous driving, accurate and comprehensive perceptio...
Early detection of dysplasia of the cervix is critical for cervical canc...
Street-view imagery provides us with novel experiences to explore differ...
Contrastive self-supervised learning has attracted significant research
...
Motion prediction is a classic problem in computer vision, which aims at...
Sensors are the key to sensing the environment and imparting benefits to...
Indoor localization is a fundamental problem in location-based applicati...
Speech as a natural signal is composed of three parts - visemes (visual ...
There are a wide range of applications that involve multi-modal data, su...
Lipreading has a lot of potential applications such as in the domain of
...
Multiple modalities represent different aspects by which information is
...