Augmented Reality (AR) has been used to facilitate surgical guidance dur...
Large-scale vision-language pre-training has shown impressive advances i...
Understanding human emotions is a crucial ability for intelligent robots...
Temporal grounding in videos aims to localize one target video segment t...
Purpose: Image guidance is crucial for the success of many interventions...