Current scene graph datasets suffer from strong long-tail distributions ...
Challenges drive the state-of-the-art of automated medical image analysi...
Pseudo depth maps are depth map predicitions which are used as ground tr...
Performance analyses based on videos are commonly used by coaches of ath...
Analyses based on the body posture are crucial for top-class athletes in...
Semi-supervised learning (SSL) can reduce the need for large labelled
da...
The state-of-the-art for monocular 3D human pose estimation in videos is...
Since COVID strongly affects the respiratory system, lung CT scans can b...
Nearly all Human Pose Estimation (HPE) datasets consist of a fixed set o...
The Multimedia and Computer Vision Lab of the University of Augsburg
par...
Video-to-Text (VTT) is the task of automatically generating descriptions...
Unsupervised Domain Adaptation (UDA) aims to adapt models trained on a s...
The current state-of-the-art in monocular 3D human pose estimation is he...
In this paper we address the problem of motion event detection in athlet...
Automatic medical report generation from chest X-ray images is one
possi...
Automatically generating descriptive captions for images is a well-resea...
Human pose detection systems based on state-of-the-art DNNs are on the g...
Automatically captioning images with natural language sentences is an
im...
In this paper we consider the problem of human pose estimation in real-w...
Current top performing object recognition systems build on object propos...
In this paper we study the problem of estimating innercyclic time interv...