Talking face generation, also known as speech-to-lip generation, reconst...
Audio and visual signals complement each other in human speech perceptio...
Most of the prior studies in the spatial DoA domain focus on a single
mo...
A hybrid map representation, which consists of a modified generalized Vo...
The success of Deep Neural Networks (DNNs) can be attributed to its deep...