-
MTM Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio?
We introduce the music Ternary Modalities Dataset (MTM Dataset), which i...
read it
-
Unsupervised Generative Adversarial Alignment Representation for Sheet music, Audio and Lyrics
Sheet music, audio, and lyrics are three main modalities during writing ...
read it
-
Learning Joint Embedding for Cross-Modal Retrieval
A cross-modal retrieval process is to use a query in one modality to obt...
read it
-
Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA
Deep learning has successfully shown excellent performance in learning j...
read it
-
Personalized Music Recommendation with Triplet Network
Since many online music services emerged in recent years so that effecti...
read it
-
Deep Triplet Neural Networks with Cluster-CCA for Audio-Visual Cross-modal Retrieval
Cross-modal retrieval aims to retrieve data in one modality by a query i...
read it
-
Deep Learning of Human Perception in Audio Event Classification
In this paper, we introduce our recent studies on human perception in au...
read it

Donghuo Zeng
is this you? claim profile