Emmanouil Benetos

research

∙ 09/15/2023

MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response

Large Language Models (LLMs) have shown immense potential in multimodal ...

Zihao Deng, et al.

research

∙ 07/19/2023

From West to East: Who can understand the music of the others better?

Recent developments in MIR have led to several benchmark deep learning m...

Charilaos Papaioannou, et al.

research

∙ 07/11/2023

On the Effectiveness of Speech Self-supervised Learning for Music

Self-supervised learning (SSL) has shown promising results in various sp...

Yinghao Ma, et al.

research

∙ 06/29/2023

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

We introduce LyricWhiz, a robust, multilingual, and zero-shot automatic ...

Le Zhuo, et al.

research

∙ 06/18/2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

In the era of extensive intersection between art and Artificial Intellig...

Ruibin Yuan, et al.

research

∙ 05/31/2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

Self-supervised learning (SSL) has recently emerged as a promising parad...

Yizhi Li, et al.

research

∙ 12/05/2022

MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning

The deep learning community has witnessed an exponentially growing inter...

Yizhi Li, et al.

research

∙ 10/27/2022

Learning Music Representations with wav2vec 2.0

Learning music representations that are general-purpose offers the flexi...

Alessandro Ragano, et al.

research

∙ 08/25/2022

Contrastive Audio-Language Learning for Music

As one of the most intuitive interfaces known to humans, natural languag...

Ilaria Manco, et al.

research

∙ 07/15/2022

Anomalous behaviour in loss-gradient based interpretability methods

Loss-gradients are used to interpret the decision making process of deep...

Vinod Subramanian, et al.

research

∙ 04/10/2022

Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation

Imitating musical instruments with the human voice is an efficient way o...

Alejandro Delgado, et al.

research

∙ 04/08/2022

Exploring Transformer's potential on automatic piano transcription

Most recent research about automatic music transcription (AMT) uses conv...

Longshen Ou, et al.

research

∙ 02/03/2022

Improving Lyrics Alignment through Joint Pitch Detection

In recent years, the accuracy of automatic lyrics alignment methods has ...

Jiawen Huang, et al.

research

∙ 12/08/2021

Learning music audio representations via weak language supervision

Audio representations for music information retrieval are typically lear...

Ilaria Manco, et al.

research

∙ 10/09/2021

An evaluation of data augmentation methods for sound scene geotagging

Sound scene geotagging is a new topic of research which has evolved from...

Helen L Bear, et al.

research

∙ 10/08/2021

Joint Scattering for Automatic Chick Call Recognition

Animal vocalisations contain important information about health, emotion...

Changhong Wang, et al.

research

∙ 08/19/2021

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations

Non-intrusive speech quality assessment is a crucial operation in multim...

Alessandro Ragano, et al.

research

∙ 07/28/2021

Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes

This paper proposes a deep convolutional neural network for performing n...

Carlos Lordelo, et al.

research

∙ 04/24/2021

MusCaps: Generating Captions for Music Audio

Content-based music information retrieval has seen rapid progress with t...

Ilaria Manco, et al.

research

∙ 04/14/2021

Revisiting the Onsets and Frames Model with Additive Attention

Recent advances in automatic music transcription (AMT) have achieved hig...

Kin Wai Cheuk, et al.

research

∙ 01/03/2021

Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation

This paper addresses the problem of domain adaptation for the task of mu...

Carlos Lordelo, et al.

research

∙ 10/20/2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

Most of the state-of-the-art automatic music transcription (AMT) models ...

Kin Wai Cheuk, et al.

research

∙ 10/15/2020

Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark

The Automatic Speaker Verification Spoofing and Countermeasures Challeng...

Bhusan Chettri, et al.

research

∙ 05/15/2020

Reliable Local Explanations for Machine Listening

One way to analyse the behaviour of machine learning models is through l...

Saumitra Mishra, et al.

research

∙ 05/13/2020

Memory Controlled Sequential Self Attention for Sound Recognition

In this paper we investigate the importance of the extent of memory in s...

Arjun Pankajakshan, et al.

research

∙ 04/15/2020

Musical Features for Automatic Music Transcription Evaluation

This technical report gives a detailed, formal description of the featur...

Adrien Ycart, et al.

research

∙ 03/22/2020

Audio Impairment Recognition Using a Correlation-Based Feature Representation

Audio impairment recognition is based on finding noise in audio files an...

Alessandro Ragano, et al.

research

∙ 10/22/2019

Modeling plate and spring reverberation using a DSP-informed deep neural network

Plate and spring reverberators are electromechanical systems first used ...

Marco A. Martínez Ramírez, et al.

research

∙ 07/11/2019

Polyphonic Sound Event and Sound Activity Detection: A Multi-task approach

Polyphonic Sound Event Detection (SED) in real-world recordings is a cha...

Arjun Pankajakshan, et al.

research

∙ 07/04/2019

Adversarial Attacks in Sound Event Classification

Adversarial attacks refer to a set of methods that perturb the input to ...

Vinod Subramanian, et al.

research

∙ 05/15/2019

A general-purpose deep learning approach to model time-varying audio effects

Audio processors whose parameters are modified periodically over time ar...

Marco A. Martínez Ramírez, et al.

research

∙ 05/06/2019

Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation

In this paper we propose an efficient deep learning encoder-decoder netw...

Carlos Lordelo, et al.

research

∙ 05/02/2019

City classification from multiple real-world sound scenes

The majority of sound scene analysis work focuses on one of two clearly ...

Helen L Bear, et al.

research

∙ 04/23/2019

Towards joint sound scene and polyphonic sound event recognition

Acoustic Scene Classification (ASC) and Sound Event Detection (SED) are ...

Helen L Bear, et al.

research

∙ 04/21/2019

GAN-based Generation and Automatic Selection of Explanations for Neural Networks

One way to interpret trained deep neural networks (DNNs) is by inspectin...

Saumitra Mishra, et al.

research

∙ 04/09/2019

Ensemble Models for Spoofing Detection in Automatic Speaker Verification

Detecting spoofing attempts of automatic speaker verification (ASV) syst...

Bhusan Chettri, et al.

research

∙ 11/15/2018

Audio-based identification of beehive states

The absence of the queen in a beehive is a very strong indicator of the ...

Inês Nolasco, et al.

research

∙ 11/14/2018

To bee or not to bee: Investigating machine learning approaches for beehive sound recognition

In this work, we aim to explore the potential of machine learning method...

Inês Nolasco, et al.

research

∙ 10/30/2018

SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification

Acoustic Scene Classification (ASC) is one of the core research problems...

Sai Samarth R Phaye, et al.

research

∙ 09/26/2018

An extensible cluster-graph taxonomy for open set sound scene analysis

We present a new extensible and divisible taxonomy for open set sound sc...

Helen L Bear, et al.

research

∙ 05/22/2018

A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing

The second Automatic Speaker Verification Spoofing and Countermeasures c...

Bhusan Chettri, et al.

research

∙ 04/30/2018

Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting

In this paper, we show empirical evidence on how to construct the optima...

Eurico Covas, et al.

research

∙ 11/15/2017

Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results

As part of the 2016 public evaluation challenge on Detection and Classif...

Grégoire Lafay, et al.

research

∙ 08/07/2015

An End-to-End Neural Network for Polyphonic Piano Music Transcription

We present a supervised neural network model for polyphonic piano music ...

Siddharth Sigtia, et al.

research

∙ 01/31/2015

An evaluation framework for event detection using a morphological model of acoustic scenes

This paper introduces a model of environmental acoustic scenes which ado...

Mathieu Lagrange, et al.

