Daisuke Niizumi

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Tuomas Virtanen
68 publications
Marc Delcroix
51 publications
Yuma Koizumi
34 publications
Keisuke Imoto
32 publications
Noboru Harada
29 publications
Tsubasa Ochiai
28 publications
Shoko Araki
24 publications
Heikki Huttunen
23 publications
Yohei Kawaguchi
20 publications
Kunio Kashino
20 publications
Akisato Kimura
18 publications

research

∙ 08/23/2023

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement

We proposed Audio Difference Captioning (ADC) as a new extension task of...

0 Daiki Takeuchi, et al. ∙

research

∙ 05/23/2023

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation

Self-supervised learning general-purpose audio representations have demo...

0 Daisuke Niizumi, et al. ∙

research

∙ 05/13/2023

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

We present the task description of the Detection and Classification of A...

0 Kota Dohi, et al. ∙

research

∙ 03/01/2023

First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline

This paper provides a baseline system for First-shot-compliant unsupervi...

0 Noboru Harada, et al. ∙

research

∙ 10/26/2022

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input

Masked Autoencoders is a simple yet powerful self-supervised learning me...

0 Daisuke Niizumi, et al. ∙

research

∙ 07/25/2022

ConceptBeam: Concept Driven Target Speech Extraction

We propose a novel framework for target speech extraction based on seman...

0 Yasunori Ohishi, et al. ∙

research

∙ 07/20/2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval

The amount of audio data available on public websites is growing rapidly...

0 Daiki Takeuchi, et al. ∙

research

∙ 06/13/2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques

We present the task description of the Detection and Classification of A...

10 Kota Dohi, et al. ∙

research

∙ 05/17/2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model

Many application studies rely on audio DNN models pre-trained on a large...

0 Daisuke Niizumi, et al. ∙

research

∙ 04/26/2022

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation

Recent general-purpose audio representations show state-of-the-art perfo...

0 Daisuke Niizumi, et al. ∙

research

∙ 04/15/2022

BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations

Pre-trained models are essential as feature extractors in modern machine...

0 Daisuke Niizumi, et al. ∙

research

∙ 06/08/2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions

We present the task description and discussion on the results of the DCA...

1 Yohei Kawaguchi, et al. ∙

research

∙ 03/11/2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Inspired by the recent progress in self-supervised learning for computer...

0 Daisuke Niizumi, et al. ∙

research

∙ 12/14/2020

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval

The goal of audio captioning is to translate input audio into its descri...

0 Yuma Koizumi, et al. ∙

research

∙ 08/02/2018

Acoustic Scene Classification: A Competition Review

In this paper we study the problem of acoustic scene classification, i.e...

0 Shayan Gharib, et al. ∙

Success!

An error occurred

Daisuke Niizumi

Featured Co-authors

Sign in with Google

Consider DeepAI Pro