Team AcieLee: Technical Report for EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023

06/15/2023
by Yuqi Li, et al.

In this report, we describe the technical details of our submission to the EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023 by Team "AcieLee" (username: Yuqi_Li). The task is to classify audio produced by interactions between objects or by events involving the camera wearer. We conducted extensive experiments and found that learning rate step decay, freezing the backbone, label smoothing, and focal loss contributed most to the performance improvement. After training, we combined multiple models from different training stages into a single model by assigning fusion weights. The proposed method achieved 3rd place in the EPIC-SOUNDS Audio-Based Interaction Recognition Challenge at the CVPR 2023 workshop.
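
The abstract names two ingredients that are easy to illustrate in isolation: a focal loss combined with label smoothing, and fusion of several models by fixed weights. The following is a minimal NumPy sketch, not the team's implementation. The abstract does not say how the two losses are combined or at which level the models are fused, so the sketch assumes that the focal factor scales a label-smoothed cross-entropy and that fusion is a weighted average of per-model class probabilities. The function names `smoothed_focal_loss` and `fuse`, and the defaults `gamma=2.0` and `smoothing=0.1`, are hypothetical.

```python
import numpy as np


def softmax(logits):
    # Numerically stable softmax over the class axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def smoothed_focal_loss(logits, labels, gamma=2.0, smoothing=0.1):
    # Assumption: focal weighting applied to a label-smoothed cross-entropy.
    # logits: (N, C) float array, labels: (N,) integer class indices.
    n_classes = logits.shape[-1]
    probs = softmax(logits)
    # Smoothed targets: spread `smoothing` uniformly, put the rest on the label.
    targets = np.full(logits.shape, smoothing / n_classes)
    targets[np.arange(len(labels)), labels] += 1.0 - smoothing
    # The focal factor down-weights classes that are already well predicted.
    focal = (1.0 - probs) ** gamma
    loss = -(targets * focal * np.log(probs + 1e-12)).sum(axis=-1)
    return loss.mean()


def fuse(prob_list, weights):
    # Assumption: fusion is a weighted average of per-model probabilities.
    # prob_list: list of (N, C) arrays, weights: one non-negative number per model.
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()                          # normalise so rows still sum to 1
    stacked = np.stack(prob_list)            # shape (M, N, C)
    return np.tensordot(w, stacked, axes=1)  # shape (N, C)


if __name__ == "__main__":
    logits_a = np.array([[2.0, 0.5, -1.0], [0.1, 0.2, 3.0]])
    logits_b = np.array([[1.5, 1.0, -0.5], [0.0, 0.5, 2.0]])
    labels = np.array([0, 2])
    print(smoothed_focal_loss(logits_a, labels))
    fused = fuse([softmax(logits_a), softmax(logits_b)], weights=[0.7, 0.3])
    print(fused.argmax(axis=-1))
```

With `gamma=0.0` and `smoothing=0.0` the loss reduces to ordinary cross-entropy, which is a convenient sanity check. In practice the fusion weights would be chosen on a validation split; the values `0.7` and `0.3` above are placeholders.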


research
08/10/2021

Method Towards CVPR 2021 SimLocMatch Challenge

This report describes Megvii-3D team's approach towards SimLocMatch Chal...
research
08/10/2021

Method Towards CVPR 2021 Image Matching Challenge

This report describes Megvii-3D team's approach towards CVPR 2021 Image ...
research
11/16/2022

Exploring Detection-based Method For Speaker Diarization @ Ego4D Audio-only Diarization Challenge 2022

We provide the technical report for Ego4D audio-only diarization challen...
research
02/09/2022

CAU_KU team's submission to ADD 2022 Challenge task 1: Low-quality fake audio detection through frequency feature masking

This technical report describes Chung-Ang University and Korea Universit...
research
07/14/2023

AudioInceptionNeXt: TCL AI LAB Submission to EPIC-SOUND Audio-Based-Interaction-Recognition Challenge 2023

This report presents the technical details of our submission to the 2023...
research
10/25/2021

2nd Place Solution for SODA10M Challenge 2021 – Continual Detection Track

In this technical report, we present our approaches for the continual ob...
research
06/18/2023

STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization

This report introduces our novel method named STHG for the Audio-Visual ...
