Multi-Scale Temporal Convolution Network for Classroom Voice Detection

05/31/2021
by Lu Ma, et al.

Teaching through the cooperation of an expert teacher and an assistant teacher, the so-called "double-teachers classroom", is becoming more prevalent in K-12 education: the course is given by the expert online and presented on a projection screen in the classroom, while the teacher in the classroom acts as an assistant who guides the students' learning. To monitor teaching quality, a microphone clipped to the assistant's neckline is typically used to record the voice, which is then fed to the downstream tasks of automatic speech recognition (ASR) and natural language processing (NLP). However, besides the assistant's voice, the recording contains interfering voices, including those of the expert and the students. Here, we propose to extract the assistant's voice from the perspective of sound event detection, i.e., the audio is classified into four categories, namely the expert, the teacher, the mixture of the two, and the background. To make frame-level identification, which is important for capturing sensitive words for the downstream tasks, a multi-scale temporal convolution neural network is constructed with stacked dilated convolutions to capture both local and global properties. These features are concatenated and fed to a classification network consisting of three linear layers. The framework is evaluated on simulated data and real-world recordings, performing well in terms of precision and recall compared with some classical classification methods.
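The idea of stacking dilated convolutions and concatenating their outputs per frame can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the kernel size of 3, the dilations (1, 2, 4), the zero "same" padding, the single-channel input, and the function names are all assumptions made for illustration, and the three-layer linear classifier is omitted.

```python
from typing import List, Sequence


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of input frames one output frame can see after stacking the layers."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def dilated_conv1d(x: Sequence[float], kernel: Sequence[float], dilation: int) -> List[float]:
    """Length-preserving 1-D dilated convolution with zero padding (odd kernel size)."""
    half = (len(kernel) - 1) // 2
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t + (j - half) * dilation  # taps are spaced `dilation` frames apart
            if 0 <= idx < len(x):            # out-of-range taps read zero padding
                acc += w * x[idx]
        out.append(acc)
    return out


def multi_scale_features(
    x: Sequence[float], kernel: Sequence[float], dilations: Sequence[int]
) -> List[List[float]]:
    """Stack dilated layers, keep each layer's output, and concatenate them per frame."""
    per_scale = []
    h = list(x)
    for d in dilations:
        h = dilated_conv1d(h, kernel, d)  # each layer consumes the previous layer's output
        per_scale.append(h)
    # One feature vector per frame, with one entry per scale (assumed layout).
    return [[scale[t] for scale in per_scale] for t in range(len(x))]


# Hypothetical input: a single impulse in the middle of 15 frames.
frames = [0.0] * 7 + [1.0] + [0.0] * 7
features = multi_scale_features(frames, kernel=[1.0, 1.0, 1.0], dilations=[1, 2, 4])
```

With these assumed settings, the shallowest layer responds only to frames adjacent to the impulse (local context), while the deepest layer responds across all 15 frames (global context), so each frame's concatenated vector carries both views before classification.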


