Masking Kernel for Learning Energy-Efficient Speech Representation

02/08/2023
by Apiwat Ditthapron, et al.

Modern smartphones are equipped with powerful audio hardware and processors, allowing them to acquire speech and process it on-device at high sampling rates. However, energy consumption remains a concern, especially for resource-intensive DNNs. Prior work on mobile speech processing reduced computational complexity by compacting the model or by reducing input dimensions via hyperparameter tuning, which lowered accuracy or required more training iterations. This paper proposes using gradient descent to optimize an energy-efficient speech recording format (length and sampling rate). The goal is to reduce the input size, which in turn reduces the energy spent on data collection and inference. To make the recording format learnable in the backward pass, a masking function with non-zero derivatives (Gaussian, Hann, and Hamming) is used as a windowing function and as a lowpass filter. An energy-efficiency penalty is introduced to incentivize reducing the input size. The proposed masking outperformed baselines by 8.7% in speaker recognition and traumatic brain injury detection while using a 49% shorter duration, sampled at a lower frequency.
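The idea of a mask with non-zero derivatives can be illustrated with a small sketch. It is not the paper's implementation: the function names, the penalty form (mask mass times a weight), and all constants are hypothetical, and only the time-domain (windowing) use of a Gaussian mask is shown. The width `sigma` receives a non-zero gradient at every sample, so a length penalty can shrink the window by gradient descent.

```python
import math

def gaussian_mask(n_samples, sigma):
    """Gaussian window centred on the recording; sigma is measured in samples."""
    centre = (n_samples - 1) / 2.0
    return [math.exp(-0.5 * ((t - centre) / sigma) ** 2) for t in range(n_samples)]

def d_mask_d_sigma(n_samples, sigma):
    """Analytic derivative of each mask value with respect to sigma."""
    centre = (n_samples - 1) / 2.0
    mask = gaussian_mask(n_samples, sigma)
    return [m * (t - centre) ** 2 / sigma ** 3 for t, m in enumerate(mask)]

def length_penalty(n_samples, sigma, weight=0.01):
    """Hypothetical penalty: weight times total mask mass, a soft proxy for kept length."""
    return weight * sum(gaussian_mask(n_samples, sigma))

def penalty_grad(n_samples, sigma, weight=0.01):
    """Gradient of the hypothetical penalty with respect to sigma."""
    return weight * sum(d_mask_d_sigma(n_samples, sigma))

# Gradient descent on the penalty alone narrows the window.
# In a real model, the task loss would push back against this shrinkage.
sigma = 40.0
for _ in range(50):
    sigma -= 5.0 * penalty_grad(200, sigma)
```

A hard rectangular crop would have zero gradient with respect to its length almost everywhere, which is why a smooth window is the natural choice when the recording format itself is to be learned.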

research
03/22/2016

Energy-Efficient ConvNets Through Approximate Computing

Recently ConvNets or convolutional neural networks (CNN) have come up as...
research
04/07/2022

Energy Consumption and Performance of Heapsort in Hardware and Software

In this poster abstract we will report on a case study on implementing t...
research
07/22/2020

Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement

While machine learning techniques are traditionally resource intensive, ...
research
09/25/2020

Resource-Constrained On-Device Learning by Dynamic Averaging

The communication between data-generating devices is partially responsib...
research
07/19/2013

Speaker Independent Continuous Speech to Text Converter for Mobile Application

An efficient speech to text converter for mobile application is presente...
research
03/19/2023

ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement

Social ambiance describes the context in which social interactions happe...
research
09/07/2021

Energy Efficient Sampling Policies for Edge Computing Feedback Systems

We study the problem of finding efficient sampling policies in an edge-b...
