Binary classification of spoken words with passive elastic metastructures

11/14/2021
by   Tena Dubček, et al.
0

Many electronic devices spend most of their time waiting for a wake-up event: pacemakers waiting for an anomalous heartbeat, security systems on alert to detect an intruder, smartphones listening for the user to say a wake-up phrase. These devices continuously convert physical signals into electrical currents that are then analyzed on a digital computer – leading to power consumption even when no event is taking place. Solving this problem requires the ability to passively distinguish relevant from irrelevant events (e.g. tell a wake-up phrase from a regular conversation). Here, we experimentally demonstrate an elastic metastructure, consisting of a network of coupled silicon resonators, that passively discriminates between pairs of spoken words – solving the wake-up problem for scenarios where only two classes of events are possible. This passive speech recognition is demonstrated on a dataset from speakers with significant gender and accent diversity. The geometry of the metastructure is determined during the design process, in which the network of resonators ('mechanical neurones') learns to selectively respond to spoken words. Training is facilitated by a machine learning model that reduces the number of computationally expensive three-dimensional elastic wave simulations. By embedding event detection in the structural dynamics, mechanical neural networks thus enable novel classes of always-on smart devices with no standby power consumption.

READ FULL TEXT

page 4

page 7

research
07/13/2022

Estimating the Power Consumption of Heterogeneous Devices when performing AI Inference

Modern-day life is driven by electronic devices connected to the interne...
research
12/18/2019

How the Mechanical Properties and Thickness of Glass Affect TPaD Performance

One well-known class of surface haptic devices that we have called TPaDs...
research
05/21/2018

Event-based Convolutional Networks for Object Detection in Neuromorphic Cameras

Event-based cameras are bioinspired sensors able to perceive changes in ...
research
11/12/2021

A Convolutional Neural Network Based Approach to Recognize Bangla Spoken Digits from Speech Signal

Speech recognition is a technique that converts human speech signals int...
research
05/03/2023

Plug-and-Play Multilingual Few-shot Spoken Words Recognition

As technology advances and digital devices become prevalent, seamless hu...
research
01/24/2023

A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons

With the expansion of AI-powered virtual assistants, there is a need for...

Please sign up or login with your details

Forgot password? Click here to reset