WhisperWand: Simultaneous Voice and Gesture Tracking Interface

01/24/2023
by   Yang Bai, et al.
0

This paper presents the design and implementation of WhisperWand, a comprehensive voice and motion tracking interface for voice assistants. Distinct from prior works, WhisperWand is a precise tracking interface that can co-exist with the voice interface on low sampling rate voice assistants. Taking handwriting as a specific application, it can also capture natural strokes and the individualized style of writing while occupying only a single frequency. The core technique includes an accurate acoustic ranging method called Cross Frequency Continuous Wave (CFCW) sonar, enabling voice assistants to use ultrasound as a ranging signal while using the regular microphone system of voice assistants as a receiver. We also design a new optimization algorithm that only requires a single frequency for time difference of arrival. WhisperWand prototype achieves 73 um of median error for 1D ranging and 1.4 mm of median error in 3D tracking of an acoustic beacon using the microphone array used in voice assistants. Our implementation of an in-air handwriting interface achieves 94.1 writing on paper (96.6 authentication only increases from 6.26

READ FULL TEXT

page 9

page 11

research
06/01/2021

A Continuous Liveness Detection for Voice Authentication on Smart Devices

Voice biometrics is drawing increasing attention as it is a promising al...
research
10/29/2020

Acoustic Correlates of the Voice Qualifiers: A Survey

Our voices are as distinctive as our faces and fingerprints. There is a ...
research
01/20/2019

MilliSonic: Pushing the Limits of Acoustic Motion Tracking

Recent years have seen interest in device tracking and localization usin...
research
06/02/2021

A Continuous Liveness Detection System for Text-independent Speaker Verification

Voice authentication is drawing increasing attention and becomes an attr...
research
06/29/2020

Ultra2Speech – A Deep Learning Framework for Formant Frequency Estimation and Tracking from Ultrasound Tongue Images

Thousands of individuals need surgical removal of their larynx due to cr...
research
04/29/2021

On the use of PN Ranging with High-rate Spectrally-efficient Modulations

In this paper, we study the feasibility of coupling the PN ranging with ...
research
02/19/2020

EyeTAP: A Novel Technique using Voice Inputs to Address the Midas Touch Problem for Gaze-based Interactions

One of the main challenges of gaze-based interactions is the ability to ...

Please sign up or login with your details

Forgot password? Click here to reset