Oral Billiards

01/30/2020
by   Elaine Y L Tsiang, et al.
0

We propose a physical model of speech to explain its precision and robustness. We begin by reducing the dynamics to the bare minimum of polygonal billiards. The symbolic stability of the billiard trajectories against variations in action and the oral cavity geometry forms the basis for precision and robustness in articulation. This stability survives forcing and dissipation to underpin reliable encoding of the trajectories into acoustic emissions. The kinematics of oral billiards and the cyclical nature of the forcing mechanism engender a grammar of the syllable independent of any language. The symbolic dynamics of oral billiards is rendered nearly maximally observable by their concomitant acoustic emissions. Speech recognition is the set of computations on the sub-maximally informative acoustic observables from which the symbolic dynamics of oral billiards may be inferred.

READ FULL TEXT
research
02/26/2023

From Audio to Symbolic Encoding

Automatic music transcription (AMT) aims to convert raw audio to symboli...
research
03/27/2018

Multi-Modal Data Augmentation for End-to-end ASR

We present a new end-to-end architecture for automatic speech recognitio...
research
10/10/2017

Contaminated speech training methods for robust DNN-HMM distant speech recognition

Despite the significant progress made in the last years, state-of-the-ar...
research
05/29/2015

Symbolic Segmentation Using Algorithm Selection

In this paper we present an alternative approach to symbolic segmentatio...
research
11/08/2012

Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

In this paper, we propose a novel architecture of wavelet network called...
research
03/03/2022

OptiTrap: Optimal Trap Trajectories for Acoustic Levitation Displays

Acoustic levitation has recently demonstrated the ability to create volu...

Please sign up or login with your details

Forgot password? Click here to reset