Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

06/22/2022
by Running Zhao, et al.

Because microphones are easily affected by noise and soundproof materials, the radio frequency (RF) signal is a promising candidate for recovering audio: it is immune to noise and can traverse many soundproof objects. In this paper, we introduce Radio2Speech, a system that uses RF signals to recover high-quality speech played by a loudspeaker. Radio2Speech recovers speech comparable in quality to a microphone recording, whereas existing approaches recover only single-tone music or incomprehensible speech. We use Radio UNet to accurately recover speech in the time-frequency domain from RF signals with a limited frequency band. We also incorporate a neural vocoder to synthesize the speech waveform from the estimated time-frequency representation without using the contaminated phase. Quantitative and qualitative evaluations show that in quiet, noisy, and soundproof scenarios, Radio2Speech achieves state-of-the-art performance and is on par with a microphone operating in quiet scenarios.
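
To make the idea of a phase-free time-frequency representation concrete, the sketch below computes a magnitude spectrogram of a test tone and discards the phase. It is an illustration only, not the Radio2Speech pipeline: the function name `magnitude_spectrogram`, the frame length, the hop size, and the 8 kHz sample rate are all assumptions chosen for the example, and the abstract does not state which representation or parameters the authors use.

```python
import cmath
import math


def magnitude_spectrogram(signal, frame_len=64, hop=32):
    """Return a list of frames, each a list of DFT magnitudes (phase discarded).

    frame_len and hop are illustrative values, not taken from the paper.
    """
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    # Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    num_bins = frame_len // 2 + 1  # non-negative frequencies only
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        magnitudes = []
        for k in range(num_bins):
            # Naive DFT of one bin; fine for a small demonstration.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            magnitudes.append(abs(acc))  # keep |X|, drop the phase angle
        frames.append(magnitudes)
    return frames


SAMPLE_RATE = 8000  # Hz, assumed for this example
tone = [math.sin(2 * math.pi * 1000 * n / SAMPLE_RATE) for n in range(1024)]
spec = magnitude_spectrogram(tone)

# Average over time and locate the strongest frequency bin.
avg = [sum(frame[k] for frame in spec) / len(spec) for k in range(len(spec[0]))]
peak_bin = max(range(len(avg)), key=avg.__getitem__)
peak_hz = peak_bin * SAMPLE_RATE / 64
print(peak_hz)
```

Each frame keeps only the magnitude at each frequency, so a waveform cannot be rebuilt from it by a plain inverse transform. That gap is what a vocoder fills when it synthesizes a waveform from the time-frequency representation.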

research
02/02/2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss

Tacotron-based text-to-speech (TTS) systems directly synthesize speech f...
research
04/16/2019

Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

Speech separation has been very successful with deep learning techniques...
research
03/05/2023

Time-frequency Network for Robust Speaker Recognition

The wide deployment of speech-based biometric systems usually demands hi...
research
05/19/2021

Deep Learning Radio Frequency Signal Classification with Hybrid Images

In recent years, Deep Learning (DL) has been successfully applied to det...
research
03/02/2021

Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech

In this paper we investigate the task of detecting carrier frequency dif...
research
08/09/2018

LED Arrays of Laser Printers as Sources of Valuable Emissions for Electromagnetic Penetration Process

Protection of information against electromagnetic eavesdropping is an im...
research
12/03/2020

Individually amplified text-to-speech

Text-to-speech (TTS) offers the opportunity to compensate for a hearing ...