
-
Gauge Invariant Autoregressive Neural Networks for Quantum Lattice Models
Gauge invariance plays a crucial role in quantum mechanics from condense...
-
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Leveraging additional speaker information to facilitate speech separatio...
-
Task Offloading for Large-Scale Asynchronous Mobile Edge Computing: An Index Policy Approach
Mobile-edge computing (MEC) offloads computational tasks from wireless d...
-
Rethinking the Separation Layers in Speech Separation Networks
Modules in all existing speech separation networks can be categorized in...
-
ESPnet-SE: end-to-end speech enhancement and separation toolkit designed for ASR integration
We present ESPnet-SE, which is designed for the quick development of spe...
-
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Joint optimization of multi-channel front-end and automatic speech recog...
-
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Recently, an end-to-end speaker-attributed automatic speech recognition ...
-
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Multi-speaker speech recognition of unsegmented recordings has diverse a...
-
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
With its strong modeling capacity that comes from a multi-head and multi...
-
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020
This paper describes the Microsoft speaker diarization system for monaur...
-
Speaker Separation Using Speaker Inventories and Estimated Speech
We propose speaker separation using speaker inventories and estimated sp...
-
Justifications for Goal-Directed Constraint Answer Set Programming
Ethical and legal concerns make it necessary for programs that may direc...
-
An End-to-end Architecture of Online Multi-channel Speech Separation
Multi-speaker speech recognition has been one of the key challenges in co...
-
Brain Stroke Lesion Segmentation Using Consistent Perception Generative Adversarial Network
Recently, the state-of-the-art deep learning methods have demonstrated i...
-
Continuous Speech Separation with Conformer
Continuous speech separation plays a vital role in complicated speech re...
-
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Recently, an end-to-end (E2E) speaker-attributed automatic speech recogn...
-
Deep Multi-Task Learning for Cooperative NOMA: System Design and Principles
Envisioned as a promising component of the future wireless Internet-of-T...
-
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
We propose an end-to-end speaker-attributed automatic speech recognition...
-
Neural Speech Separation Using Spatially Distributed Microphones
This paper proposes a neural network based speech separation method usin...
-
Generative Adversarial Zero-shot Learning via Knowledge Graphs
Zero-shot learning (ZSL) is to handle the prediction of those unseen cla...
-
Continuous speech separation: dataset and analysis
This paper describes a dataset and protocols for evaluating continuous s...
-
Spectrum Intelligent Radio: Technology, Development, and Future Trends
The advent of Industry 4.0 with massive connectivity places significant ...
-
Advances in Online Audio-Visual Meeting Transcription
This paper describes a system that generates speaker-annotated transcrip...
-
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
An important problem in ad-hoc microphone speech separation is how to gu...
-
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Recent studies in deep learning-based speech separation have proven the ...
-
A Learning-Based Two-Stage Spectrum Sharing Strategy with Multiple Primary Transmit Power Levels
Multi-parameter cognition in a cognitive radio network (CRN) provides a ...
-
PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch
We introduce PyKaldi2 speech recognition toolkit implemented based on Ka...
-
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
In recent years, Convolutional Neural Networks (CNNs) have shown superio...
-
Meeting Transcription Using Virtual Microphone Arrays
We describe a system that generates speaker-annotated transcripts of mee...
-
Low-Latency Speaker-Independent Continuous Speech Separation
Speaker independent continuous speech separation (SI-CSS) is a task of c...
-
On the Distribution of GSVD
In this paper, some new results on the distribution of the generalized s...
-
Understanding the Impact of Label Granularity on CNN-based Image Classification
In recent years, supervised learning using Convolutional Neural Networks...
-
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
The goal of this work is to develop a meeting transcription system that ...
-
Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing
The recent advances of hardware technology have made the intelligent ana...
-
Mobile Collaborative Spectrum Sensing for Heterogeneous Networks: A Bayesian Machine Learning Approach
Spectrum sensing in a large-scale heterogeneous network is very challeng...
-
Asymptotic Performance Analysis of GSVD-NOMA Systems with a Large-Scale Antenna Array
This paper considers a multiple-input multiple-output (MIMO) downlink co...
-
Developing Far-Field Speaker System Via Teacher-Student Learning
In this study, we develop the keyword spotting (KWS) and acoustic model ...
-
Speaker-Invariant Training via Adversarial Learning
We propose a novel adversarial multi-task learning scheme, aiming at act...
-
Cracking the cocktail party problem by multi-beam deep attractor network
While recent progresses in neural network approaches to single-channel s...
-
Pedestrian-Robot Interaction Experiments in an Exit Corridor
The study of human-robot interaction (HRI) has received increasing resea...
-
Priority-Aware Near-Optimal Scheduling for Heterogeneous Multi-Core Systems with Specialized Accelerators
To deliver high performance in power limited systems, architects have tu...
-
Task Scheduling for Heterogeneous Multicore Systems
In recent years, as the demand for low energy and high performance compu...
-
Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition
Unsupervised domain adaptation of speech signal aims at adapting a well-...
-
Image Quality Assessment Guided Deep Neural Networks Training
For many computer vision problems, the deep neural networks are trained ...
-
Improving Adherence to Heart Failure Management Guidelines via Abductive Reasoning
Management of chronic diseases such as heart failure (HF) is a major pub...
-
Speaker-independent Speech Separation with Deep Attractor Network
Despite the recent success of deep learning for many speech processing t...
-
End-to-End Attention based Text-Dependent Speaker Verification
A new type of End-to-End system for text-dependent speaker verification ...
-
Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Deep clustering is the first method to handle general audio separation s...
-
A Physician Advisory System for Chronic Heart Failure Management Based on Knowledge Patterns
Management of chronic diseases such as heart failure, diabetes, and chro...
-
Single-Channel Multi-Speaker Separation using Deep Clustering
Deep clustering is a recently introduced deep learning architecture that...