Tao Lei

research

∙ 06/07/2023

TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

The hybrid architecture of convolution neural networks (CNN) and Transfo...

0 Tao Lei, et al. ∙

research

∙ 06/06/2023

CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

The hybrid architecture of convolutional neural networks (CNNs) and Tran...

0 Tao Lei, et al. ∙

research

∙ 06/03/2023

Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

Popular Transformer networks have been successfully applied to remote se...

0 Tao Lei, et al. ∙

research

∙ 04/11/2023

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

We propose Conditional Adapter (CoDA), a parameter-efficient transfer le...

0 Tao Lei, et al. ∙

research

∙ 04/04/2023

Rethinking the Role of Token Retrieval in Multi-Vector Retrieval

Multi-vector retrieval models such as ColBERT [Khattab and Zaharia, 2020...

0 Jinhyuk Lee, et al. ∙

research

∙ 03/17/2023

CoLT5: Faster Long-Range Transformers with Conditional Computation

Many natural language processing tasks benefit from long inputs, but pro...

0 Joshua Ainslie, et al. ∙

research

∙ 12/04/2022

Lightweight Facial Attractiveness Prediction Using Dual Label Distribution

Facial attractiveness prediction (FAP) aims to assess the facial attract...

0 Shu Liu, et al. ∙

research

∙ 11/02/2022

Multi-Vector Retrieval as Sparse Alignment

Multi-vector retrieval models improve over single-vector dual encoders o...

0 Yujie Qian, et al. ∙

research

∙ 05/25/2022

Training Language Models with Memory Augmentation

Recent work has improved language models remarkably by equipping them wi...

0 Zexuan Zhong, et al. ∙

research

∙ 05/23/2022

Simple Recurrence Improves Masked Language Models

In this work, we explore whether modeling recurrence into the Transforme...

0 Tao Lei, et al. ∙

research

∙ 02/18/2022

Mixture-of-Experts with Expert Choice Routing

Sparsely-activated Mixture-of-experts (MoE) models allow the number of p...

0 Yanqi Zhou, et al. ∙

research

∙ 10/11/2021

SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition

The Transformer architecture has been well adopted as a dominant archite...

0 Jing Pan, et al. ∙

research

∙ 08/17/2021

Channel-Temporal Attention for First-Person Video Domain Adaptation

Unsupervised Domain Adaptation (UDA) can transfer knowledge from labeled...

0 Xianyuan Liu, et al. ∙

research

∙ 06/22/2021

Team PyKale (xy9) Submission to the EPIC-Kitchens 2021 Unsupervised Domain Adaptation Challenge for Action Recognition

This report describes the technical details of our submission to the EPI...

0 Xianyuan Liu, et al. ∙

research

∙ 04/08/2021

Nutribullets Hybrid: Multi-document Health Summarization

We present a method for generating comparative summaries that highlights...

2 Darsh J Shah, et al. ∙

research

∙ 03/22/2021

Nutri-bullets: Summarizing Health Studies by Composing Segments

We introduce Nutri-bullets, a multi-document summarization task for heal...

17 Darsh J Shah, et al. ∙

research

∙ 02/24/2021

When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute

Large language models have become increasingly difficult to train becaus...

1 Tao Lei, et al. ∙

research

∙ 09/28/2020

Medical Image Segmentation Using Deep Learning: A Survey

Deep learning has been widely used for medical image segmentation and a ...

0 Tao Lei, et al. ∙

research

∙ 09/15/2020

Autoregressive Knowledge Distillation through Imitation Learning

The performance of autoregressive models on natural language generation ...

0 Alexander Lin, et al. ∙

research

∙ 05/27/2020

Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport

Selecting input features of top relevance has become a popular method fo...

7 Kyle Swanson, et al. ∙

research

∙ 05/21/2020

ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition

In this paper we present state-of-the-art (SOTA) performance on the Libr...

0 Jing Pan, et al. ∙

research

∙ 11/09/2019

Interactive Classification by Asking Informative Questions

Natural language systems often rely on a single, potentially ambiguous i...

0 Lili Yu, et al. ∙

research

∙ 11/04/2019

Metric Learning for Dynamic Text Classification

Traditional text classifiers are limited to predicting over a fixed set ...

0 Jeremy Wohlwend, et al. ∙

research

∙ 10/10/2019

Structured Pruning of Large Language Models

Large language models have recently achieved state of the art performanc...

0 Ziheng Wang, et al. ∙

research

∙ 06/07/2019

Building a Production Model for Retrieval-Based Chatbots

Response suggestion is an important task for building human-computer con...

0 Kyle Swanson, et al. ∙

research

∙ 04/08/2019

Adaptive Morphological Reconstruction for Seeded Image Segmentation

Morphological reconstruction (MR) is often employed by seeded image segm...

0 Tao Lei, et al. ∙

research

∙ 03/12/2019

Service Capacity Enhanced Task Offloading and Resource Allocation in Multi-Server Edge Computing Environment

An edge computing environment features multiple edge servers and multipl...

0 Wei Du, et al. ∙

research

∙ 09/07/2018

Adversarial Domain Adaptation for Duplicate Question Detection

We address the problem of detecting duplicate questions in forums, which...

0 Darsh J Shah, et al. ∙

research

∙ 09/08/2017

Training RNNs as Fast as CNNs

Common recurrent neural network architectures scale poorly due to the in...

0 Tao Lei, et al. ∙

research

∙ 05/26/2017

Style Transfer from Non-Parallel Text by Cross-Alignment

This paper focuses on style transfer on the basis of non-parallel text. ...

0 Tianxiao Shen, et al. ∙

research

∙ 05/25/2017

Deriving Neural Architectures from Sequence and Graph Kernels

The design of neural architectures for structured objects is typically g...

0 Tao Lei, et al. ∙

research

∙ 06/13/2016

Rationalizing Neural Predictions

Prediction without justification has limited applicability. As a remedy,...

0 Tao Lei, et al. ∙

research

∙ 12/17/2015

Semi-supervised Question Retrieval with Gated Convolutions

Question answering forums are rapidly growing in size with no effective ...

0 Tao Lei, et al. ∙

research

∙ 08/17/2015

Molding CNNs for text: non-linear, non-consecutive convolutions

The success of deep learning often derives from well-chosen operational ...

0 Tao Lei, et al. ∙

Tao Lei

Featured Co-authors

Sign in with Google

Consider DeepAI Pro