Lala Li

research

∙ 07/31/2023

Guiding Image Captioning Models Toward More Specific Captions

Image captioning is conventionally formulated as the task of generating ...

0 Simon Kornblith, et al. ∙

research

∙ 05/22/2023

FIT: Far-reaching Interleaved Transformers

We present FIT: a transformer-based architecture with efficient self-att...

0 Ting Chen, et al. ∙

research

∙ 10/12/2022

A Generalist Framework for Panoptic Segmentation of Images and Videos

Panoptic segmentation assigns semantic and instance ID labels to every p...

0 Ting Chen, et al. ∙

research

∙ 06/15/2022

A Unified Sequence Interface for Vision Tasks

While language tasks are naturally expressed in a single, unified, model...

6 Ting Chen, et al. ∙

research

∙ 05/23/2022

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

We present Imagen, a text-to-image diffusion model with an unprecedented...

0 Chitwan Saharia, et al. ∙

research

∙ 09/22/2021

Pix2seq: A Language Modeling Framework for Object Detection

This paper presents Pix2Seq, a simple and generic framework for object d...

11 Ting Chen, et al. ∙

research

∙ 11/05/2020

Intriguing Properties of Contrastive Losses

Contrastive loss and its variants have become very popular recently for ...

14 Ting Chen, et al. ∙

research

∙ 10/29/2019

Big Bidirectional Insertion Representations for Documents

The Insertion Transformer is well suited for long form text generation d...

0 Lala Li, et al. ∙

research

∙ 07/09/2019

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Increasing the batch size is a popular way to speed up neural network tr...

3 Guodong Zhang, et al. ∙

Lala Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro