Kaipeng Zhang

research

∙ 09/07/2023

ImageBind-LLM: Multi-modality Instruction Tuning

We present ImageBind-LLM, a multi-modality instruction tuning method of ...

Jiaming Han, et al.

research

∙ 08/25/2023

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

Large language models (LLMs) have revolutionized natural language proces...

Wenqi Shao, et al.

research

∙ 08/11/2023

Foundation Model is Efficient Multimodal Multitask Model Selector

This paper investigates an under-explored but important problem: given a...

Fanqing Meng, et al.

research

∙ 08/07/2023

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

Recent advancements in Large Vision-Language Models (LVLMs) have demonst...

Wenqi Shao, et al.

research

∙ 07/20/2023

Meta-Transformer: A Unified Framework for Multimodal Learning

Multimodal learning aims to build models that can process and relate inf...

Yiyuan Zhang, et al.

research

∙ 06/20/2023

Align, Adapt and Inject: Sound-guided Unified Image Generation

Text-guided image generation has witnessed unprecedented progress due to...

Yue Yang, et al.

research

∙ 06/15/2023

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models

Large Vision-Language Models (LVLMs) have recently played a dominant rol...

Peng Xu, et al.

research

∙ 05/29/2023

DiffRate: Differentiable Compression Rate for Efficient Vision Transformers

Token compression aims to speed up large-scale vision transformers (e.g....

Mengzhao Chen, et al.

research

∙ 03/09/2020

FarSee-Net: Real-Time Semantic Segmentation by Efficient Multi-scale Context Aggregation and Feature Space Super-resolution

Real-time semantic segmentation is desirable in many robotic application...

Zhanpeng Zhang, et al.

research

∙ 07/08/2019

Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression

This paper presents our approach for the engagement intensity regression...

Kai Wang, et al.

research

∙ 11/06/2018

Super-Identity Convolutional Neural Network for Face Hallucination

Face hallucination is a generative task to super-resolve the facial imag...

Kaipeng Zhang, et al.

research

∙ 04/11/2016

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Face detection and alignment in unconstrained environment are challengin...

Kaipeng Zhang, et al.
