Ziyu Guo

research

∙ 09/14/2023

HIGT: Hierarchical Interaction Graph-Transformer for Whole Slide Image Analysis

In computation pathology, the pyramid structure of gigapixel Whole Slide...

0 Ziyu Guo, et al. ∙

research

∙ 09/07/2023

ImageBind-LLM: Multi-modality Instruction Tuning

We present ImageBind-LLM, a multi-modality instruction tuning method of ...

0 Jiaming Han, et al. ∙

research

∙ 09/01/2023

Point-Bind Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

We introduce Point-Bind, a 3D multi-modality model aligning point clouds...

0 Ziyu Guo, et al. ∙

research

∙ 05/25/2023

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

Recently, video object segmentation (VOS) referred by multi-modal signal...

0 Shilin Yan, et al. ∙

research

∙ 05/04/2023

Personalize Segment Anything Model with One Shot

Driven by large-data pre-training, Segment Anything Model (SAM) has been...

4 Renrui Zhang, et al. ∙

research

∙ 03/29/2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance

Understanding 3D scenes from multi-view inputs has been proven to allevi...

0 Ziyu Guo, et al. ∙

research

∙ 03/01/2023

Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis

Performances on standard 3D point cloud benchmarks have plateaued, resul...

0 Renrui Zhang, et al. ∙

research

∙ 02/27/2023

Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training

Masked Autoencoders (MAE) have shown promising performance in self-super...

0 Ziyu Guo, et al. ∙

research

∙ 09/28/2022

CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention

Contrastive Language-Image Pre-training (CLIP) has been shown to learn v...

8 Ziyu Guo, et al. ∙

research

∙ 07/03/2022

Can Language Understand Depth?

Besides image classification, Contrastive Language-Image Pre-training (C...

0 Renrui Zhang, et al. ∙

research

∙ 05/28/2022

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

Masked Autoencoders (MAE) have shown great potentials in self-supervised...

0 Renrui Zhang, et al. ∙

research

∙ 03/24/2022

MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection

Monocular 3D object detection has long been a challenging task in autono...

0 Renrui Zhang, et al. ∙

research

∙ 12/04/2021

PointCLIP: Point Cloud Understanding by CLIP

Recently, zero-shot and few-shot learning via Contrastive Vision-Languag...

9 Renrui Zhang, et al. ∙

research

∙ 11/19/2021

DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion

Point cloud processing is a challenging task due to its sparsity and irr...

0 Renrui Zhang, et al. ∙

research

∙ 10/12/2021

Improved Heatmap-based Landmark Detection

Mitral valve repair is a very difficult operation, often requiring exper...

17 Huifeng Yao, et al. ∙

Ziyu Guo

Featured Co-authors

Sign in with Google

Consider DeepAI Pro