Bilateral-ViT for Robust Fovea Localization

10/19/2021
by   Sifan Song, et al.
0

The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel vision transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network named Bilateral-Vision-Transformer (Bilateral-ViT) consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized multi-scale feature fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes the new state of the arts on both Messidor and PALM datasets.

READ FULL TEXT

page 2

page 4

research
02/14/2023

Bilateral-Fuser: A Novel Multi-cue Fusion Architecture with Anatomical-aware Tokens for Fovea Localization

Accurate localization of fovea is one of the primary steps in analyzing ...
research
05/25/2023

Multi-scale Efficient Graph-Transformer for Whole Slide Image Classification

The multi-scale information among the whole slide images (WSIs) is essen...
research
04/29/2022

Where in the World is this Image? Transformer-based Geo-localization in the Wild

Predicting the geographic location (geo-localization) from a single grou...
research
02/25/2023

TBFormer: Two-Branch Transformer for Image Forgery Localization

Image forgery localization aims to identify forged regions by capturing ...
research
03/26/2023

RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning

Existing Transformer-based RGBT tracking methods either use cross-attent...
research
11/10/2021

Learning to Disentangle Scenes for Person Re-identification

There are many challenging problems in the person re-identification (ReI...
research
06/22/2023

Toward Automated Detection of Microbleeds with Anatomical Scale Localization: A Complete Clinical Diagnosis Support Using Deep Learning

Cerebral Microbleeds (CMBs) are chronic deposits of small blood products...

Please sign up or login with your details

Forgot password? Click here to reset