ViT Cane: Visual Assistant for the Visually Impaired

09/26/2021
by   Bhavesh Kumar, et al.
0

Blind and visually challenged face multiple issues with navigating the world independently. Some of these challenges include finding the shortest path to a destination and detecting obstacles from a distance. To tackle this issue, this paper proposes ViT Cane, which leverages a vision transformer model in order to detect obstacles in real-time. Our entire system consists of a Pi Camera Module v2, Raspberry Pi 4B with 8GB Ram and 4 motors. Based on tactile input using the 4 motors, the obstacle detection model is highly efficient in helping visually impaired navigate unknown terrain and is designed to be easily reproduced. The paper discusses the utility of a Visual Transformer model in comparison to other CNN based models for this specific application. Through rigorous testing, the proposed obstacle detection model has achieved higher performance on the Common Object in Context (COCO) data set than its CNN counterpart. Comprehensive field tests were conducted to verify the effectiveness of our system for holistic indoor understanding and obstacle avoidance.

READ FULL TEXT
research
01/12/2022

Obstacle avoidance for blind people using a 3D camera and a haptic feedback sleeve

Navigation and obstacle avoidance are some of the hardest tasks for the ...
research
09/27/2020

Virtual Experience to Real World Application: Sidewalk Obstacle Avoidance Using Reinforcement Learning for Visually Impaired

Finding a path free from obstacles that poses minimal risk is critical f...
research
05/23/2020

An Intelligent Obstacle and Edge Recognition System using Bug Algorithm

Obstacle avoidance is an important task in robotics as the autonomous ro...
research
07/17/2018

Real-time on-board obstacle avoidance for UAVs based on embedded stereo vision

In order to improve usability and safety, modern unmanned aerial vehicle...
research
10/15/2020

APF-PF: Probabilistic Depth Perception for 3D Reactive Obstacle Avoidance

This paper proposes a framework for 3D obstacle avoidance in the presenc...
research
07/07/2021

Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World

Common fully glazed facades and transparent objects present architectura...

Please sign up or login with your details

Forgot password? Click here to reset