The VisionStick is a smart system integrated with a mobility cane, designed to help visually impaired users navigate their surroundings. The device detects obstacles around the user and, when one is found, alerts the user through haptic feedback.
Detected object classes: person, bicycle, car, motorcycle, fire hydrant, stop sign, chair, bus, traffic light.
Cities like Heilbronn have constant construction and complex traffic zones that make navigation especially difficult for people with visual impairments.
Vision-Stick enhances a traditional white cane with real-time obstacle awareness and discreet haptic feedback, aiming to increase safety and independence.
- Obstacle detection with stereo cameras and YOLO object detection (see the detection sketch after this list).
- Distance measurement using stereo depth + ultrasonic sensors.
- Haptic feedback via two vibration motors (direction and urgency encoded).
- Lightweight Raspberry Pi client + external server for real-time processing.
- Stereo calibration with checkerboard for accurate depth estimation.
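As an illustration of the detection step, the sketch below runs a YOLO model on a single frame and keeps only the obstacle classes listed above. The model weights (`yolov8n.pt`), confidence threshold, and file names are assumptions for illustration, not the project's exact configuration.

```python
# Minimal sketch: run YOLO on one frame and keep only the obstacle classes
# listed above. Model weights and threshold are illustrative assumptions.
import cv2
from ultralytics import YOLO

OBSTACLE_CLASSES = {"person", "bicycle", "car", "motorcycle", "fire hydrant",
                    "stop sign", "chair", "bus", "traffic light"}

model = YOLO("yolov8n.pt")  # assumed weights; any Ultralytics model works

frame = cv2.imread("sample_frame.jpg")  # placeholder input frame
results = model(frame)[0]

for box in results.boxes:
    label = model.names[int(box.cls)]
    if label in OBSTACLE_CLASSES and float(box.conf) > 0.5:
        x1, y1, x2, y2 = map(int, box.xyxy[0])
        print(f"{label}: bounding box ({x1}, {y1}) - ({x2}, {y2})")
```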
- Raspberry Pi 5 (8 GB)
- 2 × Raspberry Pi Cameras (IMX219)
- 2 × HC-SR04 ultrasonic sensors
- 2 × cylindrical vibration motors
- 3D-printed mounting bracket for the cane
- Python 3.10+
- OpenCV (contrib) — stereo calibration & disparity maps
- Ultralytics YOLO — object detection
- Flask — HTTPS API for communication between Pi and server
- gpiozero / RPi.GPIO — ultrasonic + motor control
We created an HTTPS server using the Flask framework to receive and store incoming frames from the Pi 5 cameras. This server runs on an external host (e.g., a Windows laptop), where the stereo-vision module is executed. This relieves the Raspberry Pi 5 of CPU-intensive operations, reducing overhead and allowing the system to meet the real-time constraints of such an assistive device. The server must be started every time the stick is used, and the code must be updated accordingly to point to it.
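A minimal sketch of such a frame-receiving endpoint is shown below. The route name, port, certificate files, and in-memory frame store are assumptions for illustration, not the project's actual server layout.

```python
# Sketch of the frame-receiving HTTPS server. Route name, port, and
# certificate paths are illustrative assumptions.
import cv2
import numpy as np
from flask import Flask, request

app = Flask(__name__)
latest_frames = {}  # most recent frame per camera, kept in memory

@app.route("/upload/<camera_id>", methods=["POST"])
def upload(camera_id):
    # Decode the JPEG payload sent by the Raspberry Pi into an OpenCV image.
    buffer = np.frombuffer(request.data, dtype=np.uint8)
    frame = cv2.imdecode(buffer, cv2.IMREAD_COLOR)
    if frame is None:
        return "invalid frame", 400
    latest_frames[camera_id] = frame
    return "ok", 200

if __name__ == "__main__":
    # A self-signed certificate enables HTTPS between the Pi and the laptop.
    app.run(host="0.0.0.0", port=5000, ssl_context=("cert.pem", "key.pem"))
```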
For stereo vision, we performed a camera calibration to compute the parameters of each camera and the relative geometry between them. To do this, we generated a checkerboard pattern with 7 × 5 inner corners, printed it on A4 paper at 100% scale, and mounted it on a flat surface to keep it rigid. Using our Raspberry Pi setup, we then captured a total of 65 stereo image pairs of this checkerboard. The images were taken from different distances (from about 30 cm up to 1.5 m), at varied angles (tilted left, right, up, down, and slightly rotated), and at different positions in the frame (center, corners, and edges). This variety ensured that the calibration algorithm could robustly estimate the stereo parameters. After collecting the dataset, we ran the calibration script, which detected the checkerboard corners in all pairs and produced a file called `stereo_params.npz`. This file contains the rectification maps and projection matrix required for disparity and depth calculation. To validate the results, we checked rectified sample images and confirmed that the horizontal lines of the checkerboard appeared aligned in both views. If the alignment had not been satisfactory, the calibration process could have been repeated with additional or more varied photos.
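The calibration step can be condensed into a sketch like the one below. The checkerboard has 7 × 5 inner corners as described; the square size, file paths, and variable names are illustrative assumptions rather than the project's actual script.

```python
# Condensed sketch of the stereo calibration step. Square size, file paths,
# and variable names are illustrative assumptions.
import glob
import cv2
import numpy as np

PATTERN = (7, 5)      # inner corners (columns, rows)
SQUARE_SIZE = 0.025   # assumed square edge length in metres

# 3D reference points of the checkerboard corners in its own plane.
objp = np.zeros((PATTERN[0] * PATTERN[1], 3), np.float32)
objp[:, :2] = np.mgrid[0:PATTERN[0], 0:PATTERN[1]].T.reshape(-1, 2) * SQUARE_SIZE

obj_points, left_points, right_points = [], [], []
for lf, rf in zip(sorted(glob.glob("left/*.png")), sorted(glob.glob("right/*.png"))):
    left = cv2.imread(lf, cv2.IMREAD_GRAYSCALE)
    right = cv2.imread(rf, cv2.IMREAD_GRAYSCALE)
    ok_l, corners_l = cv2.findChessboardCorners(left, PATTERN)
    ok_r, corners_r = cv2.findChessboardCorners(right, PATTERN)
    if ok_l and ok_r:
        obj_points.append(objp)
        left_points.append(corners_l)
        right_points.append(corners_r)

size = left.shape[::-1]
# Calibrate each camera, then the stereo pair, then compute rectification.
_, K1, d1, _, _ = cv2.calibrateCamera(obj_points, left_points, size, None, None)
_, K2, d2, _, _ = cv2.calibrateCamera(obj_points, right_points, size, None, None)
_, K1, d1, K2, d2, R, T, _, _ = cv2.stereoCalibrate(
    obj_points, left_points, right_points, K1, d1, K2, d2, size,
    flags=cv2.CALIB_FIX_INTRINSIC)
R1, R2, P1, P2, Q, _, _ = cv2.stereoRectify(K1, d1, K2, d2, size, R, T)

# Save the rectification maps and projection (Q) matrix for later use.
map1x, map1y = cv2.initUndistortRectifyMap(K1, d1, R1, P1, size, cv2.CV_32FC1)
map2x, map2y = cv2.initUndistortRectifyMap(K2, d2, R2, P2, size, cv2.CV_32FC1)
np.savez("stereo_params.npz", map1x=map1x, map1y=map1y,
         map2x=map2x, map2y=map2y, Q=Q)
```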
In the source code, we used several widely adopted Python libraries, such as Flask, NumPy, OpenCV, Supervision, Requests, and OpenSSL. These enable the integration of the stereo-vision module with the server manager code and the required mathematical operations.
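To illustrate the stereo-vision side of this integration, the sketch below loads the saved `stereo_params.npz`, rectifies a frame pair, and converts disparity into 3D points. The StereoSGBM matcher settings are assumed defaults, not the project's tuned parameters.

```python
# Sketch: turn a stereo frame pair into depth using the saved calibration.
# Matcher parameters are illustrative assumptions.
import cv2
import numpy as np

params = np.load("stereo_params.npz")

def depth_map(left_bgr, right_bgr):
    # Rectify both views with the maps produced during calibration.
    left = cv2.remap(cv2.cvtColor(left_bgr, cv2.COLOR_BGR2GRAY),
                     params["map1x"], params["map1y"], cv2.INTER_LINEAR)
    right = cv2.remap(cv2.cvtColor(right_bgr, cv2.COLOR_BGR2GRAY),
                      params["map2x"], params["map2y"], cv2.INTER_LINEAR)
    # Semi-global block matching; parameters chosen as plausible defaults.
    matcher = cv2.StereoSGBM_create(minDisparity=0, numDisparities=96,
                                    blockSize=7)
    disparity = matcher.compute(left, right).astype(np.float32) / 16.0
    # Reproject disparity to 3D points (X, Y, Z) via the Q matrix.
    return cv2.reprojectImageTo3D(disparity, params["Q"])
```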
To program the logic of the implemented circuits we also used several Python libraries, such as gpiozero, picamera2, NumPy, and OpenCV. As their names suggest, these let us define the explicit behavior of the GPIO pins and the connected camera modules. The Raspberry Pi 5 must also be set up initially with the necessary libraries and Python packages.
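A sketch of the on-cane sensor and actuator logic with gpiozero follows. The pin numbers, distance thresholds, and urgency mapping are assumptions for illustration, not the project's actual wiring or tuning.

```python
# Sketch of the on-cane sensor/actuator logic. Pin numbers and thresholds
# are illustrative assumptions, not the project's actual wiring.
from time import sleep
from gpiozero import DistanceSensor, PWMOutputDevice

# One HC-SR04 per side; echo/trigger pins are assumed.
left_sonar = DistanceSensor(echo=17, trigger=4, max_distance=2.0)
right_sonar = DistanceSensor(echo=27, trigger=22, max_distance=2.0)

# One vibration motor per side; the PWM duty cycle encodes urgency.
left_motor = PWMOutputDevice(12)
right_motor = PWMOutputDevice(13)

def urgency(distance_m):
    # Closer obstacle -> stronger vibration (0.0 to 1.0).
    return max(0.0, min(1.0, 1.0 - distance_m / 1.5))

while True:
    left_motor.value = urgency(left_sonar.distance)
    right_motor.value = urgency(right_sonar.distance)
    sleep(0.1)
```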
Of course, we also implemented our own Python classes and modules to manage important functionality and objects: SendingClient, and generic classes such as DisplayManager and StereoVissionProcessor.
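A SendingClient along the following lines could capture and upload frames to the server described above. This is a hypothetical sketch: the endpoint, camera configuration, and certificate handling are assumptions, not the class's actual implementation.

```python
# Hypothetical sketch of what a SendingClient might look like; the real
# class's interface is not shown here. Endpoint and camera setup are assumed.
import cv2
import requests
from picamera2 import Picamera2

class SendingClient:
    def __init__(self, server_url, camera_id, camera_num=0):
        self.server_url = server_url
        self.camera_id = camera_id
        self.camera = Picamera2(camera_num)
        self.camera.configure(
            self.camera.create_video_configuration(main={"format": "RGB888"}))
        self.camera.start()

    def send_frame(self):
        # Capture, JPEG-encode, and POST one frame to the Flask server.
        frame = self.camera.capture_array()
        ok, jpeg = cv2.imencode(".jpg", frame)
        if not ok:
            return
        requests.post(f"{self.server_url}/upload/{self.camera_id}",
                      data=jpeg.tobytes(),
                      headers={"Content-Type": "image/jpeg"},
                      verify=False)  # self-signed certificate on the server

# Example use: one client per camera.
# left = SendingClient("https://192.168.0.10:5000", "left", camera_num=0)
# left.send_frame()
```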
[Video Link](https://youtu.be/4yO5TPfKfZM)
- Developed as part of the Embedded Systems, Cyber-Physical Systems and Robotics (INHN0018) course at TUM.
- Thanks to BSVW Heilbronn for field insights
- Built on OpenCV and Ultralytics YOLO
- [Stereo Vision GitHub repository](https://github.com/LearnTechWithUs/Stereo-Vision)