Image-to-Audio Assistant for Visual Disabilities
Vizable is an app developed for individuals with visual disabilities. The app converts visual input, in the form of images, to audio in real time so that the user can identify the object in front of them.
vizable_video_demo.mp4
Demo Video
It was made to serve a frequently underserved demographic. The image-to-speech technology is made more accessible by incorporating a real-time translation tool that lets users receive the audio output in translated form.
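The translation layer is not detailed here; as a rough sketch, assuming the `deep-translator` package and a hypothetical `translate_description` helper (neither is confirmed by this project), it could look like this:

```python
# Minimal sketch of the translation step before text-to-speech.
# deep-translator and the helper name are illustrative assumptions,
# not necessarily what Vizable uses.
from deep_translator import GoogleTranslator

def translate_description(text: str, target_lang: str = "es") -> str:
    """Translate an English image description into the user's language
    before it is converted to audio."""
    return GoogleTranslator(source="en", target=target_lang).translate(text)

if __name__ == "__main__":
    print(translate_description("A red coffee mug on a wooden table", "fr"))
```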
We primarily used the CLIP (Contrastive Language-Image Pre-training) model developed by OpenAI and tested it on the VizWiz dataset for our use case. The VizWiz dataset comprises images taken by individuals with visual impairments, so we considered it a useful dataset to test the model against.
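As an illustration of how CLIP can be queried for this use case, the sketch below runs zero-shot object identification with OpenAI's `clip` package; the candidate labels and image path are placeholders, not the exact prompts or files used in this project:

```python
# Hedged sketch: zero-shot object identification with OpenAI's CLIP (ViT-B/32).
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Placeholder candidate labels and image; swap in real VizWiz data.
labels = ["a coffee mug", "a bottle of medicine", "a can of soup", "a remote control"]
image = preprocess(Image.open("example_vizwiz_image.jpg")).unsqueeze(0).to(device)
text = clip.tokenize([f"a photo of {label}" for label in labels]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1).cpu().numpy()[0]

# Report the best-matching label and its confidence.
print(labels[probs.argmax()], float(probs.max()))
```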
We conducted accuracy evaluations of how CLIP performed on the VizWiz dataset. After reaching a satisfactory accuracy threshold, we tested the model on images captured by our own phones.
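A minimal sketch of such an accuracy evaluation, assuming each sample is an (image path, ground-truth label) pair rather than the exact VizWiz annotation format, could look like this:

```python
# Hedged sketch of an accuracy loop: count how often CLIP's top-scoring
# label matches the ground truth. The sample format is illustrative,
# not the actual VizWiz annotation layout.
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

def top_label(image_path: str, candidate_labels: list[str]) -> str:
    """Return the candidate label CLIP scores highest for this image."""
    image = preprocess(Image.open(image_path)).unsqueeze(0).to(device)
    text = clip.tokenize([f"a photo of {l}" for l in candidate_labels]).to(device)
    with torch.no_grad():
        logits, _ = model(image, text)
    return candidate_labels[logits.softmax(dim=-1).argmax().item()]

def accuracy(samples: list[tuple[str, str]], candidate_labels: list[str]) -> float:
    """samples: list of (image_path, ground_truth_label) pairs."""
    correct = sum(top_label(path, candidate_labels) == truth for path, truth in samples)
    return correct / len(samples)
```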
After completing these two parts, our team built a full-stack mobile application using React Native. We incorporated the CLIP model into our backend and designed our front end so that users can capture images and hear real-time audio narration of what they captured.
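One possible shape for the backend route that the React Native front end could call is sketched below; Flask and gTTS are assumptions for illustration, not necessarily the stack used in the actual app:

```python
# Hedged sketch of a backend endpoint: accept an uploaded photo, run CLIP
# against a fixed label set, and return spoken audio. Flask and gTTS are
# illustrative choices, not confirmed project dependencies.
import io

import clip
import torch
from flask import Flask, request, send_file
from gtts import gTTS
from PIL import Image

app = Flask(__name__)
device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Placeholder label set; the real app would use a richer vocabulary.
LABELS = ["a coffee mug", "a bottle of medicine", "a can of soup", "a remote control"]
TEXT_TOKENS = clip.tokenize([f"a photo of {l}" for l in LABELS]).to(device)

@app.route("/narrate", methods=["POST"])
def narrate():
    # The front end posts the captured photo as multipart form data.
    image = Image.open(request.files["image"].stream).convert("RGB")
    image_tensor = preprocess(image).unsqueeze(0).to(device)
    with torch.no_grad():
        logits, _ = model(image_tensor, TEXT_TOKENS)
    description = LABELS[logits.softmax(dim=-1).argmax().item()]

    # Convert the description to speech and stream it back as MP3.
    audio = io.BytesIO()
    gTTS(text=f"This looks like {description}", lang="en").write_to_fp(audio)
    audio.seek(0)
    return send_file(audio, mimetype="audio/mpeg")

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```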
This project was completed with the guidance of our project advisor, SouYoung.
This project was done in collaboration with Aracely Moreno, Sarah Branch, and Naila Thevenot.