A Python application that generates captions for images using an LSTM (Long Short-Term Memory) network and reads them aloud using gTTS (Google Text-to-Speech).
- Clone the repository:
    git clone https://github.com/prtm1908/Assistive-Vision-CaptionBot-LSTM.git
- Install the required dependencies:
    pip install -r requirements.txt
- Download the pretrained model weights from the release page or train your own model.
- Run the captioning script on your image, or try the hosted Streamlit app:
    https://assistive-vision-captionbot.streamlit.app/
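
The read-aloud step can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the helper names `clean_caption` and `speak_caption`, the `startseq`/`endseq` marker tokens, and the `caption.mp3` output path are all assumptions made for the example. Only `gTTS(text=..., lang=...)` and its `save()` method come from the gTTS library itself.

```python
def clean_caption(raw_caption: str,
                  start_token: str = "startseq",
                  end_token: str = "endseq") -> str:
    """Drop the (assumed) start/end marker tokens and collapse extra whitespace."""
    words = [w for w in raw_caption.split() if w not in (start_token, end_token)]
    return " ".join(words)


def speak_caption(caption: str,
                  out_path: str = "caption.mp3",
                  lang: str = "en") -> str:
    """Synthesize the cleaned caption to an MP3 file with gTTS; return the path."""
    # Imported here so clean_caption() works even when gTTS is not installed.
    from gtts import gTTS

    text = clean_caption(caption)
    if not text:
        raise ValueError("caption is empty after cleanup")

    # gTTS calls Google's text-to-speech service, so this needs network access.
    gTTS(text=text, lang=lang).save(out_path)
    return out_path
```

Under these assumptions, calling `speak_caption("startseq a dog runs on grass endseq")` would write the spoken caption to `caption.mp3`, which any audio player can then play back.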