This project implements an Image Caption Generator using Deep Learning techniques.
It combines a Convolutional Neural Network (CNN) for image feature extraction with a Long Short-Term Memory (LSTM) recurrent network that generates descriptive captions in natural language.
- Dataset: Flickr8k (8,000 images with 5 captions each)
- Model: CNN (InceptionV3) + LSTM
- Tech stack: Python, TensorFlow/Keras
- Goal: Generate accurate captions for unseen images by learning the mapping between image features and natural language.
- Image preprocessing and feature extraction using pre-trained InceptionV3 (a short sketch follows this list)
- Caption preprocessing with tokenization and padding (sketched below, after the dataset description)
- Sequence modeling with LSTM (an architecture sketch accompanies the training steps below)
- Evaluation using BLEU scores (a BLEU example appears at the end of this README)
- Inference script to generate captions for custom images
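A minimal sketch of the feature-extraction step, assuming images live under `data/Images/`; the helper name and file path are illustrative, not the repo's actual API:

```python
import numpy as np
from tensorflow.keras.applications.inception_v3 import InceptionV3, preprocess_input
from tensorflow.keras.preprocessing.image import load_img, img_to_array

# Pre-trained InceptionV3 without its classification head; global average
# pooling turns each image into a single 2048-dim feature vector.
feature_extractor = InceptionV3(weights="imagenet", include_top=False, pooling="avg")

def extract_features(image_path):
    # InceptionV3 expects 299x299 RGB input.
    img = load_img(image_path, target_size=(299, 299))
    x = img_to_array(img)
    x = preprocess_input(x)          # scale pixel values to [-1, 1]
    x = np.expand_dims(x, axis=0)    # add a batch dimension
    return feature_extractor.predict(x, verbose=0)[0]  # shape: (2048,)

features = extract_features("data/Images/example.jpg")  # hypothetical file
```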
Project structure:

```
data/             # Dataset: images and captions
src/              # Scripts: utils, feature extraction, model, training, inference
notebooks/        # Interactive notebooks for exploration, training, inference
requirements.txt  # Python dependencies
```
Dataset:

- Images: 8,000 images of everyday scenes.
- Captions: 5 captions per image (`Flickr8k.token.txt`).
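Caption preprocessing relies on Keras tokenization and padding. A minimal sketch; the `startseq`/`endseq` markers are a common convention assumed here, and the sample captions are made up:

```python
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Captions are wrapped in start/end markers so the decoder
# knows where a sentence begins and ends.
captions = [
    "startseq a dog runs through the grass endseq",
    "startseq two children play on the beach endseq",
]

tokenizer = Tokenizer()
tokenizer.fit_on_texts(captions)
vocab_size = len(tokenizer.word_index) + 1  # +1: index 0 is reserved for padding

sequences = tokenizer.texts_to_sequences(captions)
max_length = max(len(s) for s in sequences)
padded = pad_sequences(sequences, maxlen=max_length, padding="post")
```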
Instructions:
- Download the Flickr8k dataset.
- Place images in `data/Images/` and captions in `data/Flickr8k.token.txt`.
- Clone the repo:

  ```bash
  git clone <your_repo_link>
  cd Image-Caption-Generator
  ```
- Install dependencies: `pip install -r requirements.txt`
- Run `notebooks/03_model_training.ipynb` to train the model.
- Training and validation loss will be plotted automatically.
- Model weights will be saved to `models/model_weights.h5`.
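For orientation, here is a sketch of the kind of "merge" CNN + LSTM architecture described above; the layer sizes and the `vocab_size`/`max_length` values are illustrative assumptions, not the notebook's exact configuration:

```python
from tensorflow.keras.layers import Input, Dense, Dropout, Embedding, LSTM, add
from tensorflow.keras.models import Model

vocab_size = 8000  # assumption: size of the caption vocabulary
max_length = 34    # assumption: longest tokenized caption

# Image branch: the pre-extracted 2048-dim InceptionV3 feature vector.
img_input = Input(shape=(2048,))
img_dense = Dense(256, activation="relu")(Dropout(0.5)(img_input))

# Text branch: embed the partial caption and run it through an LSTM.
txt_input = Input(shape=(max_length,))
txt_embed = Embedding(vocab_size, 256, mask_zero=True)(txt_input)
txt_lstm = LSTM(256)(Dropout(0.5)(txt_embed))

# Merge both branches and predict the next word of the caption.
decoder = Dense(256, activation="relu")(add([img_dense, txt_lstm]))
output = Dense(vocab_size, activation="softmax")(decoder)

model = Model(inputs=[img_input, txt_input], outputs=output)
model.compile(loss="categorical_crossentropy", optimizer="adam")
```

In this design the model predicts one word at a time: each training example pairs the image features and a partial caption with the next word as the target.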
- Run `notebooks/04_inference_demo.ipynb` to generate captions for sample images (a minimal decoding sketch follows below).
- Visualize results inline or save them in `examples/`.
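Under the hood, caption generation is typically a greedy decoding loop. A minimal sketch, assuming a trained `model`, a fitted `tokenizer`, the training-time `max_length`, and a 2048-dim `photo` feature vector; all names here are assumptions:

```python
import numpy as np
from tensorflow.keras.preprocessing.sequence import pad_sequences

def generate_caption(model, tokenizer, photo, max_length):
    # Greedy decoding: feed the image features plus the caption so far,
    # then repeatedly append the most probable next word.
    text = "startseq"
    for _ in range(max_length):
        seq = tokenizer.texts_to_sequences([text])[0]
        seq = pad_sequences([seq], maxlen=max_length, padding="post")
        probs = model.predict([np.expand_dims(photo, 0), seq], verbose=0)[0]
        word = tokenizer.index_word.get(int(np.argmax(probs)))
        if word is None or word == "endseq":
            break
        text += " " + word
    return text.replace("startseq", "").strip()
```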
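BLEU evaluation can be done with NLTK's `corpus_bleu`; a small self-contained example with made-up token lists:

```python
from nltk.translate.bleu_score import corpus_bleu

# One list of tokenized reference captions per image, and one
# tokenized generated caption per image.
references = [[
    ["a", "dog", "runs", "through", "the", "grass"],
    ["a", "brown", "dog", "is", "running", "outside"],
]]
hypotheses = [["a", "dog", "runs", "through", "the", "green", "grass"]]

print("BLEU-1: %.3f" % corpus_bleu(references, hypotheses, weights=(1.0, 0, 0, 0)))
print("BLEU-4: %.3f" % corpus_bleu(references, hypotheses, weights=(0.25, 0.25, 0.25, 0.25)))
```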