Image Gallery Service

A FastAPI-based service for managing and processing images with optional AI-powered caption generation.

Features

Image management (upload, list, retrieve)
Image cropping with customizable target sizes
Optional AI-powered caption generation
Image export functionality
RESTful API endpoints

Prerequisites

Python 3.12+
FastAPI
Pillow
PyTorch (optional, for AI caption generation)
Unsloth (optional, for AI caption generation)

Installation

Clone the repository:

git clone <repository-url>
cd gallery-project

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Running

wsl sudo mount -t drvfs D: /mnt/d

pass 9571

cd source unsloth/bin/activate python /mnt/c/playground/imageGalleryServer/main.py

Configuration

The service uses the following configuration:

IMAGES_DIR: Directory where images are stored (default: /Users/stuartleal/gallery-project/images)
IMAGES_PER_PAGE: Number of images per page in pagination (default: 10)
CAPTION_GENERATOR: Type of caption generator to use (DUMMY or UNSLOTH)

API Endpoints

Image Management

GET /images: List images with pagination
- Query parameters:
  - page: Page number (default: 1)
  - page_size: Images per page (default: 10)
- Returns: List of images with metadata
GET /images/{image_id}: Get a specific image
- Returns: Image file

Caption Management

GET /images/{image_id}/caption: Get image caption
- Returns: Caption text
POST /images/{image_id}/caption: Save image caption
- Body: {"caption": "string"}
POST /images/{image_id}/generate-caption: Generate caption using AI
- Query parameters:
  - prompt: Optional prompt for caption generation
- Returns: Generated caption

Image Cropping

GET /images/{image_id}/preview/{target_size}: Get image preview
- Returns: Scaled image preview

POST /images/{image_id}/crop: Crop image

Body:

{
  "targetSize": number,
  "normalizedDeltas": {
    "x": number,
    "y": number
  }
}

Returns: Cropped image

Export

POST /api/export-images: Export selected images
- Body:
```
{
  "imageIds": ["string"]
}
```
- Returns: ZIP file containing selected images and their captions

AI Caption Generation

The service supports two modes for caption generation:

Dummy Mode: Generates simple, predefined captions
- No additional dependencies required
- Fast and lightweight
- Good for testing and development
- Example: "A picture of something" or "A picture of {prompt}"
AI Mode: Uses Unsloth's Llama 3.2 Vision model
- Requires NVIDIA or Intel GPU
- More sophisticated captions
- Higher resource requirements
- Dependencies:
  - PyTorch
  - Unsloth
  - Transformers
  - Accelerate

To switch between modes, modify the CAPTION_GENERATOR setting in config.py:

# For dummy mode (default)
CAPTION_GENERATOR = CaptionGeneratorType.DUMMY

# For AI mode
CAPTION_GENERATOR = CaptionGeneratorType.UNSLOTH

File Structure

gallery-project/
├── image_server/
│   ├── main.py              # Main FastAPI application
│   ├── caption_generator.py # Caption generation logic
│   ├── config.py           # Configuration settings
│   └── requirements.txt    # Python dependencies
├── images/                 # Image storage directory
└── README.md              # This documentation

Running the Service

Start the server:

cd image_server
python main.py

The server will start at http://localhost:4322

Development

Adding New Features

Create new endpoints in main.py
Add corresponding models in the Models section
Implement business logic in separate modules
Update documentation

Testing

Install test dependencies:

pip install -r requirements-test.txt

Run tests:

pytest

Contributing

Fork the repository
Create a feature branch
Commit your changes
Push to the branch
Create a Pull Request

License

[Your chosen license]

Support

For support, please open an issue or contact [your contact information].

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
caches		caches
controllers		controllers
design_docs		design_docs
helper_scripts		helper_scripts
models		models
pytorch-openpose		pytorch-openpose
services		services
test_pose_results/results/pose_extraction/20250721_151017_19a43841		test_pose_results/results/pose_extraction/20250721_151017_19a43841
unsloth_compiled_cache		unsloth_compiled_cache
v2_migration		v2_migration
.gitignore		.gitignore
README.md		README.md
caption_generator.py		caption_generator.py
config.py		config.py
filter_manager.py		filter_manager.py
main.py		main.py
requirements.txt		requirements.txt
test_canny_clustering.py		test_canny_clustering.py
test_feature_visualization.py		test_feature_visualization.py
yolo11n-pose.pt		yolo11n-pose.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image Gallery Service

Features

Prerequisites

Installation

Running

pass 9571

Configuration

API Endpoints

Image Management

Caption Management

Image Cropping

Export

AI Caption Generation

File Structure

Running the Service

Development

Adding New Features

Testing

Contributing

License

Support

About

Uh oh!

Releases

Packages

Languages

slealq/imageGalleryServer

Folders and files

Latest commit

History

Repository files navigation

Image Gallery Service

Features

Prerequisites

Installation

Running

pass 9571

Configuration

API Endpoints

Image Management

Caption Management

Image Cropping

Export

AI Caption Generation

File Structure

Running the Service

Development

Adding New Features

Testing

Contributing

License

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages