llm_api

FastAPI server for a RAG project. It exposes an API for calling LLM inference running on my PC with GPUs.

The Dockerfile needs a HUGGINGFACE_TOKEN build argument to get access to Llama 3.2. Build the image with:

docker build --build-arg HUGGINGFACE_TOKEN="Token Here" -t llm-api .

Then start the container:

docker run --name llm-api -p 5002:5002 llm-api
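
Once the container is up, a quick way to check that the server responds is to list its routes. The sketch below assumes the app keeps FastAPI's default OpenAPI schema at /openapi.json (FastAPI serves it unless the app disables it) and that the server is reachable on localhost at the port published by the docker run command. The helper names fetch_schema and list_routes are illustrative and not part of this repository.

```python
import json
from urllib.request import urlopen

# Port taken from the docker run command; localhost is an assumption.
BASE_URL = "http://localhost:5002"

HTTP_METHODS = {"get", "post", "put", "patch", "delete"}


def fetch_schema(base_url: str = BASE_URL) -> dict:
    """Download the OpenAPI schema that FastAPI serves by default at /openapi.json."""
    with urlopen(f"{base_url}/openapi.json", timeout=10) as resp:
        return json.load(resp)


def list_routes(schema: dict) -> list[str]:
    """Return sorted 'METHOD /path' strings for each operation in the schema."""
    routes = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            # Skip non-operation keys such as path-level "parameters".
            if method.lower() in HTTP_METHODS:
                routes.append(f"{method.upper()} {path}")
    return sorted(routes)
```

With the container running, calling list_routes(fetch_schema()) returns the endpoints the API actually exposes, so no route names need to be guessed.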
