This repository was archived by the owner on Jun 24, 2024. It is now read-only.

Description
I really like this project, especially the plans to support not only LLaMA-based models but to build a more general LLM runtime.
I was wondering whether there are any plans to create Python bindings for this project, similar to e.g. llama-cpp-python.
A large part of the ML community uses Python as its primary language and would probably benefit from being able to run model inference on CPU-based nodes with this package. Bindings would also simplify integrating models run with llama-rs into existing solutions such as LangChain.