QKV Core
Popular repositories Loading
Repositories
Showing 2 of 2 repositories
- QKV-Core Public
"Adaptive Hybrid Quantization Framework for deploying 7B+ LLMs on low-VRAM devices (e.g., GTX 1050). Features surgical block alignment and Numba-accelerated inference.
QKV-Core/QKV-Core’s past year of commit activity