NanoLLM Save

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

Project README

NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

[!NOTE]
See dusty-nv.github.io/NanoLLM for docs and Jetson AI Lab for tutorials.

Latest Release: 24.5.1 (dustynv/nano_llm:24.5.1-r36.2.0)

Open Source Agenda is not affiliated with "NanoLLM" Project. README Source: dusty-nv/NanoLLM
Stars
81
Open Issues
6
Last Commit
1 week ago
Repository
License
MIT

Open Source Agenda Badge

Open Source Agenda Rating