Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible...
The most flexible way to serve AI/ML models in production - Build Model ...