LocalAI is an open-source API and LLM engine for running GGUF, embedding, and image-generation models on CPU or GPU.

Deploying LocalAI

Run LocalAI as a Docker container.

Configure and update models in the Settings menu.