Connection Setup Download the KoboldCPP application. Launch KoboldCPP to host the GGML/GGUF model. Configure the connection to the KoboldCPP endpoint in Settings (modifiable at any time to switch models).