introduction-guide

Technical Overview

The AnythingLLM Desktop Assistant is an operating-system-level overlay interface for chat, agent execution, and Model Context Protocol (MCP) tasks.

Platform	Support Level	Screen Capture / OCR Features
MacOS (Silicon/Intel)	Full	Enabled
Windows (x64/ARM64)	Full	Enabled
Linux (x64/ARM64)	Limited	Disabled (No screen, application, or area capture)

Open/Toggle Overlay (Default):
- MacOS: CMD + /
- Windows/Linux: CTRL + /
Modify Shortcut: Navigate to Settings -> Desktop Assistant in the main AnythingLLM menu.

The Desktop Assistant utilizes screen capture and image data for context.

Ensure your selected cloud model provider natively supports image input. If the model does not support images, API errors will occur.

If using the Default LLM Provider, Ollama, or LM Studio, AnythingLLM processes images locally on-device.

If the local model lacks vision capabilities: The system performs local Optical Character Recognition (OCR) on the screen capture and sends the extracted text to the model.