1. Download Model Archives

Download the required model ZIP archive:

  • Llama-3.2-3B-Chat (8k context): https://cdn.anythingllm.com/support/qnn/llama_v3_2_3b_chat_8k.zip
  • Llama-3.2-3B-Chat (16k context): https://cdn.anythingllm.com/support/qnn/llama_v3_2_3b_chat_16k.zip
  • Llama-3.1-8B-Chat (8k context): https://cdn.anythingllm.com/support/qnn/llama_v3_1_8b_chat_8k.zip
  • Phi 3.5-mini-instruct (4k context): https://cdn.anythingllm.com/support/qnn/phi_3_5_mini_instruct_4k.zip

2. Extraction and Placement

  1. Navigate to the desktop storage directory.
  2. Create the target directory if it does not exist: models/QNN
  3. Move the downloaded ZIP archive into models/QNN.
  4. Extract the ZIP archive.

Target Directory Structure

models/QNN/
└── llama_v3_2_3b_chat_8k/
    ├── genie_config.json
    ├── htp_backend_etc.bin
    ├── related-model-bin-file.bin
    └── tokenizer.json

3. Apply Changes

  1. Restart the desktop application.
  2. Select the model via the graphical user interface.