---
title: "Using GPU"
description: "Running Ollama models on GPU"
---
Presenton supports GPU acceleration when using Ollama models, which significantly improves performance, especially for larger models.

To enable GPU support, you need to install and configure the **NVIDIA Container Toolkit**.
### 🛠️ Step 1: Install NVIDIA Container Toolkit
Follow the official guide to install the toolkit:\
👉 https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
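
Before moving on, it can help to confirm that Docker is actually able to pass the GPU through to a container. The following is a minimal sketch, not part of the official Presenton instructions: `check_gpu_runtime` is a hypothetical helper name, and `ubuntu` is used only as a small throwaway image for the test.

```bash
# Hypothetical helper: run nvidia-smi inside a throwaway container to confirm
# that the NVIDIA Container Toolkit is wired into Docker.
check_gpu_runtime() {
  if ! command -v docker >/dev/null 2>&1; then
    echo "docker not found on PATH" >&2
    return 1
  fi
  # --rm removes the container afterwards; --gpus=all requests every GPU.
  docker run --rm --gpus=all ubuntu nvidia-smi
}
```

Calling `check_gpu_runtime` should print the usual `nvidia-smi` table listing your GPU. If it fails with an error instead, the toolkit is probably not configured yet, and it is worth revisiting the install guide before starting Presenton.
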
### 🚀 Step 2: Run Presenton with GPU
Once installed, use the `--gpus=all` flag when running the container:
- **Running without environment variables**
```bash
docker run -it --name presenton --gpus=all -p 5000:80 \
  -v "./user_data:/app/user_data" \
  ghcr.io/presenton/presenton:v0.3.0-beta
```
- **Running with environment variables**
```bash
docker run -it --name presenton --gpus=all -p 5000:80 \
  -e LLM="ollama" \
  -e OLLAMA_MODEL="llama3.2:3b" \
  -e PEXELS_API_KEY="your_pexels_api_key" \
  -e CAN_CHANGE_KEYS="false" \
  -v "./user_data:/app/user_data" \
  ghcr.io/presenton/presenton:latest
```
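
Once the container is up, you may want to check that it can really see the GPU. The sketch below assumes the container name `presenton` from the commands above and that `nvidia-smi` is made available inside the container by the NVIDIA runtime, which is the usual behaviour with `--gpus=all` but is not something this page guarantees. `check_container_gpu` is a hypothetical helper name.

```bash
# Hypothetical helper: run nvidia-smi inside an already running container.
# The container name defaults to "presenton", matching --name above.
check_container_gpu() {
  local name="${1:-presenton}"
  if ! command -v docker >/dev/null 2>&1; then
    echo "docker not found on PATH" >&2
    return 1
  fi
  docker exec "$name" nvidia-smi
}
```

Run `check_container_gpu` from a second terminal while Presenton is running. If the GPU is listed, Ollama inside the container should be able to use it. If the command is missing or no device appears, the container was most likely started without `--gpus=all`.
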