diff --git a/docs/configurations/using-gpu.mdx b/docs/configurations/using-gpu.mdx
new file mode 100644
index 00000000..6c3c106a
--- /dev/null
+++ b/docs/configurations/using-gpu.mdx
@@ -0,0 +1,37 @@
+---
+title: "Using GPU"
+description: "Running Ollama models on GPU"
+---
+
+Presenton supports GPU acceleration when using Ollama models, significantly improving performance, especially for larger models.
+
+To enable GPU support, you need to install and configure the **NVIDIA Container Toolkit**.
+
+### 🛠️ Step 1: Install NVIDIA Container Toolkit
+
+Follow the official guide to install the toolkit:\
+👉 https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
+
+### 🚀 Step 2: Run Presenton with GPU
+
+Once installed, use the `--gpus=all` flag when running the container:
+
+- **Running without environment variables**
+
+```bash
+docker run -it --name presenton --gpus=all -p 5000:80 \
+  -v "./user_data:/app/user_data" \
+  ghcr.io/presenton/presenton:v0.3.0-beta
+```
+
+- **Running with environment variables**
+
+```bash
+docker run -it --name presenton --gpus=all -p 5000:80 \
+  -e LLM="ollama" \
+  -e OLLAMA_MODEL="llama3.2:3b" \
+  -e PEXELS_API_KEY="your_pexels_api_key" \
+  -e CAN_CHANGE_KEYS="false" \
+  -v "./user_data:/app/user_data" \
+  ghcr.io/presenton/presenton:v0.3.0-beta
+```
\ No newline at end of file
diff --git a/docs/docs.json b/docs/docs.json
index 45578027..067a2fd0 100644
--- a/docs/docs.json
+++ b/docs/docs.json
@@ -24,7 +24,8 @@
       {
         "group": "Configurations",
         "pages": [
-          "configurations/environment-variables"
+          "configurations/environment-variables",
+          "configurations/using-gpu"
         ]
       },
       {
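
Note on Step 2: on hosts without an NVIDIA driver, `docker run --gpus=all` typically fails to start the container, so it may help to add the flag only when a GPU is likely present. The sketch below is a dry run that prints the command instead of starting anything. The helper names `gpu_args` and `presenton_cmd` are hypothetical and not part of Presenton, and using `nvidia-smi` on the `PATH` as a signal is only a rough heuristic: it does not prove that the NVIDIA Container Toolkit itself is installed.

```bash
#!/usr/bin/env bash
# Hypothetical helper: emit --gpus=all only if nvidia-smi is on PATH.
# Assumption: nvidia-smi being present is a rough proxy for a usable GPU.
gpu_args() {
  if command -v nvidia-smi >/dev/null 2>&1; then
    printf '%s\n' "--gpus=all"
  fi
}

# Hypothetical helper: assemble the docker command from the page above
# and print it (dry run; no container is started).
presenton_cmd() {
  local cmd=(docker run -it --name presenton)
  local gpu
  gpu="$(gpu_args)"
  if [ -n "$gpu" ]; then
    cmd+=("$gpu")
  fi
  cmd+=(-p 5000:80 -v "./user_data:/app/user_data"
        ghcr.io/presenton/presenton:v0.3.0-beta)
  printf '%s ' "${cmd[@]}"
  printf '\n'
}

presenton_cmd
```

Printing the command first makes it easy to confirm whether `--gpus=all` was included before running it by hand.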