LM Studio Headless / Service Mode

GUI-less operation of LM Studio: run as a background daemon, start on machine login, and load models on demand via JIT.

Two Approaches

Approach	Best For	GUI Required?
llmster (recommended)	Linux servers, cloud, GPU rigs, headless machines	No
Desktop app headless mode	Machines with a GUI where app is already installed	Yes (hidden to tray)

llmster is the core of the LM Studio desktop app, repackaged as a server-native daemon. No GUI dependency.

# Linux / Mac
curl -fsSL https://lmstudio.ai/install.sh | bash

# Windows (PowerShell)
irm https://lmstudio.ai/install.ps1 | iex

lms daemon up

To auto-start on Linux boot, configure it as a Linux Startup Task (see LM Studio docs).
Full CLI reference: lms daemon --help

Works on Mac, Windows, Linux (with GUI). Useful if the desktop app is already installed.

lms server start

Last server state is saved and restored automatically on launch.

Applies to both options. Useful when using LM Studio as a backend for other tools (Open WebUI, Claude Code, custom apps).

JIT State	`/v1/models` returns	Inference behavior
ON	All downloaded models	Auto-loads model into VRAM on first call
OFF	Only models in VRAM	Must manually load model first

JIT-loaded models are auto-evicted after a period of inactivity — see wiki/claude-code/lmstudio-idle-ttl-auto-evict for TTL settings and per-request ttl field.

llmster is the preferred headless path — works on servers and CI without any GUI
Desktop headless mode is a quick option for developer machines already running the app
JIT loading eliminates manual lms load calls; models are loaded on first inference request
JIT-loaded models auto-unload after inactivity (configurable TTL)
Use lms server start to programmatically control the REST server state
The OpenAI-compatible REST API (/v1/...) is available in both modes — see wiki/claude-code/lmstudio-openai-compat-endpoints and wiki/claude-code/lmstudio-rest-api

wiki/claude-code/lmstudio-rest-api — all endpoints and lifecycle management
wiki/claude-code/lmstudio-idle-ttl-auto-evict — memory management for JIT-loaded models
wiki/claude-code/lmstudio-openai-compat-endpoints — drop-in base_url swap for any OpenAI client
wiki/claude-code/lmstudio-anthropic-compat — redirect Claude Code / Anthropic SDK to local LM Studio