obsidian/wiki/claude-code/lmstudio-lms-cli.md

---
title: "lms — LM Studio CLI"
aliases: [lms-cli, lmstudio-cli]
tags: [lmstudio, cli, local-llm, inference, server]
sources: [raw/lms — LM Studio's CLI.md]
created: 2026-04-30
updated: 2026-04-30
---

# lms — LM Studio CLI

`lms` is LM Studio's built-in CLI utility for managing models, the inference server, and the runtime. Ships with LM Studio — no separate install needed. MIT licensed, open source on GitHub.

## Installation & Verification

```bash
# Already installed with LM Studio — just verify:
lms --help
```

Current version: `v0.0.47`

## Command Reference

| Command | What it does |
|---------|-------------|
| `lms chat` | Start interactive chat with a model in the terminal |
| `lms get` | Search and download models |
| `lms ls` | List models available on disk |
| `lms ps` | List models currently loaded in memory |
| `lms load` | Load a model (with GPU/context options) |
| `lms unload` | Unload a model |
| `lms import` | Import a model file into LM Studio |
| `lms server start/stop` | Control the local API server |
| `lms log` | Stream incoming/outgoing messages for debugging |
| `lms runtime` | Manage and update the inference runtime |
| `lms daemon` | Manage the headless llmster daemon |
| `lms link` | Manage LM Link |
| `lms clone` | Clone an artifact from LM Studio Hub |
| `lms push` | Upload artifact to LM Studio Hub |
| `lms login` | Authenticate with LM Studio |

## Common Workflows

### Server control

```bash
lms server start
lms server stop
```

### List & inspect models

```bash
lms ls        # models on disk (reflects My Models directory)
lms ps        # models currently loaded in memory
```

### Load a model

```bash
# With GPU offload and context size:
lms load [--gpu=max|auto|0.0-1.0] [--context-length=1-N]

# --gpu=1.0 → 100% GPU offload
# With a stable identifier alias:
lms load openai/gpt-oss-20b --identifier="my-model-name"
```

Using `--identifier` keeps the model ID stable across loads — useful when client code hardcodes a model name.

### Unload a model

```bash
lms unload           # unload specific model
lms unload --all     # unload everything
```

### Debug message flow

```bash
lms log stream       # tail all incoming/outgoing API messages live
```

Pairs with [[wiki/claude-code/lmstudio-chat-completions|lmstudio-chat-completions]] for debugging request/response cycles.

## Key Takeaways

- `lms` ships with LM Studio — zero extra install steps
- `lms ps` vs `lms ls`: loaded-in-memory vs on-disk — two different commands
- `--gpu=1.0` forces full GPU offload; `--gpu=auto` lets LM Studio decide
- `--identifier` flag on `lms load` decouples client model names from actual model paths
- `lms log stream` is the fastest way to debug what's hitting the server
- `lms daemon` manages [[wiki/claude-code/lmstudio-headless-service|llmster]] for headless/service deployments
- MIT licensed: safe to embed in scripts and automation

## Related Articles

- [[wiki/claude-code/lmstudio-rest-api|LM Studio REST API]] — all API endpoints
- [[wiki/claude-code/lmstudio-headless-service|Headless Service (llmster)]] — daemon mode for servers
- [[wiki/claude-code/lmstudio-server-settings|Server Settings]] — port, auth, CORS, JIT loading
- [[wiki/claude-code/lmstudio-chat-completions|Chat Completions]] — OpenAI-compat `/v1/chat/completions`
- [[wiki/claude-code/lmstudio-llmster-systemd|llmster systemd unit]] — run llmster at boot on Linux
- [[wiki/claude-code/lmstudio-idle-ttl-auto-evict|Idle TTL & Auto-Evict]] — memory management

## Sources

- lmstudio.ai/docs/cli