obsidian/wiki/infrastructure/_index.md
2026-04-27 18:04:42 +01:00


tags: infrastructure, index
updated: 2026-04-27

Infrastructure — Index

Server inventory for all SSH-accessible machines. Last audited: 2026-04-24. Update this section whenever you SSH in and notice changes.

Oliver Agency Servers (GCP)

| Article | Server | IP | Role |
|---|---|---|---|
| wiki/infrastructure/server-optical | optical-web-1 | 10.220.168.5 | Main AI prod — 35+ apps, systemd |
| wiki/infrastructure/server-optical-dev | optical-dev | 10.220.168.9 | Docker staging — ppt-tool, cc-dashboard, semblance, 15+ apps |
| wiki/infrastructure/server-optical-prod | optical-prod | 10.220.168.8 | Minimal / secondary prod |
| wiki/infrastructure/server-librechat | librechat-dev + prod | 10.220.168.2 / .4 | LibreChat AI chat platform (both envs) |
| wiki/infrastructure/server-modocmms | modcomms-01 | 10.220.168.6 | ModoCMMS staging + prod (Apache) |
| wiki/infrastructure/server-baic | web-03 | 10.220.72.13 | Main web host — 40+ domains, oliver.agency |
| wiki/infrastructure/server-box-cli | box-cli-01 | 10.220.176.3 | Ford/L'Oréal hotfolder, CentOS 7, 1TB NFS |

Personal / Aimpress

| Article | Server | IP | Role |
|---|---|---|---|
| wiki/infrastructure/server-aimpress | c2-15-uk1 | 57.128.160.249 | Aimpress VPS — Mailcow, n8n, Traefik |
| wiki/infrastructure/server-pve | pve | 192.168.1.48 | Proxmox homelab — 8 containers + Kali VM |

Quick Reference

| Article | Purpose |
|---|---|
| wiki/infrastructure/ssh-aliases | All aliases, IPs, keys, health-check one-liner |

⚠ Known Issues

Add the date when you discover an issue. Move the entry to Resolved once it is fixed, then delete it two weeks later.

🔴 Critical

  • optical 2026-04-24 — DISK 99% FULL — 5.9 GB free of 533 GB. Top offenders: /opt/ferrero-opentext 12 GB, /opt/backups 8.9 GB, /opt/sandbox-notebookllamalm-nextjs 8.5 GB — action needed
  • optical 2026-04-24 — SSL cert expires May 8 2026 — ai-sandbox.oliver.solutions — renew before May 8
  • optical 2026-04-24 — notebookllama-backend.service FAILED — crashed, still taking 8.5 GB of disk
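
The certificate deadline above could be tracked with a small check rather than by memory. The following is a minimal sketch, not a script that exists on any of these servers: the function names are hypothetical, and the time of day in the usage note is assumed because the entry records only the date.

```python
import socket
import ssl
from datetime import datetime, timezone


def days_until(not_after: str, now: datetime) -> int:
    """Whole days from `now` until a certificate's notAfter timestamp."""
    # ssl.cert_time_to_seconds parses the "May  8 00:00:00 2026 GMT" form
    # that getpeercert() reports in its "notAfter" field.
    expires = datetime.fromtimestamp(
        ssl.cert_time_to_seconds(not_after), tz=timezone.utc
    )
    return (expires - now).days


def fetch_not_after(host: str, port: int = 443, timeout: float = 5.0) -> str:
    """Return the notAfter string of the certificate `host` presents.

    Uses the default verifying context, so an already expired or
    untrusted certificate raises ssl.SSLError instead of returning.
    """
    ctx = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with ctx.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()["notAfter"]
```

Calling `days_until(fetch_not_after("ai-sandbox.oliver.solutions"), datetime.now(timezone.utc))` would give the remaining days for the host named above. With an assumed midnight expiry on May 8 2026 and the audit date 2026-04-24, the result is 14.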

🟠 Security

  • optical 2026-04-24 — All databases bound to 0.0.0.0: Redis ×3 (:6379/:6380/:6399), PostgreSQL ×3 (:5432/:5433/:5437), MongoDB ×3 (:27017/:27019/:27021), Neo4j (:7474/:7475/:7687/:7688)
  • librechat-prod 2026-04-24 — MongoDB :27017 on 0.0.0.0 — publicly exposed, no auth config found
  • baic 2026-04-24 — PostgreSQL :5432 + rpcbind :111 on 0.0.0.0
  • optical-dev 2026-04-24 — PostgreSQL :5436/:5491/:5493 + olivas :8000 + cc-dashboard :8800 on 0.0.0.0
  • baic 2026-04-21 — Grafana default admin:admin password unchanged
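
Findings like the 0.0.0.0 bindings above can be re-checked on each audit by filtering the output of `ss -tln`. The sketch below assumes the usual column layout of that command (state, two queue columns, local address, peer address); the function name and the sample lines are made up for illustration and are not captured from any of these hosts.

```python
def wildcard_listeners(ss_output: str) -> list[int]:
    """Return the sorted ports whose listening socket accepts any interface."""
    wildcard_hosts = {"0.0.0.0", "*", "[::]"}
    ports = set()
    for line in ss_output.splitlines():
        fields = line.split()
        # Expected columns: State Recv-Q Send-Q Local:Port Peer:Port
        # The header row and any non-LISTEN rows are skipped.
        if len(fields) < 5 or fields[0] != "LISTEN":
            continue
        # rpartition splits on the last colon, so "[::]:27017" works too.
        host, _, port = fields[3].rpartition(":")
        if host in wildcard_hosts and port.isdigit():
            ports.add(int(port))
    return sorted(ports)


# Illustrative input only: one wildcard IPv4 bind, one loopback bind,
# and one wildcard IPv6 bind.
SAMPLE = """\
State  Recv-Q Send-Q Local Address:Port Peer Address:Port
LISTEN 0      511    0.0.0.0:6379       0.0.0.0:*
LISTEN 0      244    127.0.0.1:5432     0.0.0.0:*
LISTEN 0      4096   [::]:27017         [::]:*
"""

if __name__ == "__main__":
    print(wildcard_listeners(SAMPLE))
```

A port that shows up here is reachable on every interface of the machine. Whether it is reachable from outside still depends on firewall rules, which this check does not look at.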

🟡 Capacity

  • librechat-prod 2026-04-24 — data directory 197 GB on a 484 GB disk (65% used overall) — monitor growth
  • pve local-lvm 2026-04-24 — 71% full (100/141 GB) — monitor
  • aimpress 2026-04-24 — 26.58 GB reclaimable Docker images (docker image prune -a)
  • baic 2026-04-24 — large vhosts: ustudio.global 22 GB, ustudiostaging2 19 GB, ie.oliver.agency 13 GB
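
The percentages quoted in the Critical and Capacity lists follow from the used and total sizes, so they can be recomputed instead of copied by hand. This is a rough sketch: the function names are hypothetical, and the 65 and 95 percent thresholds are assumptions chosen for illustration, not a policy stated on this page.

```python
import shutil


def percent_used(total: float, free: float) -> int:
    """Share of capacity in use, rounded to the nearest whole percent."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * (total - free) / total)


def severity(pct: int, warn: int = 65, critical: int = 95) -> str:
    """Map a usage percentage to a label; both thresholds are assumed."""
    if pct >= critical:
        return "critical"
    if pct >= warn:
        return "warn"
    return "ok"


def check_mount(path: str = "/") -> tuple[int, str]:
    """Measure a live mount point with shutil.disk_usage."""
    usage = shutil.disk_usage(path)
    pct = percent_used(usage.total, usage.free)
    return pct, severity(pct)
```

With the figures recorded above, `percent_used(533, 5.9)` gives 99 for optical and `percent_used(141, 41)` gives 71 for pve local-lvm. Values from `df` can differ slightly because some filesystems reserve blocks for root.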

🔵 Maintenance

  • optical-dev 2026-04-24 — hp-prod-tracker + dow-prod-tracker containers unhealthy (healthcheck misconfigured, apps running fine)
  • box-cli 2026-04-24 — CentOS 7 EOL since Jun 2024 — needs OS migration
  • pve 2026-04-21 — Uptime Kuma webhook to monitoring-agent not yet configured

Resolved

  • pve CT 102 (docker) — resolved 2026-04-24 — Docker data-root moved to /mnt/data/docker, now 51%
  • pve CT 105 (immich) — resolved 2026-04-24 — PostgreSQL + cache moved to data-hdd, now 62%
  • pve — resolved 2026-04-24 — Proxmox security updates applied (libngtcp2, cluster libs)
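
The two-week retention rule for Resolved entries can be applied mechanically, since each entry carries its resolution date. The following is a minimal sketch that assumes every entry contains the text "resolved YYYY-MM-DD" as the ones above do. The function name is hypothetical, and treating an entry as due on exactly day 14 is an assumption about the rule.

```python
import re
from datetime import date, timedelta

# Matches the "resolved 2026-04-24" fragment used in the Resolved list.
RESOLVED_DATE = re.compile(r"resolved (\d{4}-\d{2}-\d{2})")


def stale_resolved(lines: list[str], today: date, keep_days: int = 14) -> list[str]:
    """Return the Resolved entries that are old enough to delete."""
    cutoff = today - timedelta(days=keep_days)
    stale = []
    for line in lines:
        match = RESOLVED_DATE.search(line)
        # Entries without a recognisable date are left alone.
        if match and date.fromisoformat(match.group(1)) <= cutoff:
            stale.append(line)
    return stale
```

Run against the three entries above, nothing is returned until `today` reaches 2026-05-08, at which point all three are listed for deletion.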