MS-S1 MAX Server¶
Setup guide for the MS-S1 MAX mini-PC running Ubuntu Server with ZFS, KVM, and Docker.
AI-Authored, Selectively Verified
This site was initially generated by Claude (Anthropic) and is being progressively reviewed against this specific hardware (MS-S1 MAX, AMD Strix Halo, Ubuntu 26.04 LTS). The Ubuntu install path, the ZFS section, the AI/ROCm guidance, and the rebuild checklist have been manually reviewed; some operational sections remain unverified. Always review root/sudo commands before executing.
Project Goals¶
Build a clean, minimal Ubuntu Server host that:
- Uses ZFS for all important data
- Runs full desktop OSes (Windows 11, Linux) via KVM/QEMU with GPU passthrough
- Runs services (Nextcloud, Plex, etc.) via Docker
- Remains rebuildable, understandable, and low-maintenance
The Hardware¶
This guide is built around the Minisforum MS-S1 MAX, a mini-PC featuring AMD's Strix Halo APU - a chip that combines CPU and GPU on a single die with access to all system memory.
AMD Ryzen AI Max+ 395 (Strix Halo APU):
+--------------------------------------------------+
| |
| +-------------+ +------------------+ |
| | Zen 5 | | RDNA 3.5 | |
| | 16 cores | | 40 CUs | |
| | 32 threads| | (gfx1151) | |
| +------+------+ +--------+---------+ |
| | | |
| +------------+-------------+ |
| | |
| +------------v-------------+ |
| | Unified Memory | |
| | 128GB LPDDR5X-8000 | |
| | Quad-channel, soldered | |
| | ~256 GB/s peak | |
| +--------------------------+ |
| |
+--------------------------------------------------+
| Component | Specification | Why It Matters |
|---|---|---|
| CPU | Ryzen AI Max+ 395 (Strix Halo, 16C / 32T) | Fast prompt processing |
| GPU | Radeon 8060S, RDNA 3.5 (40 CUs, gfx1151) | Accelerated inference |
| RAM | 128GB LPDDR5X-8000, quad-channel | Fits 70B+ models; ~256 GB/s peak |
| Architecture | Unified memory | No VRAM bottleneck |
Unlike discrete GPUs limited to 24GB VRAM, the APU shares all 128GB with both CPU and GPU, enabling models that won't fit on any consumer graphics card. The LPDDR5X-8000 quad-channel bus delivers roughly 3x the bandwidth of typical desktop DDR5, which is what makes integrated-GPU inference practical at this scale.
Hardware details Hardware architecture
Quick Links¶
-
Getting Started
Hardware specs, architecture overview, and prerequisites
-
Ubuntu Server
Installation, post-install config, networking, and firewall
-
ZFS Storage
Pool creation, datasets, and snapshot policies
-
Virtualization
KVM setup, GPU passthrough, and Windows 11 VM
-
Docker Services
Nextcloud, Plex, and other containerized services
-
Operations
Backup, recovery, and rebuild procedures
-
AI & Local LLMs
Run local LLMs with Ollama, llama.cpp, and AI coding tools
Design Philosophy¶
| Principle | Implementation |
|---|---|
| Host OS is boring | Ubuntu Server LTS, no desktop, SSH-only |
| Data lives outside containers | ZFS is the source of truth |
| VMs are first-class | KVM/QEMU with GPU passthrough |
| Services are containerized | Docker + Compose with bind mounts |
| Everything is recoverable | Reinstall host without touching data |
What This Project Avoids¶
- ZFS on root
- Desktop environment on the host
- Manual iptables rules
- Docker volumes for critical data
- Nested virtualization
- "All-in-one" hypervisor distros