Skip to content

Hardware

Minisforum MS-S1 MAX

The MS-S1 MAX is a compact mini-PC suitable for a home server with virtualization capabilities and local AI inference.

Authoritative spec sheet: minisforum.com/products/ms-s1-max.

Why This Hardware

The choice of an APU (Accelerated Processing Unit) over a discrete GPU setup is deliberate:

Memory capacity over bandwidth: Discrete GPUs like the RTX 4090 offer ~1 TB/s bandwidth but are limited to 24GB VRAM. For LLMs, model size matters more than raw speed - a 70B parameter model simply won't fit in 24GB regardless of bandwidth.

Simplicity: No PCIe passthrough complexity, no separate power requirements, no thermal challenges from a 300W+ GPU in a compact enclosure.

Power efficiency: The platform peaks at ~160W and sustains around ~130W at the wall (320W PSU), versus 300-450W just for a high-end discrete GPU. For a 24/7 home server, this translates to meaningful electricity savings.

Cost effectiveness: A single unified system versus coordinating a CPU, motherboard, and expensive discrete GPU.

The Strix Halo Advantage

The AMD Ryzen AI Max+ 395 (Strix Halo) represents AMD's most capable APU to date:

  • Zen 5 architecture: Latest CPU cores with improved IPC
  • RDNA 3.5 graphics: 40 compute units, same architecture family as discrete RX 7000 series
  • Quad-channel LPDDR5X-8000: 256-bit memory bus delivers ~256 GB/s peak, ~3x a typical desktop DDR5 board
  • Unified memory controller: Both CPU and GPU access the same memory pool
  • AI accelerator: Dedicated XDNA 2 NPU (though less relevant for LLM inference)

This APU is essentially a workstation-class part in a mini-PC form factor — the same silicon in high-end mobile workstations, with maximum memory and thermal headroom.

For detailed architecture information, see Hardware Architecture.

Specifications

Component Specification
CPU AMD Ryzen AI Max+ 395 (Strix Halo)
Cores/Threads 16 cores / 32 threads (Zen 5)
GPU AMD Radeon 8060S (RDNA 3.5, 40 CUs)
GPU ID gfx1151
RAM 128GB LPDDR5X-8000 MT/s, quad-channel, soldered (~256 GB/s peak)
Internal NVMe (slot 1) 2 TB, PCIe 4.0 x4
Secondary NVMe (slot 2) 4 TB, PCIe 4.0 x1 (slower; ~2 GB/s ceiling)
Networking 2 x 10GbE (Realtek RTL8127)
Display HDMI 2.1 FRL, single output (up to 8K@60 / 4K@120)
USB Front: 1 x USB 3.2 Gen2, 2 x USB4 (40 Gbps), 2 x USB 2.0. Rear: 2 x USB4 V2 (80 Gbps), 1 x USB 3.2 Gen2
Expansion PCIe 4.0 x4 slot (full-length x16 connector)
Power 320W external PSU; ~160W peak / 130W sustained at the wall

Asymmetric NVMe slots

The second M.2 slot is only PCIe 4.0 x1, capping it at ~2 GB/s vs the ~8 GB/s available on slot 1. This is fine for ZFS-backed media and cold data, but VM disks and hot databases should live on the primary 2 TB drive.

APU for AI Workloads

The integrated RDNA 3.5 GPU shares system memory with the CPU, enabling:

  • Large model support: 70B+ parameter models fit in 128GB RAM
  • No VRAM limitation: Discrete GPUs typically max at 24GB
  • Simpler setup: No PCIe passthrough configuration needed
  • Real memory bandwidth: LPDDR5X-8000 quad-channel is in a different class than the dual-channel DDR5-5600 of typical desktop boards

Trade-offs

The APU approach involves a clear trade-off:

Aspect MS-S1 MAX (APU) Discrete GPU (RTX 4090)
Memory capacity 128GB 24GB
Memory bandwidth ~256 GB/s (LPDDR5X-8000 quad-channel) ~1 TB/s (GDDR6X)
Tokens/second Moderate Higher
Model size support 70B+ at Q6/Q8, 405B at low quant 70B requires CPU offloading
Power consumption ~130W sustained system 300-450W just for the GPU
Setup complexity Simple PCIe passthrough needed

The practical impact: Expect ~6-9 tok/s on a 70B Q4 model, ~15-20 tok/s on a 32B Q4, and ~50-70 tok/s on an 8B Q4 with ROCm/HIP. Discrete GPUs are faster per token but can't run the larger models at all without CPU offloading (which is much slower than this box).

For bandwidth calculations and deeper technical analysis, see Hardware Architecture.

See BIOS Setup for optimizing APU performance and GPU Setup for ROCm installation.

Storage Layout

Internal NVMe (2 TB, PCIe 4.0 x4) — host OS + hot data

Partition Size Filesystem Mount
EFI 512 MB FAT32 /boot/efi
Boot 1 GB ext4 /boot
Root 1 TB ext4 /
Free ~1 TB ZFS pool member

Secondary NVMe (4 TB, PCIe 4.0 x1) — bulk ZFS data

Entire disk allocated as a second ZFS pool member. The x1 link is the bottleneck — use this drive for media, backups, model files, and other read-heavy or cold data; keep VM disks and hot databases on the primary drive.

Why ext4 for Root?

  • Extremely stable
  • Excellent recovery tooling
  • Zero operational surprises
  • Root filesystem is infrastructure, not a feature

Note

/boot lives on the same disk as / — not on ZFS, not on a separate drive.