Hardware¶

Minisforum MS-S1 MAX¶

The MS-S1 MAX is a compact mini-PC suitable for a home server with virtualization capabilities and local AI inference.

Why This Hardware¶

The choice of an APU (Accelerated Processing Unit) over a discrete GPU setup is deliberate:

Memory capacity over bandwidth: Discrete GPUs like the RTX 4090 offer ~1 TB/s bandwidth but are limited to 24GB VRAM. For LLMs, model size matters more than raw speed - a 70B parameter model simply won't fit in 24GB regardless of bandwidth.

Simplicity: No PCIe passthrough complexity, no separate power requirements, no thermal challenges from a 300W+ GPU in a compact enclosure.

Power efficiency: The APU runs at 55-120W TDP versus 300-450W for high-end discrete GPUs. For a 24/7 home server, this translates to meaningful electricity savings.

Cost effectiveness: A single unified system versus coordinating a CPU, motherboard, and expensive discrete GPU.

The Strix Point Advantage¶

The AMD Ryzen AI Max+ 395 represents AMD's most capable APU to date:

Zen 5 architecture: Latest CPU cores with improved IPC
RDNA 3.5 graphics: 40 compute units, same architecture as discrete RX 7000 series
Unified memory controller: Both CPU and GPU access the same DDR5 pool
AI accelerator: Dedicated XDNA 2 NPU (though less relevant for LLM inference)

This APU is essentially a laptop chip pushed to desktop power limits - the same silicon in high-end gaming laptops, but configured with maximum memory and thermal headroom.

For detailed architecture information, see Hardware Architecture.

Specifications¶

Component	Specification
CPU	AMD Ryzen AI Max+ 395 (Strix Point)
Cores/Threads	16 cores / 32 threads (Zen 5)
GPU	AMD Radeon Graphics (RDNA 3.5, 40 CUs)
GPU ID	gfx1151
RAM	128GB DDR5-5600 (dual-channel)
Internal NVMe	2 TB
Secondary NVMe	4 TB
TDP	55-120W configurable
Display	HDMI 2.1, DisplayPort 2.1, USB-C

APU for AI Workloads¶

The integrated RDNA 3.5 GPU shares system memory with the CPU, enabling:

Large model support: 70B+ parameter models fit in 128GB RAM
No VRAM limitation: Discrete GPUs typically max at 24GB
Simpler setup: No PCIe passthrough configuration needed
Lower power: ~65W vs 300W+ for high-end discrete GPUs

Trade-offs¶

The APU approach involves a clear trade-off:

Aspect	MS-S1 MAX (APU)	Discrete GPU (RTX 4090)
Memory capacity	128GB	24GB
Memory bandwidth	~90 GB/s	~1 TB/s
Tokens/second	Lower	Higher
Model size support	70B+ at Q6/Q8	70B requires offloading
Power consumption	55-120W	300-450W
Setup complexity	Simple	PCIe passthrough needed

The practical impact: Expect 5-15 tokens/second for large models versus 30-60 on discrete GPUs. But you can run models that discrete GPU users cannot run at all without CPU offloading (which makes them even slower).

For bandwidth calculations and deeper technical analysis, see Hardware Architecture.

See BIOS Setup for optimizing APU performance and GPU Setup for ROCm installation.

Storage Layout¶

Internal NVMe (2 TB)¶

Partition	Size	Filesystem	Mount
EFI	512 MB	FAT32	`/boot/efi`
Boot	1 GB	ext4	`/boot`
Root	1 TB	ext4	`/`
Free	~1 TB	—	ZFS pool

Secondary NVMe (4 TB)¶

Entire disk allocated to ZFS pool.

Why ext4 for Root?¶

Extremely stable
Excellent recovery tooling
Zero operational surprises
Root filesystem is infrastructure, not a feature

Note

/boot lives on the same disk as / — not on ZFS, not on a separate drive.