UK’s trusted IT infrastructure partner since 2003
sales@servnetuk.com
0800 987 4111
Servnet
Supermicro AS-4125GS-TNRT
8 GPUs
Max PCIe double-width cards
384 Cores
Max CPU cores (dual 192-core EPYC 9005)
6TB DDR5
Max ECC memory (24 DIMMs)
PCIe 5.0
CPU-to-GPU interconnect per slot
Authorised Reseller
📦 Free UK Delivery
🔒 Full Warranty
📞 24/7 Support
Supermicro · SKU: AS-4125GS-TNRT

Supermicro AS-4125GS-TNRT 4U AMD EPYC GPU SuperServer — Up to 8× PCIe GPU

Flexible 4U AMD EPYC GPU Platform — Up to 8 PCIe GPUs for AI, HPC, and Mixed Workloads

The AS-4125GS-TNRT is Supermicro's versatile 4U AMD EPYC GPU SuperServer supporting up to 8 double-width PCIe GPUs across a wide compatibility matrix including NVIDIA H100, H100 NVL, L40, L40S, A100, A30, RTX 6000 Ada, and AMD Instinct MI210. Dual AMD EPYC 9005/9004 Series CPUs with PCIe 5.0, up to 6TB DDR5, and 4× 2,000W Titanium power supplies make it an adaptable, multi-vendor GPU platform for organisations requiring flexibility across GPU generations.

  • Up to 8× double-width PCIe GPU support — wide compatibility matrix
  • Compatible: NVIDIA H100, H100 NVL, L40, L40S, A100, RTX 6000 Ada, AMD MI210
  • Dual AMD EPYC 9005/9004: up to 192 cores / 384 threads per socket (384 cores / 768 threads per system)
  • Up to 6TB DDR5 ECC across 24 DIMM slots
  • PCIe 5.0 x16 CPU-to-GPU interconnect
  • 2× 10GbE RJ45 onboard + 1GbE dedicated BMC; expandable via PCIe NIC
  • 4× 2,000W Titanium (96%) redundant power supplies
  • Optional NVIDIA NVLink Bridge or AMD Infinity Fabric™ Link for GPU pairs

Get Pricing — Speak to Our Team

Response within 4 business hours · No obligation

Key Features

Everything you need in one device

🎛️

Multi-GPU, Multi-Vendor Flexibility

Unlike HGX-based systems locked to NVIDIA SXM GPUs, the AS-4125GS-TNRT supports up to 8 PCIe GPUs from multiple vendors: NVIDIA H100 PCIe (80GB), H100 NVL (94GB), L40S (48GB, well suited to inference), RTX 6000 Ada (48GB), A100 (80GB), A30 (24GB), and A16, as well as AMD Instinct MI210 (64GB). This multi-vendor flexibility reduces exposure to supply chain constraints and lets GPU selection match specific workload requirements.

🔴

Dual AMD EPYC 9005/9004 — Industry-Leading Core Density

AMD EPYC 9005 Series (Zen 5, up to 192 cores per socket) and EPYC 9004 Series (Zen 4, up to 96 cores per socket) deliver substantial CPU-side compute alongside GPU acceleration. With two 192-core EPYC 9005 processors the system reaches 384 cores, and selected EPYC 9005 models carry up to 512MB of L3 cache per socket, so the host CPUs can handle data preprocessing, model compilation, and inference post-processing without becoming a bottleneck for GPU-bound workloads.

🏎️

PCIe 5.0 — Maximum GPU Bandwidth

Each PCIe 5.0 x16 slot provides roughly 64 GB/s in each direction (about 128 GB/s bidirectional) per GPU, double the bandwidth of PCIe 4.0. This matters for workloads that frequently move large tensors between host DRAM and GPU memory, including reinforcement learning from human feedback (RLHF), model inference with dynamic batching, and HPC pre/post-processing pipelines that interleave CPU and GPU computation.
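
As a rough check on the per-slot bandwidth quoted above, the theoretical link rate can be derived from the PCIe signalling rate. The sketch below assumes 32 GT/s per lane for PCIe 5.0, 16 GT/s for PCIe 4.0, and 128b/130b line encoding. The function name is illustrative only, and achieved throughput will be lower once protocol overhead is taken into account.

```python
def pcie_gb_per_s(gt_per_s: float, lanes: int) -> float:
    """Theoretical one-direction bandwidth of a PCIe link in GB/s.

    Assumes 128b/130b encoding (used by PCIe 3.0 and later) and ignores
    packet and protocol overhead, so real transfers will be slower.
    """
    encoding_efficiency = 128 / 130
    gigabits_per_s = gt_per_s * lanes * encoding_efficiency
    return gigabits_per_s / 8  # bits to bytes


gen5_x16 = pcie_gb_per_s(32.0, 16)  # about 63 GB/s per direction
gen4_x16 = pcie_gb_per_s(16.0, 16)  # about 31.5 GB/s per direction

print(f"PCIe 5.0 x16: {gen5_x16:.1f} GB/s per direction, "
      f"{2 * gen5_x16:.1f} GB/s bidirectional")
print(f"PCIe 4.0 x16: {gen4_x16:.1f} GB/s per direction")
print(f"Gen5 / Gen4 ratio: {gen5_x16 / gen4_x16:.1f}x")
```

The commonly quoted 128 GB/s figure is the rounded bidirectional total; a single host-to-GPU copy sees at most the one-direction rate.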

💾

Massive DDR5 Memory — Up to 6TB

24 DIMM slots supporting up to 6TB of DDR5 ECC memory (6000MT/s with EPYC 9005) provide exceptional host memory bandwidth and capacity. This enables in-memory preprocessing of large training datasets, large CPU-side caches for GPU inference serving, and memory-intensive HPC workloads that exceed typical GPU HBM capacities and require CPU-side buffering.

🔗

Optional NVLink Bridge for GPU Pairs

For NVIDIA H100 PCIe and selected GPUs, optional NVLink Bridges connect adjacent GPU pairs with dedicated high-bandwidth interconnect — enabling tensor parallelism between paired GPUs without consuming PCIe bandwidth. AMD Infinity Fabric™ Link is optionally available for AMD Instinct GPU pairs. This makes the AS-4125GS-TNRT suitable for both single-GPU workloads and multi-GPU tensor-parallel inference serving.

🌡️

Air-Cooled — No Infrastructure Modifications Required

All GPU options in the AS-4125GS-TNRT are air-cooled via PCIe card blowers and chassis fans. The system operates in standard data centre air-cooling environments without CDU or liquid cooling modifications — making it suitable for colocation deployments, retrofitting into existing facilities, and use cases where liquid cooling infrastructure investment is not justified by workload requirements.

About the AS-4125GS-TNRT

The Supermicro AS-4125GS-TNRT is a 4U dual-socket AMD EPYC GPU SuperServer designed for workloads requiring flexible GPU selection, high CPU core counts, and large host memory capacity alongside multi-GPU acceleration. It supports up to 8 double-width, full-length PCIe GPU cards with a compatibility matrix spanning NVIDIA H100, L40S, A100, RTX series, and AMD Instinct MI210.

The platform's flexibility stems from its PCIe-native GPU architecture — all GPUs connect via standard PCIe 5.0 x16 slots rather than a proprietary SXM/HGX baseboard. While this means GPU-to-GPU communication uses PCIe bandwidth rather than NVLink at full NVSwitch speed (except where NVLink Bridges optionally connect pairs), it enables multi-vendor GPU support and future GPU upgradability without chassis replacement.

Dual AMD EPYC processors (up to 192 cores per socket with 9005 Series, or 96 cores per socket with 9004 Series) give the AS-4125GS-TNRT exceptional CPU-side processing capability relative to Intel Xeon Scalable alternatives. This is particularly beneficial for workloads with significant CPU pre/post-processing: reinforcement learning environments, simulation-based training, data-intensive inference pipelines, and HPC codes with balanced CPU/GPU parallelism.

Storage flexibility includes 2 hot-swap 2.5" SATA bays, 4 hot-swap 2.5" NVMe bays, and 1 M.2 slot for OS boot. Onboard 2× 10GbE provides baseline connectivity; the available PCIe 5.0 slots can host 400GbE NICs for high-bandwidth cluster networking. Power is delivered by 4× 2,000W Titanium-level supplies (2+2 or 3+1 redundancy).
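
The choice between 2+2 and 3+1 redundancy changes how much power is available to the system while still tolerating supply failures. The sketch below is a back-of-envelope estimate, not a sizing tool: the 2,000W supply rating and 400W CPU TDP come from the specifications, while the 350W per-GPU figure and the 500W allowance for fans, memory, drives, and NICs are assumptions chosen for illustration. Function names are hypothetical.

```python
def usable_watts(psu_watts: int, active_psus: int) -> int:
    """Power available when only the non-redundant supplies carry the load."""
    return psu_watts * active_psus


def estimated_load_watts(gpus: int, gpu_watts: int,
                         cpus: int, cpu_watts: int,
                         other_watts: int) -> int:
    """Sum of component power draws at their rated maximums."""
    return gpus * gpu_watts + cpus * cpu_watts + other_watts


PSU_WATTS = 2000  # per-supply rating from the specification

budget_2_plus_2 = usable_watts(PSU_WATTS, active_psus=2)
budget_3_plus_1 = usable_watts(PSU_WATTS, active_psus=3)

# Assumed values for illustration: 350W per GPU, 500W for everything else.
load = estimated_load_watts(gpus=8, gpu_watts=350,
                            cpus=2, cpu_watts=400,
                            other_watts=500)

for name, budget in (("2+2", budget_2_plus_2), ("3+1", budget_3_plus_1)):
    status = "fits" if load <= budget else "exceeds budget"
    print(f"{name}: budget {budget}W, estimated load {load}W -> {status}")
```

Under these assumed figures a fully populated system would need the 3+1 arrangement; lighter GPU configurations may fit within 2+2. Actual requirements should be confirmed against the manufacturer's power guidance for the chosen GPUs.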

Technical Specifications

Form Factor: 4U Rackmount; shipping package: 26.57"H × 27"W × 41"D; net weight: 65.5 lbs
CPU: Dual Socket SP5; AMD EPYC 9005 (up to 192C/384T, 400W TDP) or EPYC 9004 (up to 96C/192T) Series
GPU Support: Up to 8× double-width, full-length PCIe GPU; NVIDIA H100, H100 NVL, L40, L40S, A100, A30, A16, RTX 6000 Ada, RTX A6000, A40; AMD Instinct MI210
GPU Interconnect: PCIe 5.0 x16 direct-attached; optional NVIDIA NVLink Bridge (GPU pairs) or AMD Infinity Fabric Link
Memory: 24× DDR5 DIMM slots; up to 6TB ECC; 6000MT/s (EPYC 9005) or 4800MT/s (EPYC 9004)
Storage: 2× hot-swap 2.5" SATA; 4× hot-swap 2.5" NVMe; 1× M.2 NVMe (onboard)
Network: 2× 10GbE RJ45 LAN + 1× 1GbE dedicated BMC; additional NICs via PCIe slots
I/O: 2× rear USB, 1× VGA, 1× COM port
Power Supply: 4× 2,000W redundant Titanium (96%); 2+2 or 3+1 configuration
Cooling: Air-cooled; chassis fans + GPU card blowers; no CDU required
Chipset: AMD SoC integrated (EPYC 9005/9004)
Management: IPMI 2.0, dedicated BMC, SuperDoctor; ACPI power management
Security: Secure Boot, TPM 2.0, cryptographically signed firmware
Use Cases

Ideal for a wide range of deployments

🤖

AI Inference — Mixed GPU Fleet

Organisations running diverse inference workloads benefit from the AS-4125GS-TNRT's multi-GPU compatibility. L40S GPUs (48GB, optimised for inference throughput) can be deployed for high-concurrency LLM serving; H100 PCIe cards for latency-sensitive applications; A30 or A16 for lighter models or virtual GPU (vGPU) deployments. A single chassis can be configured for the specific GPU mix that matches the inference portfolio.

🔬

HPC and Simulation

AMD EPYC's memory bandwidth advantage over Intel Xeon (12-channel DDR5 per socket, giving a theoretical peak of about 460 GB/s at 4800MT/s with EPYC 9004 or about 576 GB/s at 6000MT/s with EPYC 9005) makes the AS-4125GS-TNRT particularly attractive for HPC codes that need strong CPU-side memory performance alongside GPU acceleration. Computational fluid dynamics, molecular dynamics, seismic processing, and finite element analysis all benefit from balanced CPU and GPU bandwidth.
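
The per-socket memory bandwidth follows from channel count and transfer rate. A minimal sketch, assuming 12 DDR5 channels per socket, a 64-bit (8-byte) data path per channel, and the 4800MT/s and 6000MT/s speeds listed in the specifications; the function name is illustrative, and sustained bandwidth in practice falls below these theoretical peaks.

```python
def ddr5_peak_gb_per_s(channels: int, mt_per_s: int, bytes_per_transfer: int = 8) -> float:
    """Theoretical peak memory bandwidth per socket in GB/s.

    Each transfer moves bytes_per_transfer bytes on every channel, so the
    peak is channels * transfers per second * bytes per transfer.
    """
    megabytes_per_s = channels * mt_per_s * bytes_per_transfer
    return megabytes_per_s / 1000  # MB/s to GB/s


epyc_9004 = ddr5_peak_gb_per_s(channels=12, mt_per_s=4800)
epyc_9005 = ddr5_peak_gb_per_s(channels=12, mt_per_s=6000)

print(f"EPYC 9004 @ 4800MT/s: {epyc_9004:.1f} GB/s per socket")
print(f"EPYC 9005 @ 6000MT/s: {epyc_9005:.1f} GB/s per socket")
print(f"Dual-socket peak (9005): {2 * epyc_9005:.1f} GB/s")
```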

🎓

ML Research and Development

Research environments that iterate across multiple GPU generations and experiment with different GPU types — comparing H100 and L40S for specific inference tasks, testing AMD MI210 for compatibility, or evaluating different GPU memory configurations — benefit from the AS-4125GS-TNRT's flexibility. The ability to reconfigure GPU hardware without changing the chassis makes it a sustainable research platform across hardware generations.

FAQ

Frequently Asked Questions

Q. How does the AS-4125GS-TNRT differ from the SYS-821GE-TNHR?

The SYS-821GE-TNHR uses NVIDIA's HGX SXM platform (8 GPUs on an HGX baseboard with NVSwitch) and is purpose-built for maximum AI training performance, with 900 GB/s of NVLink bandwidth per GPU across the NVSwitch fabric. The AS-4125GS-TNRT uses standard PCIe GPU cards (up to 8) and supports a much wider range of GPU models from multiple vendors. PCIe-attached GPUs have lower GPU-to-GPU bandwidth than NVSwitch, but the multi-vendor flexibility, AMD EPYC CPU options, and air cooling make it a more versatile and accessible platform.

Q. Can the AS-4125GS-TNRT be upgraded with higher-GPU-count configurations?

No. The chassis supports a maximum of 8 double-width GPU cards. GPUs can, however, be replaced with newer models within the supported compatibility list, for example migrating from A100 to H100 PCIe without chassis replacement. This upgrade path makes the AS-4125GS-TNRT a longer-lived platform investment than HGX-based systems, where the GPU baseboard is integral to the chassis design.

Why Servnet

Why buy from Servnet?

Trusted UK IT reseller since 2003 — supplying businesses of all sizes with genuine, competitively priced technology.

🏆

Authorised UK Reseller

Servnet is an authorised IT reseller with direct access to leading product lines at competitive pricing.

💬

Expert Pre-Sales Advice

Our engineers will help you select the right product and advise on compatibility with your existing infrastructure.

🚚

Fast UK Delivery

Most products ship from UK stock. Express delivery available for urgent deployments.

🔒

Genuine, Warranted Kit

All products are genuine, brand-new, with full manufacturer warranty and RMA support.

📋

Configuration Services

We can pre-configure your hardware before shipping — reducing on-site deployment time and cost.

📞

24/7 Support

Post-sale support from our certified engineers. We're here if anything goes wrong.

Related Products

Supermicro

SYS-821GE-TNHR

Flagship 8-GPU AI Training Platform — Up to 8x NVIDIA H200 SXM in 8U Air-Cooled

Supermicro

SYS-421GE-TNHR2-LCC

High-Density 4U AI Training Server — 8× NVIDIA HGX H100/H200 with Direct-to-Chip Liquid Cooling

Ready to order the AS-4125GS-TNRT?

Contact our team for a competitive quote, volume pricing, or compatibility advice.

✉ Get a Quote by Email · ☎ 0800 987 4111

Mon–Fri 09:00–17:30 · sales@servnetuk.com · Servnet Ltd, Fetcham, Surrey