AVX-512 / SPEC CPU2026 / Zen 5

SPEC released CPU2026, an updated CPU benchmark suite with 52 workloads (up from 43) and larger code footprints, aiming to modernize its tests while keeping them portable. TechScan examined CPU2026 on Linux with GCC 14.2.0 (-O3, -march=native) to focus on hardware behavior; GCC 15.2.0 was avoided due to issues. SPEC CPU2026 uses an Ampere eMAG 8180 as the reference (score 1.0), which the author criticizes as unrepresentative of modern hardware and as skewing perceptions of performance. Tests show current Intel

Why It Matters

Benchmark and microbenchmark results shape hardware design and compiler targeting, so the SPEC update and the AVX-512 wins both affect how architects and performance engineers evaluate Zen 5 and other CPUs. Understanding the workload mix, the choice of reference machine, and real-world SIMD gains helps engineers prioritize optimizations and platform selection.

Latest Changes

SPEC released CPU2026 with 52 workloads, up from 43, and larger code footprints to better reflect modern software.

TechScan evaluated CPU2026 on Linux with GCC 14.2.0 (-O3, -march=native) to highlight hardware behavior and avoided GCC 15.2.0 due to issues.

SPEC selected an Ampere eMAG 8180 as the reference baseline (score 1.0), prompting criticism for not reflecting contemporary server CPUs.

AVX-512 implementations demonstrated large wins: an IPv6 text parser using AVX-512 on an Intel Xeon Gold core ran about 12× faster than inet_pton.
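
The reference-baseline item is easier to weigh with the scoring arithmetic in view. The sketch below assumes CPU2026 keeps the convention of earlier SPEC CPU suites, where each workload's ratio is the reference machine's run time divided by the measured run time and the overall score is the geometric mean of those ratios. That carry-over is an assumption, not something the coverage states, and the function names `workload_ratio` and `geometric_mean` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Per-workload ratio: reference run time divided by measured run time.
   The reference machine scores exactly 1.0 against itself. */
double workload_ratio(double ref_seconds, double measured_seconds) {
    assert(ref_seconds > 0.0 && measured_seconds > 0.0);
    return ref_seconds / measured_seconds;
}

/* x raised to a small non-negative integer power. */
static double int_pow(double x, size_t k) {
    double r = 1.0;
    while (k-- > 0) r *= x;
    return r;
}

/* Geometric mean of n positive ratios: the n-th root of their product.
   Newton's iteration on x^n = product is used instead of exp/log so the
   sketch links without -lm. It starts at the largest ratio, which is
   never below the geometric mean, so the iterates decrease to the root. */
double geometric_mean(const double *ratios, size_t n) {
    assert(n > 0);
    double product = 1.0, x = ratios[0];
    for (size_t i = 0; i < n; i++) {
        assert(ratios[i] > 0.0);
        product *= ratios[i];
        if (ratios[i] > x) x = ratios[i];
    }
    for (int iter = 0; iter < 10000; iter++) {
        double next = ((double)(n - 1) * x + product / int_pow(x, n - 1)) / (double)n;
        double delta = x - next;
        x = next;
        if (delta < 1e-12 * x && delta > -1e-12 * x) break;  /* converged */
    }
    return x;
}
```

With made-up timings, a workload that takes 100 s on the reference and 25 s on the system under test yields a ratio of 4.0, and ratios of 4.0 and 1.0 combine to a geometric mean of 2.0. A slower reference machine raises every ratio, which is why the choice of baseline affects how large the headline numbers look.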

Timeline

2026-05-23: SPEC released CPU2026, increasing workloads to 52 and enlarging program footprints.

2026-05-23: TechScan published an evaluation of CPU2026 on Linux using GCC 14.2.0 (-O3, -march=native) and highlighted hardware-focused results.

2026-05-23: Coverage noted SPEC's choice of the Ampere eMAG 8180 as the reference baseline, drawing criticism for being unrepresentative.

2026-05-25: Daniel Lemire published an AVX-512 IPv6 text parser achieving roughly 12× speedup over inet_pton on a single Intel Xeon Gold core.

Recent News (4)

Parsing IPv6 Addresses Crazily Fast with AVX-512

Daniel Lemire demonstrates an AVX-512 SIMD implementation that parses IPv6 text addresses about 12x faster than the standard inet_pton on a single Intel Xeon Gold core. Using 512-bit registers to locate colons, expand bytes, permute hex digits, and combine values with multiply-accumulate, the branch-minimized routine achieves ~71 million addresses/sec versus inet_pton's ~5.7 million in his benchmark, with far fewer instructions and higher instruction throughput. The code and benchmark details are published on Lemire's blog, showing practical speedups for high-throughput networking or logging systems where IPv6 parsing is a bottleneck. This matters for server-side networking stacks, probes, and telemetry that need extremely fast text-to-binary IP conversion.

10 pts / HN / mfiguiere / 2h ago
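
To make the first step of that summary concrete, here is a minimal sketch of the `inet_pton` baseline alongside a colon-locating routine. It is not Lemire's code: `parse_ipv6_baseline` and `colon_mask` are hypothetical names, only the "locate colons" stage is shown, and the AVX-512 path is compiled only when the compiler defines `__AVX512BW__`, with a scalar loop producing the same bitmask otherwise.

```c
#include <arpa/inet.h>   /* inet_pton, AF_INET6 */
#include <netinet/in.h>  /* struct in6_addr */
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#if defined(__AVX512BW__)
#include <immintrin.h>
#endif

/* Baseline: returns 1 and fills out[16] if text is a valid IPv6 address,
   otherwise returns 0 and leaves out untouched. */
int parse_ipv6_baseline(const char *text, uint8_t out[16]) {
    struct in6_addr addr;
    if (inet_pton(AF_INET6, text, &addr) != 1) return 0;
    memcpy(out, addr.s6_addr, 16);
    return 1;
}

/* Bit i of the result is set when text[i] == ':'.
   Only the first 64 bytes are examined. */
uint64_t colon_mask(const char *text, size_t len) {
    if (len > 64) len = 64;
#if defined(__AVX512BW__)
    /* Masked load: bytes past len are not read and come back as zero,
       so they can never compare equal to ':'. */
    __mmask64 valid = (len == 64) ? ~(__mmask64)0 : (((__mmask64)1 << len) - 1);
    __m512i bytes = _mm512_maskz_loadu_epi8(valid, text);
    return (uint64_t)_mm512_cmpeq_epi8_mask(bytes, _mm512_set1_epi8(':'));
#else
    /* Scalar fallback: same bitmask, one byte at a time. */
    uint64_t mask = 0;
    for (size_t i = 0; i < len; i++)
        if (text[i] == ':') mask |= (uint64_t)1 << i;
    return mask;
#endif
}
```

For the 11-character input `2001:db8::1`, the colons sit at offsets 4, 8, and 9, so `colon_mask` returns `0x310`. A single compare yields every separator position at once, which is the kind of branch-free step the summary describes. The throughput figures also line up with the headline: about 71 million addresses per second against about 5.7 million is a ratio of roughly 12.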

Evaluating Spec CPU2026

SPEC updated its long-standing CPU benchmark suite to SPEC CPU2026, increasing workloads from 43 to 52 and enlarging individual programs to better reflect modern code. The author evaluated CPU-oriented performance using GCC 14.2.0 on Linux, focusing on hardware comparisons. SPEC CPU2026’s reference baseline uses an Ampere eMAG 8180 (score 1.0), which the author criticizes as anachronistic and too slow compared with modern desktop CPUs. Tests show Intel’s recent Lion Cove and AMD Zen 5 delivering similar integer results while Zen 5 often leads in floating-point, partly due to GCC emitting AVX-512 and wide-vector code for several workloads (e.g., 706.stockfish, 749.fotonik3d). The article highlights concerns about the reference choice and the suite’s implications for evaluating contemporary CPU designs.

18 pts / Zelizdw
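
The point about GCC emitting AVX-512 code can be checked on a small scale. The sketch below is a hypothetical example, not taken from any CPU2026 workload: `built_with_avx512f` reports whether the compiler was permitted to use AVX-512 Foundation instructions (GCC defines `__AVX512F__` when it is, for instance under `-march=native` on a capable host), and `axpy` is the kind of simple loop that `-O3` auto-vectorization can widen. Whether GCC actually chooses 512-bit vectors depends on the target tuning, so treat this as a way to inspect the build, not a guarantee.

```c
#include <stddef.h>

/* 1 when the compiler was allowed to emit AVX-512 Foundation
   instructions for this translation unit, else 0. */
int built_with_avx512f(void) {
#if defined(__AVX512F__)
    return 1;
#else
    return 0;
#endif
}

/* y[i] += a * x[i]: a simple loop the auto-vectorizer can widen.
   restrict tells the compiler the two arrays do not overlap. */
void axpy(float a, const float *restrict x, float *restrict y, size_t n) {
    for (size_t i = 0; i < n; i++)
        y[i] += a * x[i];
}
```

Compiling this with `gcc -O3 -march=native -fopt-info-vec` makes GCC report which loops it vectorized, which is one way to see the effect the article attributes to the floating-point results.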
