HP ZBook Ultra G1a (Ryzen AI Max+ PRO 395)
No editorial image yet — generic vendor mark shown. Credentials in spec table below.
The flagship Strix Halo mobile workstation: a 14" laptop with 128GB LPDDR5X-8000 unified memory (up to ~96GB allocatable to the Radeon 8060S iGPU). The portable form of the 128GB unified-memory class — runs large local LLMs on the go.
Affiliate disclosure: as an Amazon Associate and partner of other retailers, we earn from qualifying purchases. The verdict on this page is our editorial opinion; affiliate links never influence what we recommend.
Sub-scores sum to 221 / 1000. Headline = 221 × 0.70 (Estimated-confidence discount) = 155. This is an algorithmic performance-tier score — distinct from, and often lower than, the editorial “Our verdict” below, which weighs value and real-world fit (especially for hardware we haven’t measured yet). How scoring works →
Extrapolated from 256 GB/s bandwidth — 25.6 tok/s estimated. No measured benchmarks yet.
Plain-English: Doesn't fit modern chat models usefully — vision models won't fit.
Verdicts extrapolated from catalog VRAM + bandwidth + ecosystem flags. Hover any chip for the rationale. Want measured numbers? Submit your own run with runlocalai-bench --submit.
What it does well
The HP ZBook Ultra G1a is the closest thing to a portable Mac-Studio-class local-AI machine on x86. Its Ryzen AI Max+ PRO 395 exposes up to ~96GB of a 128GB unified pool to the Radeon 8060S iGPU, so this 14" laptop runs 70B-class models locally — capacity no discrete-GPU laptop short of the 24GB 5090 Mobile can approach, and far beyond it for raw model size. For someone who needs to run big models on the move without a cloud, it's nearly unique.
Where it struggles
It's expensive ($4,000+) and shares Strix Halo's bandwidth ceiling (256 GB/s), so token speed on large models is modest — a 'fits big models portably' machine, not a fast one. As a 14" laptop it's thermally constrained versus the desktop Strix Halo boxes, so sustained inference throttles sooner. ROCm-on-Linux is where it shines; Windows GPU-offload support is more limited. No CUDA.
Bottom line
The premium pick for running 70B-class models on a genuine laptop. Worth it only if portability of big-model inference is the specific need; the Strix Halo desktops (Framework, GMKtec) give the same capability for half the price if you don't need it mobile.
Overview
The flagship Strix Halo mobile workstation: a 14" laptop with 128GB LPDDR5X-8000 unified memory (up to ~96GB allocatable to the Radeon 8060S iGPU). The portable form of the 128GB unified-memory class — runs large local LLMs on the go.
Some links above are affiliate links. We may earn a commission at no extra cost to you. How we make money.
Specs
| System RAM (typical) | 128 GB |
| Power draw (peak) | 120 W |
| Released | 2025 |
| MSRP | $3999 |
| Backends | ROCm Vulkan |
Models that fit
Open-weight models small enough to run on HP ZBook Ultra G1a (Ryzen AI Max+ PRO 395) with usable context.
Hardware worth comparing
The closest alternatives by price, memory bandwidth, and form factor, plus a step up and down — so you can frame the buying decision against real options.
- 10.0/10MacBook Pro 16" M4 Maxapple · 546 GB/s
- 9.6/10ASUS ROG Strix Scar 18 (RTX 5090 Mobile)nvidia · 24 GB VRAM
- 9.6/10Razer Blade 16 (2025, RTX 5090 Mobile)nvidia · 24 GB VRAM
- 8.0/10Apple MacBook Air (M4)apple · 120 GB/s
- 7.1/10NVIDIA GeForce RTX 5070 Laptop GPUnvidia · 12 GB VRAM
- 1.5/10NVIDIA GeForce RTX 3050 Ti (Mobile)nvidia · 4 GB VRAM
Frequently asked
Does HP ZBook Ultra G1a (Ryzen AI Max+ PRO 395) support CUDA?
Where next?
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.