How Many GPUs Can a Single SSD Feed? Memblaze PBlaze7 7A40 Breaks Records in MLPerf Storage v2.0 Benchmark

02-03-2026 08:31 AM CET | IT, New Media & Software

Press release from: Beijing Memblaze Technology Co. Ltd.

Beijing, February 2026 - In the AI era, where computing power equates to productivity, the response speed of storage systems has become a critical variable in determining large model training efficiency. In the recently released MLPerf™ Storage v2.0 benchmarks, Memblaze, in collaboration with industry partners, achieved multiple top rankings with a massive aggregate data bandwidth of 513 GB/s. This milestone stems not only from breakthroughs in underlying hardware but also from the high-efficiency synergy across the entire storage architecture.

MLPerf™ Storage, governed by the global authority MLCommons, is widely regarded as the "Olympics" of AI performance. Compared to v1.0, the v2.0 update introduces more rigorous testing dimensions: it requires multiple experimental repetitions to verify average performance, implements strict environment validation, and introduces the critical Checkpoint save/load phase. These changes are designed to eliminate "burst peak" interference, revealing the true resilience and sustained output of a storage system under prolonged, high-pressure, and complex AI workloads.

For enterprise SSD manufacturers, large-scale storage architectures are only part of the picture; it is essential to return to raw local storage performance. Testing a single SSD in its simplest configuration provides a transparent view of its native capabilities under identical workloads.

"We cannot predict which servers or system configurations our users will choose. Our mission is to provide the highest possible SSD performance to serve as a rock-solid, reliable foundation for any AI training task."

The following benchmarks were conducted using a single PBlaze7 7A40 7.68TB PCIe 5.0 SSD on a standard 2U server equipped with dual Xeon 8457C processors and 1 TB of system RAM, running Fedora with XFS. The dataset size was set above 5 TB to minimize system memory caching effects, in strict adherence to the MLPerf™ Storage v2.0 specifications.

3D-UNet: Sustained High Throughput for Medical AI

3D-UNet (medical imaging) requires immense bandwidth. With a batch size of 7, a single NVIDIA H100 GPU consumes a batch every 0.323 s, requiring at least 2.85 GB/s of sustained throughput to maintain >90% utilization.

In a 4-H100 GPU simulation, the PBlaze7 7A40 delivered a stable 11,978 MB/s, achieving a GPU utilization of 98.94%. This result outperforms official data from Vendor A (11,450 MB/s) and Vendor B (11,568 MB/s). In an 8-A100 GPU setup, the drive reached an even higher 12,208 MB/s.
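The "how many GPUs can one drive feed" question for 3D-UNet can be approximated with simple division. The sketch below is a back-of-the-envelope check, not part of the MLPerf tooling: the function name is hypothetical, and it assumes decimal units (1 GB = 1000 MB), which the release does not specify.

```python
import math

def max_gpus_fed(drive_mb_per_s: float, per_gpu_gb_per_s: float) -> int:
    """Whole number of GPUs a drive can feed at a given per-GPU demand.

    Assumes decimal units (1 GB = 1000 MB); this is an assumption,
    the release does not state which convention it uses.
    """
    per_gpu_mb_per_s = per_gpu_gb_per_s * 1000
    return math.floor(drive_mb_per_s / per_gpu_mb_per_s)

PER_H100_GB_S = 2.85    # from the text: minimum for >90% utilization
MEASURED_MB_S = 11_978  # from the text: 4-H100 3D-UNet result

gpus = max_gpus_fed(MEASURED_MB_S, PER_H100_GB_S)
headroom_mb_s = MEASURED_MB_S - gpus * PER_H100_GB_S * 1000

print(f"GPUs fed at the 90% threshold: {gpus}")
print(f"Bandwidth left over: {headroom_mb_s:.0f} MB/s")
```

Under these assumptions the measured bandwidth covers four H100s at the 90% threshold with some headroom, but not a fifth, which matches the 4-H100 configuration reported above.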

ResNet-50: Massive Concurrency and Linear Scaling

ResNet-50 demands high-frequency file index processing and real-time decompression via TensorFlow, testing an SSD's random read latency.

In a 64-H100 GPU environment, the PBlaze7 7A40 provided 12,284 MB/s (98.49% utilization). When pushed to 72 GPUs (simulating an additional 8-GPU node), bandwidth climbed to 13,119 MB/s while maintaining over 93% utilization.

Comparative Edge: At the same 64-H100 load, the PBlaze7 7A40 exceeded the 11,562 MB/s posted by industry peer Vendor B by roughly 6%.
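The 64-GPU and 72-GPU ResNet-50 figures can be cross-checked against each other. The sketch below divides total bandwidth by GPU count and then estimates utilization at 72 GPUs under a simplifying assumption: that utilization scales in proportion to the bandwidth each GPU receives. This is not how the benchmark itself computes utilization, and the helper name is hypothetical.

```python
def per_gpu_mb_s(total_mb_s: float, gpus: int) -> float:
    """Average bandwidth delivered to each simulated GPU."""
    return total_mb_s / gpus

# Figures from the text
bw_64 = per_gpu_mb_s(12_284, 64)
bw_72 = per_gpu_mb_s(13_119, 72)
UTIL_64 = 98.49  # percent, reported at 64 GPUs

# Simplifying assumption: utilization is proportional to per-GPU bandwidth.
est_util_72 = UTIL_64 * bw_72 / bw_64

print(f"Per-GPU bandwidth at 64 GPUs: {bw_64:.1f} MB/s")
print(f"Per-GPU bandwidth at 72 GPUs: {bw_72:.1f} MB/s")
print(f"Estimated utilization at 72 GPUs: {est_util_72:.1f}%")
```

Per-GPU bandwidth drops by about 5% when going from 64 to 72 GPUs, and the estimate lands between 93% and 94%, in line with the "over 93%" figure quoted for the 72-GPU run.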

CosmoFlow: Stability Across Millions of Small Files

CosmoFlow utilizes 1.94 million samples, each averaging just 2.8 MB. This creates a massive I/O request load for metadata and file retrieval.

While the MLPerf™ passing threshold is 70% GPU utilization, the PBlaze7 7A40 powered 16 H100 GPUs at over 90% utilization. It supported up to 24 H100 GPUs while maintaining 74.35% utilization at a bandwidth of 13,457 MB/s.
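The CosmoFlow numbers translate into a dataset size and a file-request rate with a few lines of arithmetic. The sketch below takes the sample count and average size at face value and assumes decimal units (1 TB = 1,000,000 MB); the variable names are illustrative only.

```python
SAMPLES = 1_940_000    # from the text
AVG_SAMPLE_MB = 2.8    # from the text
BANDWIDTH_MB_S = 13_457  # from the text: 24-H100 run
GPUS = 24

# Total dataset size, assuming 1 TB = 1,000,000 MB (decimal units).
dataset_tb = SAMPLES * AVG_SAMPLE_MB / 1_000_000

# Samples the drive must deliver each second, in total and per GPU.
samples_per_s = BANDWIDTH_MB_S / AVG_SAMPLE_MB
samples_per_s_per_gpu = samples_per_s / GPUS

print(f"Dataset size: {dataset_tb:.2f} TB")
print(f"Samples per second (total): {samples_per_s:.0f}")
print(f"Samples per second per GPU: {samples_per_s_per_gpu:.0f}")
```

The dataset works out to roughly 5.4 TB, consistent with the "above 5 TB" sizing described in the test setup, and the 24-GPU run corresponds to several thousand sample reads per second.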

Checkpoint: Minimizing Downtime for Large Models

In LLM training (like Llama-405B), GPU resources sit idle during Checkpoint save/load cycles. Every second saved is a direct reduction in compute cost.

The PBlaze7 7A40 achieved a save speed of 8.43 GB/s and a load speed of 7.45 GB/s, completing save/recovery operations for the massive model in just 6 and 7 seconds, respectively.

Compared to legacy PCIe 4.0 SSDs, which often see save times double due to bandwidth bottlenecks, the PBlaze7 7A40 significantly reduces non-computing time, maximizing the Return on Investment (ROI) for AI clusters.
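The relationship between checkpoint bandwidth and idle time is simply volume divided by throughput. The sketch below backs out the data volume implied by the reported speeds and times, then shows what happens to save time on a hypothetical drive with half the bandwidth. The reported seconds are presumably rounded, so the volumes are rough, and the half-bandwidth drive is an illustrative assumption, not a measured PCIe 4.0 result.

```python
def transfer_seconds(volume_gb: float, bandwidth_gb_s: float) -> float:
    """Time to move a given volume at a sustained bandwidth."""
    return volume_gb / bandwidth_gb_s

# Figures from the text
SAVE_GB_S, SAVE_S = 8.43, 6
LOAD_GB_S, LOAD_S = 7.45, 7

# Implied volume handled per operation (speed x time); approximate,
# because the reported durations are likely rounded to whole seconds.
save_volume_gb = SAVE_GB_S * SAVE_S
load_volume_gb = LOAD_GB_S * LOAD_S

# Hypothetical drive with half the save bandwidth (assumption for illustration).
half_bw_save_s = transfer_seconds(save_volume_gb, SAVE_GB_S / 2)

print(f"Implied save volume: {save_volume_gb:.1f} GB")
print(f"Implied load volume: {load_volume_gb:.1f} GB")
print(f"Save time at half bandwidth: {half_bw_save_s:.1f} s")
```

Both operations imply a volume of about 50 GB, and halving the bandwidth doubles the save time, which is the effect the PCIe 4.0 comparison describes.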

Infrastructure for the AI Era

The MLPerf™ Storage v2.0 results confirm that the PBlaze7 7A40 PCIe Gen5 SSD is a premier infrastructure component for AI. By drastically reducing "I/O Wait" times across high-throughput, high-concurrency, and metadata-heavy workloads, Memblaze continues to provide the "storage backbone" required for the next generation of Large Language Models.

Qiong Wu | PR Manager, Marketing Department
Mobile: +86 15810719739
E-mail: qiong.wu@memblaze.com
Address: B2-A302, Dongsheng Technology Park, No.66 Xixiaokou Road, Haidian District, 100192, Beijing China

Memblaze is the world's leading supplier of enterprise-level SSD (Solid State Drive) products and solutions. The PBlaze series SSD launched by Memblaze has been widely used in database, virtualization, cloud computing, big data, artificial intelligence and other fields, providing stable and reliable high-speed storage solutions for many customers in Internet, cloud service, finance, telecommunications and other industries.
For more information, please visit Memblaze.com

This release was published on openPR.

News-ID: 4374503
