Can’t afford Nvidia’s expensive AI accelerators? Then consider this 10.8Kw server cluster with 32 Intel GPUs and 768GB VRAM

Arina Makeeva Avatar
Illustration

For many businesses and research institutions, the high cost of advanced AI accelerators is a significant barrier to entry. Recognizing this challenge, Taiwanese graphics card manufacturer Sparkle has unveiled a powerful alternative aimed at delivering competitive performance without the hefty price tag associated with Nvidia’s offerings.

The newly introduced C741-6U-Dual 16P is a dense GPU server designed to support an impressive array of configurations, housing up to 32 Intel GPUs and providing a staggering 768GB of VRAM. This system positions itself as an affordable solution for intensive AI workloads, enabling a range of applications from machine learning models to data-intensive research.

At the heart of this server is the potential to utilize 16 Arc Pro B60 Dual graphics cards, each equipped with two Battlemage BMG-G21 GPUs. When fully outfitted, this setup yields a remarkable total of 81,920 GPU cores. Such capabilities enable users to tackle demanding parallel computing tasks that were once thought to be reserved for systems with exorbitant price tags.

To sustain this level of performance, Sparkle has engineered an advanced cooling system alongside a robust power supply design. The total power output can reach 10,800W through the use of five 2,700W titanium power supplies, ensuring reliability during heavy computational tasks. For lighter configurations, a smaller setup can operate efficiently at 7,200W, utilizing four 2,400W units.

The architectural design of the C741-6U-Dual 16P embraces the latest technology standards. By utilizing PCIe 5.0 x8 interfaces, each GPU connects directly to the CPU, promoting high data bandwidth and minimizing potential bottlenecks. Additionally, the server supports up to 32 DDR5 memory slots, enabling expansive memory configurations to be implemented alongside the Intel Xeon Scalable processors.

With an emphasis on heat management, the server is equipped with an impressive array of up to 15 cooling fans. Such features are critical for maintaining optimal performance during continuous heavy workloads, a necessity for organizations relying on stable and efficient computing resources.

While specific performance metrics in large-scale inference or training tasks are yet to be disclosed, the flexibility offered by the hardware attracts researchers and developers looking for a cost-effective parallel computing solution. This capability is particularly crucial in fields such as artificial intelligence and data science, where scalability and efficiency can make or break a project.

As Sparkle has yet to announce pricing details for the C741-6U-Dual 16P, interested parties are encouraged to inquire directly through the company’s website. This strategic move to enter a competitive market with a robust solution is indicative of the ongoing evolution within the GPU segment, as businesses seek alternatives that not only reduce costs but also maintain high-performance standards.

In summary, Sparkle’s new GPU server provides an attractive and practical entry point for those looking to harness the power of AI without the financial burden of higher-priced hardware. With its impressive specifications and thoughtful design features, the C741-6U-Dual 16P is set to shake up the market and may well become a preferred choice for budget-conscious leaders in tech.

Leave a Reply

Your email address will not be published. Required fields are marked *