Tiny company steals AMD’s thunder and challenges Nvidia with old-tech PCIe AI accelerator that runs 700B LLMs locally, sipping just 240W thanks to decade-old DDR4 and 28nm chips

Arina Makeeva Avatar
Illustration

A new player in the AI hardware space, Skymizer, is shaking up traditional assumptions about the computational requirements for hosting large language models (LLMs). With the introduction of their HTX301 PCIe AI accelerator, Skymizer is challenging the industry giants, AMD and Nvidia, by utilizing surprisingly outdated technology to achieve remarkable performance.

The HTX301 card is a game-changer, claiming the capability to run LLMs with up to 700 billion parameters on a single device while consuming only 240 watts of power. This is a significant reduction in energy consumption compared to contemporary accelerators, paving the way for more sustainable and efficient AI processing.

What makes the HTX301 particularly intriguing is its use of older 28-nanometer chips and standard LPDDR4 and LPDDR5 memory, as opposed to the pricier and newer HBM or GDDR solutions typically favored in high-performance computing. This choice reflects a strategic pivot from relying solely on the latest technology, suggesting that efficiency can be achieved without cutting-edge hardware.

Built on Skymizer’s proprietary HyperThought platform, the HTX301 card boasts advanced LPU IP designed specifically for large language model workloads. Each PCIe card is equipped with six HTX301 chips, which together provide an impressive maximum memory capacity of 384 GB. Such a configuration not only enhances performance but also provides ample memory for complex calculations necessary in AI applications.

The efficiency of the card is underscored by its ability to deliver 30 tokens per second at 0.5 TOPS and 100 GB per second bandwidth. Additionally, Skymizer employs efficient compression techniques for both weights and the key-value cache, enabling the HTX301 to outperform popular open-source models like llama.cpp by margins ranging from 9% to 17.8%. This blend of efficiency and performance positions the HTX301 as a formidable competitor in the AI accelerator market.

One of the most significant implications of Skymizer’s card is the potential to democratize access to powerful AI capabilities. By providing a solution that fits into standard air-cooled servers, the HTX301 eliminates the need for organizations to invest heavily in redesigning their data center power and cooling systems. This could dramatically lower the barriers to entry for businesses looking to harness the power of AI without venturing into the complexities and expenses associated with hyperscale GPU infrastructures.

The operational advantages extend beyond cost savings; concerns related to privacy and unpredictable cloud costs can deter enterprises from adopting large-scale AI solutions. Skymizer is addressing these issues head-on, suggesting that their HTX301 can fulfill enterprise needs for data sovereignty and predictable infrastructure costs while facilitating robust AI performance on-premises.

However, all these technological claims hinge on real-world testing. Skymizer plans to preview the HTX301 at the upcoming Computex event, where it hopes to validate its performance metrics against real-world workloads. The capabilities outlined on paper are impressive, but practical evaluations will be crucial in determining if the HTX301 can consistently deliver on its promises across various scenarios.

In a landscape where AMD recently unveiled its Instinct MI350P PCIe card, boasting 144 GB of HBM3E memory and up to 4,600 TFLOPS of peak performance, the HTX301’s lower power consumption highlights an intriguing divergence in design philosophy. As AI application demands grow exponentially, Skymizer’s innovation could represent a pivotal moment in shifting the narrative away from merely seeking brute computational power toward a more balanced approach that also considers efficiency and cost-effectiveness.

As we continue to watch the development of AI technologies and their infrastructure needs, Skymizer’s entry into the market could signify a turning point for both established players and emerging startups looking for scalable AI solutions.

Leave a Reply

Your email address will not be published. Required fields are marked *