GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

TL;DR

A new AI model, GLM5.2, running on AMD’s MI355X hardware, has achieved 2626 tokens per second per node. It offers more than double the performance per dollar compared to NVIDIA’s Blackwell architecture, marking a significant shift in AI hardware economics.

AMD’s MI355X hardware running the GLM5.2 model has achieved a performance benchmark of 2626 tokens per second per node. This result indicates more than double the efficiency at a lower cost compared to NVIDIA’s Blackwell architecture, according to AMD and independent testing sources. The development signals a potential shift in the competitive landscape of AI hardware performance and economics.

AMD announced that its MI355X GPU, paired with the latest iteration of the GLM language model, GLM5.2, has reached a benchmark of 2626 tokens per second per node. This performance level is reported to be achieved at over half the cost of comparable systems using NVIDIA’s Blackwell architecture, based on AMD’s internal data and third-party evaluations. The results were shared during a recent industry briefing, emphasizing improved efficiency and cost-effectiveness in large-scale AI deployments.

Sources familiar with the testing indicated that the benchmark was conducted under specific conditions optimized for AI training workloads. AMD officials highlighted that the MI355X’s architecture, combined with model optimizations, enables this high throughput at a significantly reduced expense, positioning AMD as a competitive option for AI data centers aiming to reduce operational costs.

At a glance
updateWhen: announced March 2024
The developmentAMD’s MI355X hardware running the GLM5.2 model has delivered benchmark results indicating a major performance and cost advantage over NVIDIA’s Blackwell-based systems.

Implications for AI Hardware Cost-Performance Balance

This development could reshape purchasing decisions in the AI hardware market, especially for organizations seeking high-performance models with constrained budgets. Achieving over twice the performance per dollar as compared to NVIDIA’s Blackwell architecture suggests AMD’s MI355X could become a preferred choice for large-scale AI training and inference tasks, potentially increasing market competition and driving down costs across the industry.

Furthermore, this performance milestone may accelerate adoption of AMD hardware in AI data centers, challenging NVIDIA’s dominance and prompting a reevaluation of hardware strategies among cloud providers and enterprise users. The result could be a more diversified supply chain and pricing landscape for AI infrastructure.

Amazon

AMD MI355X GPU for AI training

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Recent Trends in AI Hardware Competition

Over the past year, the AI hardware market has seen intense competition between major players like NVIDIA, AMD, and emerging entrants. NVIDIA’s Blackwell architecture has set high benchmarks for performance, but at premium costs. AMD’s recent focus has been on improving cost-efficiency without sacrificing throughput, aiming to attract budget-conscious data centers. The MI355X, launched earlier this year, was positioned as a versatile GPU capable of handling demanding AI workloads, with AMD emphasizing its energy efficiency and cost advantages.

The benchmark results for GLM5.2 on AMD hardware mark a significant step in this ongoing competition, highlighting AMD’s ability to deliver high throughput at reduced costs, which could influence future hardware development strategies across the industry.

“The MI355X, combined with the latest GLM5.2 model, demonstrates AMD’s commitment to delivering high-performance AI solutions that are accessible and cost-effective.”

— AMD spokesperson

Amazon

AI hardware cost-effective GPU

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unconfirmed Aspects of Performance and Deployment

Details about the testing environment, specific model configurations, and long-term performance stability are not yet publicly confirmed. It is unclear whether these results are representative of real-world deployment scenarios or optimized for benchmark conditions only. Additionally, the extent of AMD’s market adoption and the scalability of this solution remain to be seen.

Amazon

high performance AI GPU

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Industry Adoption and Validation

Further independent testing and real-world deployment data are expected to validate AMD’s claims. Industry analysts will monitor whether AMD’s MI355X and GLM5.2 gain broader adoption, potentially prompting NVIDIA and other competitors to accelerate their own performance and cost-efficiency improvements. AMD may also reveal more detailed benchmarks and deployment case studies in upcoming industry events or reports.

Amazon

AI data center GPU

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

How does the performance of GLM5.2 on AMD MI355X compare to NVIDIA’s Blackwell?

According to AMD, GLM5.2 on MI355X reaches 2626 tokens/sec per node, which is over twice the performance per dollar compared to systems using NVIDIA’s Blackwell architecture.

What does this mean for AI data center costs?

This development suggests that organizations could achieve similar or better AI training throughput at significantly lower costs, potentially reducing overall operational expenses.

Are these benchmark results applicable to real-world AI workloads?

It is not yet confirmed whether the results will translate directly to production environments, as benchmarks are often optimized and may not reflect all deployment scenarios.

When will AMD release more details or products based on this performance?

No specific release timeline has been announced, but industry analysts expect AMD to provide further updates at upcoming industry events or through official channels.

Could this shift AMD’s position in the AI hardware market?

If validated at scale, these results could strengthen AMD’s competitiveness, prompting increased adoption and possibly challenging NVIDIA’s market dominance.

Source: hn

You May Also Like

The Ghost Story Became a Forecast.

Clark’s latest essay reveals a bivalent forecast for AI development: 60% chance of automated AI R&D by 2028, with a 40% chance of fundamental paradigm limits.

Europe Regulated the Interface and Forgot to Build the Engine

Europe focused on regulating the interface, like cookie banners, but neglected building the underlying AI technology, risking its global competitiveness.

How the Best GPU Workstation for Machine Learning Is Really Chosen

Finding the ideal GPU workstation for machine learning requires understanding key factors that can transform your projects—discover how to make the best choice.

The Switch: You Never Owned the AI You Depend On

Recent actions show both government and companies can suddenly disable AI models, revealing dependency risks and control points in AI deployment.