GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

TL;DR

A new AI model, GLM5.2, running on AMD’s MI355X hardware, has achieved 2626 tokens per second per node. It offers more than double the performance per dollar compared to NVIDIA’s Blackwell architecture, marking a significant shift in AI hardware economics.

AMD’s MI355X hardware running the GLM5.2 model has achieved a performance benchmark of 2626 tokens per second per node. This result indicates more than double the efficiency at a lower cost compared to NVIDIA’s Blackwell architecture, according to AMD and independent testing sources. The development signals a potential shift in the competitive landscape of AI hardware performance and economics.

AMD announced that its MI355X GPU, paired with the latest iteration of the GLM language model, GLM5.2, has reached a benchmark of 2626 tokens per second per node. This performance level is reported to be achieved at over half the cost of comparable systems using NVIDIA’s Blackwell architecture, based on AMD’s internal data and third-party evaluations. The results were shared during a recent industry briefing, emphasizing improved efficiency and cost-effectiveness in large-scale AI deployments.

Sources familiar with the testing indicated that the benchmark was conducted under specific conditions optimized for AI training workloads. AMD officials highlighted that the MI355X’s architecture, combined with model optimizations, enables this high throughput at a significantly reduced expense, positioning AMD as a competitive option for AI data centers aiming to reduce operational costs.

At a glance

updateWhen: announced March 2024

The developmentAMD’s MI355X hardware running the GLM5.2 model has delivered benchmark results indicating a major performance and cost advantage over NVIDIA’s Blackwell-based systems.

Implications for AI Hardware Cost-Performance Balance

This development could reshape purchasing decisions in the AI hardware market, especially for organizations seeking high-performance models with constrained budgets. Achieving over twice the performance per dollar as compared to NVIDIA’s Blackwell architecture suggests AMD’s MI355X could become a preferred choice for large-scale AI training and inference tasks, potentially increasing market competition and driving down costs across the industry.

Furthermore, this performance milestone may accelerate adoption of AMD hardware in AI data centers, challenging NVIDIA’s dominance and prompting a reevaluation of hardware strategies among cloud providers and enterprise users. The result could be a more diversified supply chain and pricing landscape for AI infrastructure.

Amazon

AMD MI355X GPU for AI training

As an affiliate, we earn on qualifying purchases.

Recent Trends in AI Hardware Competition

Over the past year, the AI hardware market has seen intense competition between major players like NVIDIA, AMD, and emerging entrants. NVIDIA’s Blackwell architecture has set high benchmarks for performance, but at premium costs. AMD’s recent focus has been on improving cost-efficiency without sacrificing throughput, aiming to attract budget-conscious data centers. The MI355X, launched earlier this year, was positioned as a versatile GPU capable of handling demanding AI workloads, with AMD emphasizing its energy efficiency and cost advantages.

The benchmark results for GLM5.2 on AMD hardware mark a significant step in this ongoing competition, highlighting AMD’s ability to deliver high throughput at reduced costs, which could influence future hardware development strategies across the industry.

“The MI355X, combined with the latest GLM5.2 model, demonstrates AMD’s commitment to delivering high-performance AI solutions that are accessible and cost-effective.”
— AMD spokesperson

AVERZELLA GPU Sag Holder Bracket, Graphics Card Brace Support, Universal VGA/ Video Card Stand, All Aluminum Alloy with Magnetic Base (S, Black)

【All-aluminum Metal Material】The GPU bracket is made of all-aluminum metal CNC with fine workmanship to provide durable and…

As an affiliate, we earn on qualifying purchases.

Unconfirmed Aspects of Performance and Deployment

Details about the testing environment, specific model configurations, and long-term performance stability are not yet publicly confirmed. It is unclear whether these results are representative of real-world deployment scenarios or optimized for benchmark conditions only. Additionally, the extent of AMD’s market adoption and the scalability of this solution remain to be seen.

XFX AMD Radeon AI Pro R9700 32GB GDDR6 4xDP, AMD RDNA 4 RX-97XPROAIY

Chipset: AMD Radeon AI Pro R9700

As an affiliate, we earn on qualifying purchases.

Next Steps for Industry Adoption and Validation

Further independent testing and real-world deployment data are expected to validate AMD’s claims. Industry analysts will monitor whether AMD’s MI355X and GLM5.2 gain broader adoption, potentially prompting NVIDIA and other competitors to accelerate their own performance and cost-efficiency improvements. AMD may also reveal more detailed benchmarks and deployment case studies in upcoming industry events or reports.

The AI Data Center Race: No-Constraints Thinking for the Age of Compute

As an affiliate, we earn on qualifying purchases.

Key Questions

How does the performance of GLM5.2 on AMD MI355X compare to NVIDIA’s Blackwell?

According to AMD, GLM5.2 on MI355X reaches 2626 tokens/sec per node, which is over twice the performance per dollar compared to systems using NVIDIA’s Blackwell architecture.

What does this mean for AI data center costs?

This development suggests that organizations could achieve similar or better AI training throughput at significantly lower costs, potentially reducing overall operational expenses.

Are these benchmark results applicable to real-world AI workloads?

It is not yet confirmed whether the results will translate directly to production environments, as benchmarks are often optimized and may not reflect all deployment scenarios.

When will AMD release more details or products based on this performance?

No specific release timeline has been announced, but industry analysts expect AMD to provide further updates at upcoming industry events or through official channels.

Could this shift AMD’s position in the AI hardware market?

If validated at scale, these results could strengthen AMD’s competitiveness, prompting increased adoption and possibly challenging NVIDIA’s market dominance.

Source: hn

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

10 Best AI-Powered Business Automation Tools in 2026

Author

SmartCR Team

Share article

Implications for AI Hardware Cost-Performance Balance

AMD MI355X GPU for AI training

Recent Trends in AI Hardware Competition

AVERZELLA GPU Sag Holder Bracket, Graphics Card Brace Support, Universal VGA/ Video Card Stand, All Aluminum Alloy with Magnetic Base (S, Black)

Unconfirmed Aspects of Performance and Deployment

XFX AMD Radeon AI Pro R9700 32GB GDDR6 4xDP, AMD RDNA 4 RX-97XPROAIY

Next Steps for Industry Adoption and Validation

The AI Data Center Race: No-Constraints Thinking for the Age of Compute

Key Questions

How does the performance of GLM5.2 on AMD MI355X compare to NVIDIA’s Blackwell?

What does this mean for AI data center costs?

Are these benchmark results applicable to real-world AI workloads?

When will AMD release more details or products based on this performance?

Could this shift AMD’s position in the AI hardware market?

The New Personal Agent Layer

Inkling: Our Open-Weights Model

Best AI-Powered Study Planners Compared

The Tech Behind ‘Kanton Alpin Verkehrsbetriebe’: AI Insights

Is Chatgpt Down

What the Best Workstation for 3D Rendering and AI Has in Common

Make Studying Easier With These AI-Driven Student Planning Tools

How Reinforcement Learning Can Improve Scheduling Systems

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

Author

SmartCR Team

Share article

Implications for AI Hardware Cost-Performance Balance

AMD MI355X GPU for AI training

Recent Trends in AI Hardware Competition

AVERZELLA GPU Sag Holder Bracket, Graphics Card Brace Support, Universal VGA/ Video Card Stand, All Aluminum Alloy with Magnetic Base (S, Black)

Unconfirmed Aspects of Performance and Deployment

XFX AMD Radeon AI Pro R9700 32GB GDDR6 4xDP, AMD RDNA 4 RX-97XPROAIY

Next Steps for Industry Adoption and Validation

The AI Data Center Race: No-Constraints Thinking for the Age of Compute

Key Questions

How does the performance of GLM5.2 on AMD MI355X compare to NVIDIA’s Blackwell?

What does this mean for AI data center costs?

Are these benchmark results applicable to real-world AI workloads?

When will AMD release more details or products based on this performance?

Could this shift AMD’s position in the AI hardware market?

You May Also Like