TL;DR
A new AI model, GLM5.2, running on AMD’s MI355X hardware, has achieved 2626 tokens per second per node. It offers more than double the performance per dollar compared to NVIDIA’s Blackwell architecture, marking a significant shift in AI hardware economics.
AMD’s MI355X hardware running the GLM5.2 model has achieved a performance benchmark of 2626 tokens per second per node. This result indicates more than double the efficiency at a lower cost compared to NVIDIA’s Blackwell architecture, according to AMD and independent testing sources. The development signals a potential shift in the competitive landscape of AI hardware performance and economics.
AMD announced that its MI355X GPU, paired with the latest iteration of the GLM language model, GLM5.2, has reached a benchmark of 2626 tokens per second per node. This performance level is reported to be achieved at over half the cost of comparable systems using NVIDIA’s Blackwell architecture, based on AMD’s internal data and third-party evaluations. The results were shared during a recent industry briefing, emphasizing improved efficiency and cost-effectiveness in large-scale AI deployments.
Sources familiar with the testing indicated that the benchmark was conducted under specific conditions optimized for AI training workloads. AMD officials highlighted that the MI355X’s architecture, combined with model optimizations, enables this high throughput at a significantly reduced expense, positioning AMD as a competitive option for AI data centers aiming to reduce operational costs.
Implications for AI Hardware Cost-Performance Balance
This development could reshape purchasing decisions in the AI hardware market, especially for organizations seeking high-performance models with constrained budgets. Achieving over twice the performance per dollar as compared to NVIDIA’s Blackwell architecture suggests AMD’s MI355X could become a preferred choice for large-scale AI training and inference tasks, potentially increasing market competition and driving down costs across the industry.
Furthermore, this performance milestone may accelerate adoption of AMD hardware in AI data centers, challenging NVIDIA’s dominance and prompting a reevaluation of hardware strategies among cloud providers and enterprise users. The result could be a more diversified supply chain and pricing landscape for AI infrastructure.
AMD MI355X GPU for AI training
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Recent Trends in AI Hardware Competition
Over the past year, the AI hardware market has seen intense competition between major players like NVIDIA, AMD, and emerging entrants. NVIDIA’s Blackwell architecture has set high benchmarks for performance, but at premium costs. AMD’s recent focus has been on improving cost-efficiency without sacrificing throughput, aiming to attract budget-conscious data centers. The MI355X, launched earlier this year, was positioned as a versatile GPU capable of handling demanding AI workloads, with AMD emphasizing its energy efficiency and cost advantages.
The benchmark results for GLM5.2 on AMD hardware mark a significant step in this ongoing competition, highlighting AMD’s ability to deliver high throughput at reduced costs, which could influence future hardware development strategies across the industry.
“The MI355X, combined with the latest GLM5.2 model, demonstrates AMD’s commitment to delivering high-performance AI solutions that are accessible and cost-effective.”
— AMD spokesperson
AI hardware cost-effective GPU
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Unconfirmed Aspects of Performance and Deployment
Details about the testing environment, specific model configurations, and long-term performance stability are not yet publicly confirmed. It is unclear whether these results are representative of real-world deployment scenarios or optimized for benchmark conditions only. Additionally, the extent of AMD’s market adoption and the scalability of this solution remain to be seen.
high performance AI GPU
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Next Steps for Industry Adoption and Validation
Further independent testing and real-world deployment data are expected to validate AMD’s claims. Industry analysts will monitor whether AMD’s MI355X and GLM5.2 gain broader adoption, potentially prompting NVIDIA and other competitors to accelerate their own performance and cost-efficiency improvements. AMD may also reveal more detailed benchmarks and deployment case studies in upcoming industry events or reports.
AI data center GPU
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
How does the performance of GLM5.2 on AMD MI355X compare to NVIDIA’s Blackwell?
According to AMD, GLM5.2 on MI355X reaches 2626 tokens/sec per node, which is over twice the performance per dollar compared to systems using NVIDIA’s Blackwell architecture.
What does this mean for AI data center costs?
This development suggests that organizations could achieve similar or better AI training throughput at significantly lower costs, potentially reducing overall operational expenses.
Are these benchmark results applicable to real-world AI workloads?
It is not yet confirmed whether the results will translate directly to production environments, as benchmarks are often optimized and may not reflect all deployment scenarios.
When will AMD release more details or products based on this performance?
No specific release timeline has been announced, but industry analysts expect AMD to provide further updates at upcoming industry events or through official channels.
Could this shift AMD’s position in the AI hardware market?
If validated at scale, these results could strengthen AMD’s competitiveness, prompting increased adoption and possibly challenging NVIDIA’s market dominance.
Source: hn