Rethinking the Datacenter for the Agentic AI Era
AI Summary
AI is driving data centers toward specialized, workload-optimized infrastructure that emphasizes power efficiency, scalability, and performance. Arm delivers the CPU foundation for AI data centers, integrating seamlessly with accelerators to orchestrate AI agents, process data, and support scalable AI workloads such as recommendation engines, large language models, and retrieval-augmented generation. Paired with a robust software ecosystem, the Arm compute platform enables hyperscalers to scale AI infrastructure efficiently while improving performance, cost, and energy outcomes.
Inside the AI Datacenter: Custom Silicon and the Power of the Arm Ecosystem
Hear from Mohammed Awad, head of the cloud AI business unit at Arm, as he explores how AI is reshaping datacenter design, why performance per watt now defines cloud competitiveness, and how the Arm ecosystem is accelerating next-generation custom silicon for the AI era.
More Compute, Higher Efficiency, Better Price-Performance
Arm delivers energy-efficient compute that pairs seamlessly with a broad range of AI accelerators—helping you achieve strong performance and efficiency while lowering total cost of ownership.
Up to 8x faster training and 4.5x faster inference, delivered by the NVIDIA Grace Hopper Superchip when training a DLRM model and running inference on the GPT-65B model, compared to x86+Hopper systems.1
Delivered by the AWS Graviton4 processor in Llama 3.1 and XGBoost benchmarks, compared to x86 alternatives.2
Delivered by the Google Axion processor, with 64% cost savings and faster RAG for real-time AI, compared to x86 alternatives.3
Delivered by Microsoft Cobalt 100, compared to x86 alternatives.4
Enabling Industry Leaders Through Infrastructure Optimized for Real-World Performance
Arm empowers industry leaders to build scalable, AI-optimized cloud infrastructure with computing solutions tuned for real-world AI performance. Designed for performance, power efficiency, and seamless scalability, Arm CPUs are perfectly suited to orchestrate accelerators for the most demanding AI and cloud workloads.
Discover how Arm-based AWS Graviton processors are transforming cloud AI with leading price-performance and efficiency for AI and cloud-native workloads, now powering AWS Trainium3 UltraServers.
Explore how Axion, the first Google Cloud custom Arm-based CPU, is advancing performance and efficiency for AI and cloud workloads.
Discover how Arm’s power-efficient compute platform has become a key element of NVIDIA’s accelerated computing platforms, including the Grace CPU family and now Vera CPUs, delivering a performance leap in NVIDIA’s rack-level AI solutions.
Powerful Agentic AI Performance with Arm Neoverse
Designed to handle demanding AI workloads efficiently, Arm Neoverse CPUs deliver high throughput and performance per watt—making them ideal head-node and orchestration engines in AI datacenters. From recommendation engines and language model inference to retrieval-augmented generation (RAG), Neoverse scales across a broad range of agentic AI applications.
Arm Compute Platform for Every AI Workload
As AI progresses from classic machine learning to generative AI and now agentic models, workloads are becoming increasingly compute and power intensive. Meeting these demands requires a shift to purpose-built CPUs that empower AI systems to dynamically match each workload with the right processor, optimizing for performance, power efficiency, and cost.
Arm Neoverse CPUs provide a power-efficient, scalable compute platform that integrates seamlessly with GPUs, NPUs, and custom accelerators, delivering greater performance, flexibility, efficiency, and scalability.
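The idea of dynamically matching each workload with the right processor can be sketched as a simple routing policy. The engine names, workload categories, and routing table below are illustrative assumptions for a heterogeneous CPU + accelerator system, not an Arm API:

```python
# Hypothetical sketch: routing AI workloads to the best-suited compute
# engine in a heterogeneous system. Names and categories are illustrative.
from dataclasses import dataclass


@dataclass
class Workload:
    name: str
    kind: str  # e.g. "orchestration", "llm_inference", "recommendation"


# Illustrative policy: latency-sensitive orchestration and data preparation
# stay on the CPU; heavy tensor math is offloaded to an accelerator.
ROUTING = {
    "orchestration": "neoverse_cpu",
    "data_prep": "neoverse_cpu",
    "llm_inference": "gpu",
    "recommendation": "npu",
}


def route(workload: Workload) -> str:
    """Return the engine a workload should run on (CPU by default)."""
    return ROUTING.get(workload.kind, "neoverse_cpu")


jobs = [
    Workload("agent-planner", "orchestration"),
    Workload("chat-completion", "llm_inference"),
]
placements = {w.name: route(w) for w in jobs}
# placements == {"agent-planner": "neoverse_cpu", "chat-completion": "gpu"}
```

In practice this decision is made by schedulers and serving frameworks rather than hand-written tables, but the shape of the problem is the same: classify the workload, then place it on the engine with the best performance, efficiency, and cost profile.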
Optimize AI Workloads with Arm Software and Tools
Developers need optimized tools to deploy AI quickly and efficiently with little effort. The Arm software ecosystem—including Arm Kleidi libraries and broad framework support—helps accelerate time to deployment and boost AI workload performance across cloud and edge.
Accelerate AI with Arm Kleidi and Developer Tools
Boost performance with Arm KleidiAI libraries, broad framework support, and robust developer resources to help streamline deployment and optimization.
Start Developing on Servers and in the Cloud
Explore migration resources, hands-on tutorials, and curated learning paths to accelerate AI workloads on Arm CPUs.
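A common first step when migrating workloads to Arm servers is detecting, at deploy time, whether the host is an aarch64 machine (such as a Graviton, Axion, or Cobalt instance) so the right build artifacts are selected. A minimal sketch using only the Python standard library; the image and tag names are illustrative assumptions:

```python
# Minimal sketch: pick a container image tag matching the host architecture.
# The arm64/amd64 tag convention is common but registry-specific; adjust as
# needed. "myapp" is a hypothetical image name.
import platform


def container_arch_tag() -> str:
    """Map the running machine architecture to a container image tag."""
    machine = platform.machine().lower()
    if machine in ("aarch64", "arm64"):  # Arm servers report aarch64 on Linux
        return "arm64"
    if machine in ("x86_64", "amd64"):
        return "amd64"
    return machine  # pass through any other architecture unchanged


print(f"pulling myapp:{container_arch_tag()}")
```

Multi-architecture container images and manifest lists make this largely automatic in modern toolchains, but an explicit check like this is useful in build scripts and CI pipelines that produce per-architecture artifacts.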
Latest News and Resources
Benchmarking Sustainable Datacenter Performance
Independent analysis from Signal65 reveals how Arm Neoverse-based AWS Graviton4 processors consistently deliver superior performance per watt across web, database, and AI workloads—driving greater efficiency and lower total cost of ownership in datacenters.
AI in Datacenters
The Dawn of a New Era for Arm in the Datacenter
Industry analyst Ben Bajarin explores how AI is redefining datacenter architecture and why Arm is emerging as a key player in powering scalable, efficient infrastructure for the AI era.
Arm and NVIDIA Redefine AI in Datacenters
Listen to our podcast with NVIDIA to explore how our partnership is transforming enterprise computing.
The Future of AI Infrastructure with Arm and Industry Expert Matt Griffin
Hear Arm and Matt Griffin, founder of the 311 Institute, discuss emerging AI infrastructure trends, challenges in scaling compute, and how Arm is enabling efficient, sustainable AI from cloud to edge.
Build a Scalable AI Platform from Cloud to Edge
Learn five decisions that help enterprises design a future-ready compute stack. Explore how to embrace heterogeneous compute, unify the software layer, and align infrastructure with business goals to cut latency and scale efficiently across environments.
Key Takeaways
- Arm enables datacenter transformation from general-purpose platforms to specialized, workload-optimized AI infrastructure built for efficiency and scalability.
- Neoverse CPUs deliver high throughput, power efficiency, and lower TCO for AI applications including recommendation engines and large language model inference.
- Arm-based processors from partners like Google, AWS, Microsoft, and NVIDIA achieve up to 8x faster training and 4.5x faster inference compared with x86 systems.
- Heterogeneous Arm-based infrastructure dynamically matches workloads with CPUs, GPUs, NPUs, and custom accelerators for optimal performance and cost.
- Arm’s Kleidi libraries, frameworks, and developer tools streamline AI deployment and workload optimization across cloud and edge environments.
Frequently Asked Questions: AI in the Datacenter
What makes Arm ideal for AI in datacenters?
- Power-efficient performance: Arm Neoverse CPUs deliver industry-leading performance-per-watt, reducing energy costs and improving operational efficiency.
- Lower total cost of ownership (TCO): Scalable architectures optimized for modern AI workloads help businesses reduce infrastructure spend.
- Flexible, workload-optimized systems: Arm-based platforms seamlessly integrate with GPUs, NPUs, and custom accelerators to deliver the right compute for every AI task.
- Trusted by hyperscalers: By 2025, half of compute shipped to top hyperscalers is projected to be Arm-based—underscoring growing confidence in Arm for large-scale AI deployment.
- Unified AI infrastructure: A mature software ecosystem and broad adoption support seamless integration across diverse compute engines in cloud and datacenter environments.
How do Arm-based platforms enhance AI performance and reduce cloud costs across industry partners like NVIDIA, Google Cloud, and AWS?
Arm-based platforms boost AI performance and efficiency at scale:
- NVIDIA: Up to 8x faster ML training and 4.5x better LLM inference (GPT-65B) with Arm CPUs + Grace Hopper compared to x86-based systems.
- Google Cloud: When compared to x86-based alternatives, Axion processors deliver up to 3x better MLPerf performance, 2.5x higher inference throughput, and 64% lower costs.
- AWS: Graviton CPUs, built on Arm, power over 50% of AWS’s recent capacity, offering industry-leading price-performance and energy efficiency.
Together, these innovations enable faster, more cost-effective AI across cloud and hyperscale platforms.
What tools does Arm offer to developers for AI workloads?
Developers can accelerate workloads using:
- Arm Kleidi Libraries
- Optimized frameworks and toolchains
- Migration tutorials and learning paths for cloud/server development
1. NVIDIA GH200 Grace Hopper Superchip Architecture
2. Results are based on third-party evaluations. All tests were conducted by Signal65 and measured on AWS instances and specifications noted at the time of testing. Full methodology here. Results may vary.
3. Harness the Power of Retrieval-Augmented Generation with Arm Neoverse-Powered Google Axion Processors
4. Accelerate LLM Inference with ONNX Runtime on Arm Neoverse-powered Microsoft Cobalt 100
5. Unpacking Axion: Google Cloud’s Custom Arm-based Processor Built for the AI age