Arm + PyTorch: Accelerating AI on Arm Everywhere from Cloud to Edge

Arm + PyTorch

This groundbreaking collaboration between Arm and the PyTorch team at Meta enables.


Together, we are democratizing AI innovation for developers, enabling them to seamlessly integrate the newest quantized models into their applications with no additional modifications or optimizations, saving time and resources.


Check out the ExecuTorch Beta release, optimized for Arm everywhere.

Read News

Faster PyTorch Inference on Arm in the Cloud

Arm, in collaboration with our partners, enhances PyTorch’s inference performance on Arm Neoverse servers.

  • Developers automatically benefit: Arm integrates performance optimizations, libraries, and microkernels directly into the PyTorch framework.
  • Expanding collaborations with cloud service providers: Enables AI developers everywhere.
  • Arm is enabling the entire ML stack and workflow: By collaborating with the entire ecosystem, from ML software companies like Databricks to the largest developer platforms like GitHub, we show developers exactly how to build AI workloads on Arm CPUs.

Learn about the Arm reference implementation of a Graviton-optimized chatbot.

Watch Webinar

Accelerating Generative AI at the Edge on Arm with ExecuTorch 

The collaboration between Arm and the PyTorch team at Meta is making AI accessible to the broadest range of devices and developers.

  • Arm compute platform and ExecuTorch framework: Enable smaller, optimized models for faster generative AI at the edge.
  • New quantized Llama 3.2 models: Ideal for on-device and edge AI applications on Arm, providing reduced memory footprint and improved accuracy, performance and portability.
  • Scale across edge devices: 20 million Arm developers can create and deploy more intelligent AI-based applications faster and at scale across billions of edge devices.
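
To make the "reduced memory footprint" of quantized models concrete, here is a minimal sketch of symmetric int8 quantization in plain Python. It is a generic illustration only: the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical, and this is not the quantization scheme used by the Llama 3.2 models or by ExecuTorch.

```python
import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: floats -> int8 values plus one scale.

    Illustrative helper (hypothetical name), not a PyTorch or ExecuTorch API.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    q = array.array(
        "b", (max(-127, min(127, round(w / scale))) for w in weights)
    )
    return q, scale


def dequantize(q, scale):
    """Map int8 values back to approximate floats."""
    return [v * scale for v in q]


# Made-up sample weights, stored as 32-bit floats for the size comparison.
weights = [0.5, -1.27, 0.0, 0.9, -0.33, 1.1]
fp32 = array.array("f", weights)

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

fp32_bytes = len(fp32) * fp32.itemsize   # 4 bytes per weight
int8_bytes = len(q) * q.itemsize         # 1 byte per weight
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {fp32_bytes} bytes, int8: {int8_bytes} bytes")
print(f"largest round-trip error: {max_error:.6f} (scale = {scale:.6f})")
```

In this sketch each weight shrinks from four bytes to one, at the cost of a rounding error of at most half a quantization step per weight. Production schemes typically use finer-grained scales and other techniques to limit accuracy loss.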

The ExecuTorch Beta release is here.

Watch Video

PyTorch + Arm: Enabling LLM Integration Everywhere

PyTorch Repositories and Developer Resources