Accelerate On-Device AI with Arm SME2
SME2 is the latest CPU extension on Arm Lumex CSS, the advanced subsystem for next-gen devices, designed to accelerate matrix-oriented compute workloads directly on device. It improves performance for AI and ML models, especially those that rely on operations like matrix multiplication, common in transformers, convolutional neural networks (CNNs), and large language models (LLMs).
Why SME2 Matters
Delivers more seamless, responsive on-device user experiences.
Reduces reliance on cloud services, improving user privacy and minimizing latency.
Works natively through Arm KleidiAI in PyTorch, LiteRT, ExecuTorch, ONNX Runtime, and MNN, with no code changes required.
Built for Modern AI
Accelerates complex AI computations, such as matrix-matrix multiplication and outer-product operations critical for real-time inference.
Introduces a dedicated predicate-as-counter mechanism, optimizing vector-register use and improving data throughput.
Efficiently processes quantized, low-precision neural network data formats (including 4-bit and 2-bit), reducing memory bandwidth and improving performance and efficiency.
Delivers flexible performance from entry-tier to flagship mobile devices, ensuring consistent developer and user experiences across devices.
SME2 in Action
SME2 powers intelligent, low-latency, private on-device AI workloads, enabling use cases such as agentic calling, personalized workout coaching, immersive NPC interactions, and neural imaging.
On-Device AI for Enhanced User Experiences: AI Yoga Tutor
Powered by Arm Lumex CSS with SME2, smart TVs and smartphones deliver real-time movement analysis for activities like yoga or Tai Chi, giving accurate feedback with a 2.5x speed-up in full pipeline time. Users get high-performance, low-latency guidance with full on-device privacy and security.
Agentic AI Call Handling
When Alex tries to call Mike and he’s unavailable, Mike’s AI agent steps in. Using on-device mobile AI, it sends a text confirming Mike can’t take the call, offers to schedule a callback, and automatically creates a calendar event.
Live Translation
Real-time speech translation with live captions now runs entirely on-device, delivering high-performance, low-latency communication with full privacy and security.
Music Generation
SME2-enabled smartphones generate music on-device from simple prompts, enabling fast iteration with low latency and secure, local processing.
Neural Camera Denoising
Powered by SME2, neural camera denoising runs AI-based image restoration directly on the CPU, achieving 4K at around 30 fps on SME2-enabled C1 CPUs while requiring only around 1 W to perform the enhancement. It delivers sharp, low-noise images even at 1 lux while keeping power use low. Implemented via Arm C Language Extensions, SME2 gives developers a flexible, CPU-only path to ISP-class imaging without relying on NPUs or fixed-function hardware.
Built for Developers
SME2 is already live on iPhone 17 and M4-based devices, allowing developers to begin optimizing AI applications today.
Thanks to native SME2 support across leading AI frameworks and runtime libraries—including PyTorch, ONNX Runtime, XNNPACK, and llama.cpp—developers can access SME2 benefits without changing a single line of code. SME2-enhanced performance is also portable across Arm-based platforms, from iOS and iPadOS to macOS and, soon, Android.
Explore the new Arm Developer Launchpad for SME2 for an overview of SME2 acceleration and use cases, step-by-step tutorials, and hands-on learning paths.
Boosting Mobile AI with Arm SME2 and Google Android
Explore how Arm SME2 powers faster, more efficient AI across Android smartphones, enabling low-latency, real-time applications in vision, voice, and generative AI.
Latest News and Resources
Frequently Asked Questions: SME2
What is Arm SME2?
SME2 (Scalable Matrix Extension 2) is an advanced set of CPU instructions in the Armv9.3-A architecture designed to accelerate AI and ML workloads, particularly matrix-heavy tasks like LLMs and computer vision. It integrates seamlessly with popular AI frameworks via Arm KleidiAI, delivering higher performance and efficiency without code changes.
How does SME2 improve AI performance on devices?
By executing matrix operations directly on the CPU, SME2 enables up to 6X faster inference for large language models and 3X improvements in vision and audio processing—without requiring separate NPUs or cloud resources.
Which devices will support SME2?
Available now on iPhone 17 (A19), Apple M series devices, and flagship Android phones.
How does SME2 benefit developers?
SME2 integrates automatically with frameworks like PyTorch, ONNX Runtime, and XNNPACK, so developers can accelerate AI workloads without rewriting code. Developers can explore Arm AI on mobile resources for toolchains, SDKs, and training to get started quickly.
Can SME2 help with generative AI applications?
Absolutely. SME2 accelerates generative AI tasks, such as real-time translation, photo/video enhancement, audio generation, and motion analysis, directly on-device. This enables faster, more private, and more energy-efficient user experiences. Developers can learn how to implement these capabilities with Arm AI on mobile resources.