The Compute Platform for On-Device GenAI
AI Summary
On-device generative AI in mobile computing relies on the Arm Lumex Compute Subsystem (CSS) Platform to balance performance, efficiency, and accessibility for AI-native, advanced experiences. As AI models evolve rapidly, Arm enables innovation across the compute stack, ensuring mobile devices can handle demanding workloads. This foundation supports scalable AI solutions that transform how applications deliver intelligent and responsive capabilities.
Applications for AI-Powered Mobile Experiences
-
Generative AI
-
CPU Inference
-
Machine Learning
Gen AI on Mobile
Generative AI is consolidating the smartphone as the center of personal and professional compute. Arm technology offers an efficient foundation for AI acceleration at scale, enabling innovative generative AI use cases, including group chat summarization and real-time assistants, to run entirely on mobile.
AI Inference on CPU
With the growing number of AI applications, there comes an exponential increase in the need for AI inference capabilities. Arm CPUs provide the technology foundation for inference to run on mobile, bringing AI into the hands of billions of people around the world.
Classical ML and Deep Learning
Arm is the foundation of real-world machine learning (ML), providing a flexible compute platform to match your workload demands in an energy-efficient way, from face ID and image classification to speech recognition and email spam filters.
The Foundation for AI Innovation
Built on the latest Armv9.3-A architecture, Arm Lumex CSS delivers the performance, efficiency, and developer-ready integration needed for next-generation smartphones. With industry-leading IPC, a new flagship GPU, and day-one software support through Kleidi and SME2, Arm Lumex CSS empowers SoC designers and OEMs to accelerate AI innovation, faster, smarter, and across all device tiers.
Armv9 is the foundation for on-device generative AI. It provides the programming tools and environment necessary to innovate at pace for the rapidly expanding AI market. It delivers faster than ever compute for high-performance use cases and includes a suite of security features.
The Mali G1-Ultra GPU delivers immersive graphics in mobile gaming with next-gen ray tracing (RTUv2) and advanced AI acceleration. Optimized for flagship devices, it combines immersive visuals, real-time intelligence, and energy-efficient performance.
Arm Kleidi accelerates AI workloads on Arm CPUs by seamlessly integrating with popular machine learning frameworks. Combined with Arm’s new Scalable Matrix Extension 2 (SME2), Kleidi delivers zero-code performance boosts, up to 5x AI performance uplift, across generative AI, natural language processing, vision, and speech tasks, enabling developers to effortlessly achieve superior AI performance and efficiency on mobile and edge devices.
Generative AI on Smartphones
Energy-Efficient Image Generation
Stability AI and Arm are transforming image and audio generation. Balancing performance and efficiency, the Arm compute platform is ideal for creating impressive visuals and graphics with Stability AI's Stable Diffusion models and original sounds with Stable Audio.
Time-Saving AI Productivity Tools
Arm Kleidi accelerates response times for group chat summarization and learning assistant demos by unlocking the performance of Armv9 CPUs. Multiple messages or emails are quickly distilled into key points in an easily digestible format. Helpful facts and explanations are provided in moments.
Evolving Chatbots to Real-Time Assistants
By combining an LLM with automatic speech recognition and speech generation models, it is possible to have real-time conversations with context retention. Running this virtual assistant demo in flight mode shows the capabilities of the Arm CPU to process generative AI workloads entirely on-device.
Mobile Development on Arm
From documentation and tutorials to specialized tools and libraries, here’s everything you need to build mobile applications on Arm-based devices.
GenAI
The Use of GenAI in Game Development
Explore how generative AI is being incorporated into the game development pipeline at different stages.
Learning Paths
Mobile, Graphics, and Gaming
Explore how generative AI is being incorporated into the game development pipeline at different stages.
Latest News and Resources
- NEWS and BLOGS
- WEBINARS
- WHITE PAPERS
- GUIDES
- REPORTS
Mobile AI
The Arm Platform: Redefining Mobile Experiences with AI
Learn about Arm CSS for Client, the latest Arm Armv9 CPUs and GPUs, the benefits and opportunities of running AI on device, and how to innovate and speed time to market.
Mobile AI
The CPU Cluster: Redefining Mobile Experiences with AI
Discover the features, benefits, and performance enhancements of Arm Cortex-X925, Cortex-A725 CPU, Cortex-A520 CPU and Arm DSU-120.
Generative AI
Scale Generative AI with Flexibility and Speed
The race to scale new generative AI capabilities is creating opportunities for innovation and challenges – learn how to beat them.
Software AI Acceleration
Why Software is Crucial to Achieving AI’s Full Potential
How to choose the right open-source solutions to help accelerate generative AI and reduce the footprint of AI models.
Mobile AI
How Arm Enables Mobile AI Everywhere
Explore the numbers behind AI on Arm. This infographic shows how the Arm compute platform empowers developers and device makers to deliver next-generation AI-driven experiences to billions of users worldwide.
AI Workloads on Arm
Guide to Understanding AI Inference on CPU
Demand for running AI workloads on CPU is growing. Our helpful guide explores the benefits and considerations for CPU inference across a range of sectors, including Mobile AI.
Mobile AI
AI: Supercharging the Future of Mobile Graphics
See how AI workloads on mobile devices are creating advanced computing capabilities and performance, which translates to improved mobile graphics, intelligent interactions, and immersive gaming experiences.
The New Frontier for On-Device AI
Smaller models and accelerated compute are transforming AI on mobile.
Arm AI Readiness Index
Our comprehensive analysis of global AI readiness reveals how technology leaders across enterprises are adopting practical use cases for leading the next wave of mobile AI applications.
Key Takeaways
Key Takeaways
- On-device generative AI uses the Arm compute platform to deliver high-performance, efficient, and accessible mobile intelligence.
- Rapid advancements in AI models are driving software requirements that surpass traditional hardware capabilities.
- Arm enables innovation across the compute stack to address the growing complexity of mobile AI workloads.
- The platform ensures efficient performance and scalability for running generative AI directly on mobile devices.
- Mobile applications benefit from responsive, intelligent AI experiences supported by Arm’s foundational compute technologies.
Stay Connected
Subscribe to stay up to date on the latest news, case studies, and insights.