December 18, 2020

Cloud AI, Edge AI, Endpoint AI. What’s the Difference?

Arm technology enables AI throughout the compute spectrum. Here, we explain the benefits and limitations of AI compute in cloud, edge and endpoint devices.

By Arm Editorial Team

Today, the majority of what we call artificial intelligence (AI) is machine learning (ML), a subset of AI that involves machines learning from sets of data. In general, the greater the amount of data to learn from, the more the AI is able to infer meaning and the more useful it becomes.

Hundreds of thousands of gigabytes of data are generated every day in AI applications ranging from consumer devices to healthcare, logistics to smart manufacturing. With this much data generated, the key consideration is where that data should be processed.

Arm defines three categories within the compute spectrum: cloud, edge and endpoint. We can employ ML to process data within each of these, but choosing the most suitable category isn’t as simple as going where the most compute performance is. Performance is only one of the factors that determine whether the insights inferred from data remain useful.

What is Cloud AI?

Cloud AI refers to AI processing within powerful cloud data centers. For a long time, cloud AI was the obvious choice of compute platform to crunch enormous amounts of data. Were it not for the concept of shunting data from the edge and endpoint into cloud servers for hyper-efficient processing, AI would not be at the stage of maturity it enjoys today.

It’s likely the majority of AI heavy lifting will always be performed in the cloud due to its reliability, cost-effectiveness and concentration of compute, especially when it comes to training ML algorithms on historical data that doesn’t require an urgent response. Many consumer smart devices rely on the cloud for their ‘intelligence’: for example, today’s smart speakers give the illusion of on-device intelligence, yet the only on-device AI they are capable of is listening out for the trigger word (‘keyword spotting’).

Cloud AI is undisputed in its ability to solve complex problems using ML. Yet as ML’s use cases grow to include many mission-critical, real-time applications, these systems will live or die on how quickly decisions can be made. And when data has to travel thousands of miles from device to data center, there’s no guarantee that by the time it has been received, computed and responded to it will still be useful.

Applications such as safety-critical automation, vehicle autonomy, medical imaging and manufacturing all demand a near-instant response to data that’s mere milliseconds old. The latency introduced in asking the cloud to process that weight of data would in many cases reduce its value to zero.
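To put rough numbers on that latency, here is a minimal sketch of the arithmetic. It assumes signals in optical fiber travel at about two thirds of the speed of light (roughly 200,000 km/s) and ignores routing, queuing and radio hops, so real figures will be higher. The distances, processing time and function names are hypothetical, chosen only for illustration.

```python
# Rough round-trip estimate for sending data to a remote server.
# Assumption: signals in optical fiber travel at about two thirds of the
# speed of light in a vacuum, i.e. roughly 200,000 km/s.
FIBER_SPEED_KM_PER_S = 200_000.0


def round_trip_ms(distance_km: float, processing_ms: float = 0.0) -> float:
    """Propagation delay there and back, plus time spent computing."""
    propagation_ms = 2 * distance_km / FIBER_SPEED_KM_PER_S * 1000
    return propagation_ms + processing_ms


def meets_deadline(distance_km: float, processing_ms: float, deadline_ms: float) -> bool:
    """True if the response can arrive before the data stops being useful."""
    return round_trip_ms(distance_km, processing_ms) <= deadline_ms


if __name__ == "__main__":
    # Hypothetical figures: a data center 3,000 km away versus an edge
    # server 10 km away, each taking 5 ms to compute a response.
    for label, km in (("cloud", 3000.0), ("edge", 10.0)):
        print(label, round(round_trip_ms(km, 5.0), 2), "ms")
```

Under these assumptions the distant data center needs about 35 ms per response while the nearby server needs about 5 ms, and that is before any network congestion is taken into account.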

It’s for this reason that many companies are now looking past the cloud to processing AI elsewhere in the compute infrastructure, moving compute nearer the data.

What is Edge AI?

In a world where data’s time to value or irrelevancy may be measured in milliseconds, the latency introduced in transferring data to the cloud threatens to undermine many of the most compelling use cases of the Internet of Things (IoT).

Edge AI moves AI and ML processing from the cloud to powerful servers at the edge of the network, in locations such as offices, 5G base stations and other physical sites very near to their connected endpoint devices. By moving AI compute closer to the data, we cut latency dramatically and retain far more of that data’s value.

Basic devices such as network bridges and switches have given way to powerful edge servers that add data center-level hardware into the gateway between endpoint and cloud. These powerful new AI-enabled edge servers, driven by new platforms such as Arm Neoverse, are designed to increase compute while decreasing power consumption, creating massive opportunities to instrument our cities, factories, farms, and environment to improve efficiency, safety, and productivity.

Edge AI has the potential to benefit both the data and the network infrastructure itself. At a network level, it could be used to analyze the flow of data for network prediction and network function management, while enabling edge AI to make decisions over the data itself offers significantly reduced backhaul to the cloud, negligible latency and improved security, reliability and efficiency across the board.
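The backhaul saving can be illustrated with a small sketch: rather than forwarding every raw reading to the cloud, an edge server reduces each window of readings to a compact summary and forwards only that. The field names, the alert threshold and both function names are hypothetical, and a real deployment would choose its own summary statistics.

```python
from statistics import mean


def summarize_window(readings: list[float], alert_threshold: float) -> dict:
    """Reduce a non-empty window of raw readings to a small summary record."""
    peak = max(readings)
    return {
        "count": len(readings),
        "mean": mean(readings),
        "max": peak,
        # The decision is made at the edge, so the cloud only needs the flag.
        "alert": peak > alert_threshold,
    }


def backhaul_reduction(raw_values: int, summary_values: int) -> float:
    """Fraction of values that no longer need to travel to the cloud."""
    return 1 - summary_values / raw_values


if __name__ == "__main__":
    window = [20.1, 20.4, 19.8, 35.2, 20.0]  # hypothetical sensor readings
    summary = summarize_window(window, alert_threshold=30.0)
    print(summary)
    print(backhaul_reduction(raw_values=1000, summary_values=len(summary)))
```

Summarizing a window of 1,000 readings into four fields removes well over 99 percent of the values that would otherwise cross the network, although the raw detail is then only available at the edge.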

Another key function of edge AI is sensor fusion: combining the data from multiple sensors to create complex pictures of a process, environment or situation. Consider an edge AI device in an industrial application, tasked with combining data from multiple sensors within a factory to predict when mechanical failure might occur. This edge AI device must learn the interplay between each sensor and how one might affect the other and apply this learning in real-time.
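As a deliberately simple sketch of the idea, the code below fuses several sensor readings into one anomaly score by measuring how far each reading sits from its normal operating baseline. It treats the sensors as independent, which is a simplifying assumption: a production system would also learn how the sensors influence one another, as described above. The sensor names, baseline values and function names are all hypothetical.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Baseline:
    """Normal operating statistics for one sensor (assumed learned earlier)."""
    mean: float
    std: float


def z_score(value: float, baseline: Baseline) -> float:
    """How many standard deviations a reading is from its normal level."""
    return (value - baseline.mean) / baseline.std


def fused_anomaly_score(readings: dict[str, float],
                        baselines: dict[str, Baseline]) -> float:
    """Root mean square of the per-sensor z-scores."""
    scores = [z_score(value, baselines[name]) for name, value in readings.items()]
    return sqrt(sum(s * s for s in scores) / len(scores))


if __name__ == "__main__":
    baselines = {
        "vibration_mm_s": Baseline(mean=2.0, std=0.5),
        "temperature_c": Baseline(mean=60.0, std=5.0),
    }
    readings = {"vibration_mm_s": 3.0, "temperature_c": 70.0}
    print(fused_anomaly_score(readings, baselines))
```

A maintenance alert would then be raised when the fused score stays above a threshold chosen for the machine in question.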

There’s also a key security and resilience benefit in moving sensitive data no further than the edge: The more data we move to a centralized location, the more opportunities arise for that data’s integrity to be compromised. As the nature of compute changes, the edge is playing an increasingly crucial role in supporting diverse systems with a range of power and performance requirements. To deliver on service level agreements at scale for enterprises, the edge must embrace cloud-native software principles.

Arm is enabling this through Project Cassini, an open, collaborative, standards-based initiative to deliver a cloud-native software experience across a secure Arm edge ecosystem.

What is Endpoint AI?

Arm defines endpoint devices as physical devices connected to the network edge, from sensors to smartphones and beyond. As so much data is generated at the endpoint, we can maximize the insight we gain from that data by empowering endpoint devices to think for themselves and process what they collect without moving that data anywhere.

Due to their powerful internal hardware, smartphones have long been a fertile test-bed for endpoint AI. A smartphone camera is a prime example: it’s gone from something that takes grainy selfies to being secure enough for biometric authentication and powerful enough for computational photography – adding background blur (or a pair of bunny ears) to selfies in real-time.

This technology is now finding its way into smaller IoT devices. You may hear it referred to as the ‘AIoT’. In February 2020, Arm announced its solution for adding AI into even the smallest Arm-powered IoT devices. The Arm Cortex-M55 CPU and Arm Ethos-U55 micro neural processing unit (microNPU) combine to boost the ML performance of Arm-based IoT solutions by nearly 500 times, while retaining the trademark energy-efficient, cost-effective benefits our technology is known for. This technology will help to bring the benefits of Arm-powered compute to the IoT’s most challenging environments.

TinyML is an emerging sub-field of Endpoint AI, or AIoT, that enables ML processing in some of the very smallest endpoint devices containing microcontrollers no bigger than a grain of rice and consuming mere milliwatts of power.

Of course, endpoint AI also has its limitations: these devices are far more constrained in terms of performance, power and storage than edge AI and cloud AI devices. Data collected by one endpoint AI sensor can also have limited value on its own, as without the ‘top-down’ view of other data streams that sensor fusion at the edge enables, it is harder to see the full picture.

A combined, secure approach

Cloud AI, Edge AI and Endpoint AI each have their strengths and limitations. Arm’s range of heterogeneous compute IP spans the complete compute spectrum, ensuring that whatever your AI workload, Arm has a solution to enable it to be processed efficiently by putting intelligent compute power where it makes the most sense.
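The trade-offs described in this article can be condensed into a rough decision rule. The sketch below is an illustration rather than an Arm tool: the memory limit, the round-trip figures and the `choose_tier` function are all hypothetical defaults that you would replace with measurements from your own system.

```python
from enum import Enum


class Tier(Enum):
    ENDPOINT = "endpoint"
    EDGE = "edge"
    CLOUD = "cloud"


def choose_tier(deadline_ms: float,
                model_kb: float,
                needs_multiple_sensors: bool,
                endpoint_memory_kb: float = 512.0,    # hypothetical device limit
                edge_round_trip_ms: float = 10.0,     # hypothetical measurement
                cloud_round_trip_ms: float = 100.0):  # hypothetical measurement
    """Pick the closest tier that can hold the model and meet the deadline."""
    fits_on_endpoint = model_kb <= endpoint_memory_kb
    # Keep data on the device when the model fits and one sensor is enough.
    if fits_on_endpoint and not needs_multiple_sensors:
        return Tier.ENDPOINT
    # Otherwise go to the edge when the cloud would respond too late.
    if deadline_ms < cloud_round_trip_ms:
        if deadline_ms < edge_round_trip_ms:
            raise ValueError("no tier can meet this deadline with this model")
        return Tier.EDGE
    # Workloads without an urgent deadline can use the cloud.
    return Tier.CLOUD


if __name__ == "__main__":
    print(choose_tier(deadline_ms=5, model_kb=100, needs_multiple_sensors=False))
    print(choose_tier(deadline_ms=50, model_kb=100, needs_multiple_sensors=True))
    print(choose_tier(deadline_ms=1000, model_kb=50_000, needs_multiple_sensors=False))
```

The rule prefers the tier closest to the data and only moves outward when memory, sensor fusion or a relaxed deadline justifies it, which mirrors the argument made in the sections above.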

Most importantly, Arm technology ensures that data used in AI processing remains secure, from cloud to edge to endpoint. The Arm Platform Security Architecture (PSA) provides a platform, based on industry best-practice, that enables security to be consistently designed in at both a hardware and firmware level, while PSA Certified assures device manufacturers that their IoT devices are built secure. Within Arm processors, Arm TrustZone security technology simplifies IoT security and offers the ideal platform on which to build a device that adheres to PSA principles.

Powering innovation through AI

AI is empowering change, driving innovation, and creating exciting new possibilities. Arm is forging a path to the future with solutions designed to support the rapid development of AI. Discover how Arm combines the hardware, software, tools, and strategic partners you need to accelerate development.


Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller and Jack Melling

editorial@arm.com
