Next-generation Armv8.1-M architecture: Delivering enhanced machine learning and signal processing for the smallest embedded devices
By Rhonda Dirvin, senior director, Automotive and IoT Business, Arm
- Arm Helium technology is a new M-Profile Vector Extension bringing enhanced compute capabilities to the Armv8.1-M architecture
- Delivering up to 15x performance uplift for machine learning and up to 5x uplift to signal processing tasks on the smallest of edge devices
- For next-generation Cortex-M processors, aimed at small, embedded devices, where local decision-making is required
The drive towards a world of a trillion connected devices is accelerating and will continue to do so, but only if we can find ways to efficiently expand the compute capabilities on a greater number of constrained devices at the far edge of the network. Increasing the compute capabilities in these devices will immediately open the door for developers to write machine learning (ML) applications directly for the device for decision-making at the source, thus enhancing data security while cutting down on network energy consumption, latency and bandwidth usage.
To achieve this, we’re introducing Arm Helium technology, the M-Profile Vector Extension (MVE) for the Arm Cortex-M series processors that will enhance the compute performance of the Armv8.1-M architecture on the secure foundation of Arm TrustZone. Helium will deliver up to 15x more ML performance and up to 5x uplift to signal processing for future Arm Cortex-M processors, unlocking new market opportunities for our partners where performance challenges have limited the use of low-cost and highly energy-efficient devices.
The next level of compute
Advanced digital signal processing (DSP) is available today through Arm Neon technology in richer Cortex-A based devices. For more constrained applications, Arm also added DSP extensions in its higher performance Cortex-M processors (Cortex-M4, Cortex-M7, Cortex-M33 and Cortex-M35P). Both technologies can be used to accelerate ML compute in certain applications.
For the most constrained embedded systems where energy efficiency is prioritized, historically the solution has been coupling a Cortex processor with a DSP in SoCs, which adds complexity to both hardware and software design. As we look to incorporate more ML capabilities on to these devices, several existing SoC development challenges are intensified and require a higher level of expertise in utilizing different toolchains, programming, debugging and working with complex proprietary security solutions.
Armv8.1-M with Helium eliminates these challenges by delivering real-time control code, ML and DSP execution without compromising efficiency. In turn, millions of software developers will be able to securely scale intelligent applications that take advantage of DSP capabilities across a wider range of devices, enabling enhanced support for emerging applications across three key categories; vibration and motion, voice and sound, and vision and image processing. This will improve the user experience in future devices such as sensor hubs, wearables, audio devices and industrial applications powered by next-generation SoCs based on Cortex-M with Helium technology.
In addition to the added performance and lower development costs, SoC design and development teams will immediately recognize other benefits including:
- Optimized cost, power and design efforts by consolidating functionality
- Simple deployment of TrustZone due to Armv8.1-M being designed to Platform Security Architecture (PSA) specifications
- Single toolchain for both control and signal processing software development
- Easier software development due to a comprehensive set of tools, models and libraries from the well-established Helium ecosystem, many of which are already being used by Cortex-M developers today
Simplifying software development
Software development will be made simpler due to Helium’s unified tool chain, libraries and models. The Helium toolchain includes the Arm Development Studio, encompassing Arm Keil MDK, Arm Models (which are immediately available to developers for code modelling) and various software libraries including CMSIS-DSP and CMSIS-NN, allowing developers to choose the best fit for their needs. And for signal processing applications we make it even simpler by removing the need for a dedicated DSP or function accelerator and eliminating another layer of design complexity.
Powering the next generation of embedded and IoT devices
Helium is the latest example of the value Arm’s Project Trillium brings to ML applications by supporting frameworks and libraries right down to the hardware. There is no one product that fits all as all SoC developers must innovate within different performance, area, power and cost constraints.
Toolchain and models for Helium are available today and we expect Helium to be available in silicon within the next two years. Find out about all Armv8.1-M architecture enhancements in this whitepaper. Read more about how to get access to the Arm tools supporting Helium.
Arm technology is at the heart of a computing and data revolution that is transforming the way people live and businesses operate. Our energy-efficient processor designs and software platforms have enabled advanced computing in more than 200 billion chips and our technologies securely power products from the sensor to the smartphone and the supercomputer. Together with 1,000+ technology partners we are at the forefront of designing, securing and managing all areas of AI-enhanced connected compute from the chip to the cloud.
All information is provided "as is" and without warranty or representation. This document may be shared freely, attributed and unmodified. Arm is a registered trademark of Arm Limited (or its subsidiaries). All brands or product names are the property of their respective holders. © 1995-2021 Arm Group.