Login

Important information

This site uses cookies to store information on your computer. By continuing to use our site, you consent to our cookies.

ARM websites use two types of cookie: (1) those that enable the site to function and perform as required; and (2) analytical cookies which anonymously track visitors only while using the site. If you are not happy with this use of these cookies please review our Privacy Policy to learn how they can be disabled. By disabling cookies some features of the site will not work.

Cortex-R4 Processor

Cortex-R4 Processor Image (View Larger Cortex-R4 Processor Image)

The Cortex®-R4 processor is the first deeply embedded real-time processor based on the ARMv7-R architecture. It is for use in high-volume, deeply embedded System-on-Chip applications, for example, hard disk drive controllers, wireless baseband processors, consumer products and electronic control units for automotive systems.

The Cortex-R4 processor delivers substantially higher performance, real-time responsiveness, reliability and dependability with high error resistance, and offers more features than other processors in its class. This processor offers excellent energy efficiency and cost effectiveness for ASIC, ASSP and MCU embedded applications. The Cortex-R4 processor is very flexible and can be configured at synthesis time to optimize its feature set for a precise match with application requirements.

The Cortex-R series processors contribute to safe system design by enabling the processor to operate deterministically and not have its execution blocked for an unpredictable number of cycles by the external memory system or other bus masters; thereby enabling high availability.

 


Cortex-R4

The Cortex-R4 processor, designed for implementation on advanced silicon processes with an emphasis on improved energy efficiency, real-time responsiveness, advanced features and ease of system design. The processor provides a highly flexible and efficient two-cycle local memory interface, enabling SoC designers to minimize system cost and energy consumption.

Summary of Cortex-R4 Key Features

  • Fast – high performance 1.66 DMIPS/MHz
    • Power-efficient, 8-stage dual issue pipeline with instruction pre-fetch and branch prediction
    • ARMv7R architecture - Thumb-2 / ARM instructions
    • Hardware divide, SIMD, DSP, SP/DP FPU option
    • Harvard I + D caches, 64-bit AMBA AXI-3
  • Deterministic – fast interrupt response
    • Vectored Interrupt Controller port
    • Low Latency Interrupt Mode (LLIM) to accelerate interrupt entry whenever possible without waiting for the current instruction or memory access to complete
    • Tightly-Coupled Memory system which provides a second level-1 memory besides the cache for storing critical code and data such as interrupt service routines which can then be immediately executed without waiting for cache evictions and fetches from main memory
    • Low Latency Peripheral Port (LLPP) which provides for peripheral device access with a guaranteed low latency
  • Reliable – error handling built into core
    • Memory Protection Unit
    • ECC and Parity protection on L1 memories
    • Dual core lock-step configuration
  • Cost-effective and low cost of ownership

ARM Cortex-R4 Processor

Feature

Description

Micro-architectureEight-stage pipeline with instruction pre-fetch, branch prediction and selected dual-issue execution. Parallel execution paths for load-store, MAC, shift-ALU, divide and floating point. 1.66 Dhrystone MIPS/MHz. Hardware divider. Binary compatibility with classic ARM9 and ARM11 embedded processor famlies.
Instruction SetARMv7-R architecture with Thumb-2 and thumb. DSP extensions. Optional floating-point unit.
Cache controllersHarvard memory architecture with optional integrated Instruction and Data cache controllers. Cache sizes configurable from 4 to 64 KB. Cache lines are either write-back or write-through.
Tightly-Coupled MemoriesOptional Tightly-Coupled Memory interfaces. Use TCMs for highly deterministic or low-latency applications that may not respond well to caching, e.g. instruction code for interrupt service routines and data that requires intense processing. One or two logical TCMs, A and B, can be used for any mix of code and data. TCM size can be up to 8 MB. TCM B has two physical ports, B0 and B1, for interleaving incoming DMA data streams.
Interrupt interfaceStandard interrupt, IRQ, and non-maskable fast interrupt, FIQ and inputs are provided together with a VIC interrupt controller vector port. The GIC interrupt controller can also be used if more complex priority-based interrupt handling is required. The processor includes low-latency interrupt technology that allows long multi-cycle instructions to interrupted and restarted. Lengthy memory accesses are also deferred in certain circumstances. Worst-case interrupt response can be as low as 20-cycles using the FIQ alone.
Memory Protection UnitOptional MPU configures attributes for either eight or twelve regions, each with resolution down to 32 Bytes. Regions can overlap, and the highest numbered region has highest priority.
Floating Point UnitOptional Floating Point Unit (FPU) implements the ARM Vector Floating Point architecture VFPv3 with 16 double-precision registers, compliant with IEEE754. The FPU performance, optimized for single precision calculations, has full support for double precision. Operations include add, subtract, multiply, divide, multiply and accumulate, square root, conversions between fixed and floating-point, and floating-point constant instructions.
ECCOptional single-bit error correction and two-bit error detection for cache and/or TCM memories with ECC bits. Single-bit soft errors are automatically corrected by the processor.
ParityOptional support for parity bit error detection in caches and/or TCMs.
Master AXI bus64-bit AMBA AXI bus master for Level-2 memory and peripheral access.
Slave AXI busOptional 64-bit AMBA AXI bus slave port allows DMA masters to access the dual-port TCM B interface for high speed streaming of data in and out of the processor.
DebugDebug Access Port is provided. Its functionality can be extended with DK-R4.
TraceAn interface suitable for connection to CoreSight Embedded Trace Module is present.
Dual coreA dual processor configuration implements a redundant Cortex-R4 CPU in lock step with offset clocks and comparison logic for fault tolerant/fault detecting dependable systems.
ConfigurationSynthesizable Verilog RTL with facility to configure options for synthesis.

Cortex-R4 Performance Power and Area

Processor area, frequency and power consumption are highly dependent on process, libraries and optimizations. The table below estimates a typical single processor implementation of the Cortex-R4 processor on mainstream low power process technology (40nm LP) with high-density, standard-performance cell libraries and 32KB instruction cache and 32KB data cache.

Cortex-R4 Single Processor

40nm LP

Maximum clock frequencyAbove 600MHz 
Performance1.66 DMIPS/MHz 
Total area (Including Core+RAM+Routing)From 0.5 mm2
EfficiencyFrom 24 DMIPS/mW 

Use ARM System IP, Development Tools and Physical IP to implement complete Cortex-R4 systems.

CoreLink™ and CoreSight™ System IP

NIC-400Configurable hierarchic low latency interconnect for AMBA 3 AXI, AHB-Lite and APB components. Configurations can range from a single bridge component, such as an AHB to AXI protocol bridge, to a large infrastructure of 128 masters and 64 slaves in combinations of different AMBA protocols.
QOS-400Added to NIC-400 to minimize average latency and guarantee worst-case latency and bandwidth of critical interfaces such as DDR memory.
DMC-34xDynamic memory controllers providing highly efficient interfaces to DRAM by leveraging AXI interconnect features to optimize memory request scheduling and using built-in Quality of Service controls to manage the initiator's latency and bandwidth requirements. Memory types supported include SDR, DDR, LPDDR (Mobile DDR), eDRAM, DDR2 and LPDDR2 (Mobile DDR2).
SMC-35xStatic memory controllers interface AXI interconnects to a range of non-volatile memories with highly configurable parameters. Memory types supported include SRAM, NAND Flash and NOR Flash.
L2C-310Level-2 cache controller designed to boost performance while reducing overall traffic to system memory and therefore SoC energy consumption. Reducing demands on off-chip memory bandwidth frees up resources for other masters.
DMA-330A highly flexible micro-programmable Direct Memory Access controller for high-end high-performance energy-efficient AXI-based processing systems.
PL192An AMBA AHB advanced Vectored Interrupt Controller (VIC) supporting up to 32 vectored interrupts with programmable priority level and masking.
GIC400An AMBA AHB and AXI scalable, configurable, low gate count Interrupt Controller that stores vector addresses in memory. Options include multi-processor and TrustZone support.
ETM-R4The Embedded Trace Macrocell provides real-time instruction and data trace and is configured to capture information before and after a specified sequence of events with the processor at full speed.
DK-R4A complete Debug Kit including ETM-R4 and a fully featured Debug Access Port (DAP) to complement the DAP-Lite shipped with every Cortex-R4. DK components include DAP, cross trigger, ETM, AMBA bus trace, serial wire debug, trace funnel, trace buffer, trace port interface and serial wire viewer.

Development Tools

The ARM Development Suite 5 (DS-5™) tool suite, as well as a wide range of third party tools, operating systems and EDA flows support all Cortex-R series processors. ARM DS-5 software development tools are unique in their ability to provide solutions that take full advantage of the complete ARM technology portfolio. Tools specific to Cortex-R4 are:

ARM DS-5ARM Compiler 5.0 with Thumb-2 optimized for Cortex-R4.
Fast ModelsWith ARM Fast Models, software development can begin prior to silicon availability. These extensively validated programmer’s view models provide access to ARM-based systems suitable for early software development.
Versatile ExpressThe Versatile™ Express family of development platforms provides the right environment for prototyping the next generation of system-on-chip designs.
Soft Macrocell ModelA Soft Macrocell Model (SMM) is an FPGA implementation of an ARM processor, built with ARM development boards

Physical IP

ARM optimized Physical IP platforms for best-in-class implementations of Cortex-R4 on leading semiconductor process technologies.

Standard cell logic librariesAvailable in a variety of different architectures, ARM Standard Cell Libraries support a wide performance range for all types of SoC designs. Designers can choose between different libraries and optimize their designs for speed, power and/or area.
Memory compilers and registersA broad array of silicon proven SRAM, Register File and ROM memory compilers for all types of SoC designs ranging from performance critical to cost sensitive and low power applications.
Interface IPA broad portfolio of silicon-proven Interface IP designed to meet varying system architectures and standards. General Purpose I/O, Specialty I/O, High Speed DDR and Serial Interfaces, optimized to deliver high data throughput performance with low pin counts.

White Papers 



 
» 
Latest Forum Posts
» 
Blogs
 
» 
Cortex-R4 Powered Products
Go Left
Go Right

Maximise