Login

重要なお知らせ

このサイトはcookieを利用して、コンピュータに情報を保存しています。続けるには、同意が必要です。 cookie.

ARMのWebサイトでは2種類のcookieを利用しています:(1)サイトの機能を有効にし、要求に対して素早く反応できるようにするもの。(2)分析のためのcookieで、当サイト利用時に限り匿名でWeb訪問者をトラックするもの。cookieの利用に同意されない場合は、弊社のプライバシーポリシーをご確認いただき、cookieを無効にする方法を選択できます。cookieを無効にすると、サイトのいくつかの機能が使用できなくなりますのでご注意ください

Cortex-A53 Processor

Cortex-A53 Processor Image (View Larger Cortex-A53 Processor Image)
The Cortex™-53 processor is ARM's most efficient application processor ever, delivering today's mainstream smartphone experience in a quarter of the power in the respective process nodes. 

The Cortex-A53 extremely power efficient ARMv8 processor is capable of supporting 32-bit ARMv7 code and 64-bit code in the AArch64 execution state. It delivers more performance at higher power efficiency than the Cortex-A7 processor, and is capable of deployment as a standalone main applications processor or as a companion to Cortex-A57 in a big.LITTLE processor.

The Cortex-A53 processor can be implemented individually or paired with the Cortex-A57 processor in a big.LITTLE configuration for optimum performance, scalability and energy efficiency.

 


Using ARM's big.LITTLE technology the Cortex-A53 processor will efficiently run your connected life. This connected life will not just be the social media, news and email updates that you receive today, but will also enable devices to be contextually aware and connect to the Internet of things. Contextual awareness is already happening in smartphones today, but with the Cortex-53 processor's ultra efficiency will allow your smartphone to use its sensor information, calendar, contact information and location-based services to provide relevant information.

The Cortex-A53 processor

  • Can deliver the compute power of today’s high-end smartphone, in lowest power and area footprint, enabling all-day battery life for typical device uses 
  • Eficiently run legacy ARM 32-bit applications
  • Features cache coherent interoperability with ARM Mali™ family graphics processing units (GPUs) for GPU compute applications
  • Offers optional reliability and scalability features for high-performance enterprise applications
  • Connects seamlessly to ARMs interconnect with up to 16 cores configurations with more in the future

The Cortex-A53 processor delivers significantly more performance than its predecessors at a higher level of power efficiency, effectively taking the performance of the LITTLE core above that of the Cortex-A9 processor, which defines today's high-end mobile platforms.

The performance graph below shows measured results on Cortex-A9 and early Cortex-A7 test platforms. The results are projected to deliver performance in a quad-core configuration on a "rate" benchmark that tests integer and some floating-point performance with a mix of large and medium data sets. These stress the memory system of each CPU and the L2 cache. The "rate" portion of the benchmark duplicates the same code on the 2nd, 3rd, and 4th CPUs in each system, and measures the delivered aggregate performance. Rate benchmarks like this evaluate the ability of the multi-processing system to handle memory traffic and coherence requirements in a multiprocessing context.

The Cortex-A53 is able to deliver more performance than Cortex-A9 systems of comparable speed on this benchmark. The performance graph below shows measure results running various Android™ benchmarks on dual-core Cortex-A9, dual-core Cortex-A7, and projected results on dual-core Cortex-A53 platforms based on measured uplift vs. Cortex-A7 on similar types of code. The results show that Cortex-A53 delivers comparable but slightly lower performance relative to Cortex-A9 running at the same frequency, while Cortex-A53 delivers higher performance than Cortex-A9 at the same frequency.

BenchmarkCortex-A5Cortex-A7Cortex-A53*
Dhrystone (DMIPS/MHz)1.61.92.3
CoreMark (CoreMark/MHz)2.32.63.0
SPEC Int 2000 (base)290350450**

* Estimated
** Provisonal


Cortex-A53 MPCore
ArchitectureARMv8
Multicore
  • 1-4X SMP within a single processor cluster
  • Multiple coherent SMP processor clusters through AMBA® 4 technology
ISA Support
  • AArch32 for full backward compatibility with ARMv7
  • AArch64 for 64b support and new architectural features
  • TrustZone® security technology
  • NEON™ Advanced SIMD
  • DSP & SIMD extensions
  • VFPv4 Floating point
  • Hardware virtualization support
Debug & TraceCoreSight™ DK-A53


Cortex-A53 Microarchitectural Features
FeatureBenefitsAArch32AArch64
ARM v8 architecture64 and 32-bit execution states for scalable high performanceYesYes
Hardware-accelerated cryptography3x-10x better software encryption performance Useful for small granule decrypt/encrypt too small to efficiently offload to HW accelerator (e.g. https)YesYes
NEON technologyCan accelerate multimedia and signal processing algorithms such as video encode/decode, 2D/3D graphics, gaming, audio and speech processing, image processing, telephony, and sound synthesis. Also useful in accelerating floating point code with SIMD execution.YesYes
Floating point unitHardware support for floating point operations in half-, single- and double-precision floating point arithmetic.Now with IEE754-2008 enhancements.YesYes
Load Acquire, Store Release instructionsDesigned for C++11, C11, Java memory models. Improves performance of thread-safe code by eliminating explicit memory barrier instructionsYesYes
Large Physical Address reachEnables the processor to access beyond 4GB of physical memory.YesYes
TrustZone® TechnologyEnsures reliable implementation of security applications ranging from digital rights management to electronic payment.YesYes
Hardware VirtualizationEnables multiple software environments and their applications to simultaneously access the system capabilitiesYesYes
Automatic event signallingFor power-efficient, high-performance spinlocks.YesYes
Double Precision Floating Point SIMDAllows SIMD vectorisation to be applied to a much wider set of algorithms (e.g. scientific / High Performance Computing (HPC) and supercomputer).NoYes
64-bit Virtual address reachEnables virtual memory beyond 4GB 32b limit. Important for modern desktop and server software using memory mapped file i/o, sparse addressing.NoYes
Larger register files31 x 64b general purpose registers: increases performance, reduces stack use. Fewer stack spills, enabling more aggressive compilers. SIMD usable for more applications, e.g. HPCNoYes
Efficient 64-bit immediate generationLess need for literal poolsNoYes
Large PC-relative addressing range(+/-4GB) for efficient data addressing within shared libraries and position-independent executablesNoYes
Tagged PointersUseful for dynamically typed languages such as Javascript, and for garbage collectionNoYes
64k pagesReduce TLB miss rates and depth of page walksNoYes
New exception modelReduces OS and Hypervisor software complexityNoYes
Enhanced Cache managementUser space cache operations improve dynamic code generation efficiency, Data Cache Zero for fast clearNoYes


Cortex-A53 Architectural Features
FeatureBenefits
In-Order PipelineLower power consumption. Performance improvements are sought elsewhere in the design, e.g. the memory system and issue capability.
Increased dual-issue capabilityIncreased peak instruction throughput via duplication of execution resources, and dual instruction decoders.
Power optimized L2 cacheEfficiency optimized L2 cache design delivers lower latency and balances performance with efficiency.
512 entry main TLBImproved performance on code with complex memory access patterns, e.g. web browsing. Larger main TLB than Cortex-A7 and Cortex-A9.
Small, fast uTLBs10 entry uTLB with an extremely short miss penalty to reload from the main TLB allows excellent performance in a small area and power footprint.
Advanced Branch Predictor4Kbit Conditional Predictor, 256 entry indirect predictor increase branch hit rate.
64B cache linesFully aligned with Cortex-A57 microarchitecture to simplify cache management software in big.LITTLE systems. 64B line sizes a good tradeoff for modern memory access patterns.
Non-blocking I-fetch with multi-line pre-fetchIncreased instruction throughput across more types of benchmarks, from control code to processing intensive loops.
Dual identical ALU pipelinesIncreased opportunity to dual-issue instructions, at a small additional area.
64b store pathBalances store bandwidth with dynamic power consumption, focused on a highly efficient design tradeoff.
Multi-stream pre-fetcherGreater data flow into the main datapath increases overall performance on a wide range of code.
Increased D-side throughput3-outstanding load miss capability (per-core, excluding prefetches); 8-outstanding transactions (per-core)
Extensive power-saving featuresHeirarchical clock gating, power domains, advanced retention modes.


Cortex-A53 Advanced Multicore Features
The processor also utilizes the widely established ARM MPCore multicore technology, enabling performance scalability and control over power consumption to exceed the performance of today's comparable high-performance devices while remaining within tight mobile power constraints. Multicore processing provides the ability for any of the four component processors, within a cluster, to shut down when not in use, for instance when the device is in standby mode, to save power. When higher performance is required, every processor is in use to meet the demand while still sharing the workload to keep power consumption as low as possible.
Snoop Control UnitThe SCU is responsible for managing the interconnect, arbitration, communication, cache-2-cache and system memory transfers, cache coherence and other capabilities for the processor. The Cortex-A53 MPCore processor also exposes these capabilities to other system accelerators and non-cached DMA driven peripherals to increase performance and reduce system wide power consumption. This system coherence also reduces software complexity involved maintaining software coherence within each OS driver.
Accelerator Coherence PortThis AMBA 4 AXI™ compatible slave interface on the SCU provides an interconnect point for masters that are better interfaced directly with the Cortex-A53 processor. This interface supports all standard read and write transactions without additional coherence requirements. However, any read transactions to a coherent region of memory will interact with the SCU to test whether the information is already stored in the L1 caches. The SCU will enforce write coherence before the write is forwarded to the memory system and may allocate into the L2 cache, removing the power and performance impact of writing directly to off chip memory
Generic Interrupt ControllerImplementing the standardized and architected interrupt controller, the GIC provides a rich and flexible approach to inter-processor communication and the routing and prioritization of system interrupts. Under software control, each interrupt can be distributed across CPU, hardware prioritized, and routed between the operating system and TrustZone software management layer. This routing flexibility and the support for virtualization of interrupts into the operating system, provides one of the key features required to enhance the capabilities of a solution utilizing a hypervisor.

The Cortex-A53 MPCore processor incorporates a broad range of ARM technology including System IP, Physical IP, and development tools that also provide support. A broad range of SoC and software design solutions, tools and services from the ARM Connected Community™ compliment this technology. That provides ARM Partners with a smooth path through the development, verification and production of full function, compelling devices while significantly reducing time-to-market.

System IP

The ARM ™ interconnect and memory controller IP addresses the critical challenge of efficiently moving and storing data between up to 16 Cortex-A50 MPCore processors, high performance media processors and dynamic memories to optimize the system performance and power consumption of the SoC. The CoreLink system IP enables SoC designers to maximize the utilization of system memory bandwidth and reduce static and dynamic latencies. While the ARM CoreSight technology provides complete on-chip debug and correlated, real-time trace visibility for all cores of the Cortex-A53 MPCore processor, reducing risk and speeding development of high quality multiprocessing software. The new AMBA® 4 Cache Coherent Network (CCN) provides Optimum system bandwidth and latency. The CCN provides AMBA 4 AXI™ Coherency Extensions (ACE) compliant ports for full coherency between multiple Cortex-A53 MPCore processors, better utilizing caches and simplifying software development. This feature is essential for high bandwidth applications including gaming, servers and networking that require clusters of coherent single and multicore processors. Combined with the ARM CoreLink network interconnect and memory controller IP, the CCN increases system performance and power efficiency.


Physical IP

ARM Physical IP Platforms deliver process optimized IP, for best-in-class implementations of the Cortex-A53 processor at 40nm and below. A set of high performance Optimization Packs (POP™) containing advanced ARM Physical IP for 28nm technologies supports the Cortex-A53, to enabling rapid development of leadership physical implementations. ARM is also working early to assure a roadmap to 20nm optimizations. Optimization packs support ARM’s strategy of offering specifically targeted Physical IP to enable Partners to achieve tuned implementations of ARM cores. ARM is uniquely able to design the optimization packs in parallel with the Cortex-A53 MPCore processor architecture, enabling the processor and physical IP combination to deliver workstation class performance in a mobile power envelope while facilitating rapid time-to-market.


Tools Support

The ARM Development Suite 5 (DS-5™) tool suite fully supports all ARM processors as well as a wide range of third party tools, operating systems and EDA flows. ARM DS-5 software development tools are unique in their ability to provide solutions that take full advantage of the complete ARM technology portfolio. The ARM Development Studio 5 (DS-5™) provides a complete range of software tools to create, debug and optimize systems based on the Cortex-A53 MPCore processor. It incorporates the DS-5 Debugger, whose powerful and intuitive graphical environment enables fast debugging of bare metal, Linux and Android native applications. In addition, its new ARM Streamline™ Performance Analyzer simplifies the identification of hot spots in software and load balancing between cores. The ARM Compiler, which already includes specific optimizations for the Cortex-A15 MPCore processor, enables early software development before silicon availability and an ARM Versatile™ Reference Virtual Platform built on ARM Fast Models technology. This Virtual Platform is available for a free 6-month evaluation.


 

Graphics Processors

The Mali™ family of products combine to provide the complete graphics stack for all embedded graphics needs, enabling device manufacturers and content developers to deliver the highest quality, cutting edge graphics solutions across the broadest range of consumer devices.


 

Support

ARM training courses and Active Assist on-site system-design advisory services enable licensees to integrate efficiently the Cortex-A53 MPCore processor into their design to realize maximum system performance with lowest risk and fastest time-to-market.

 


Maximise