CoreLink CCI-400 Cache Coherent Interconnect
The CCI-400 is a high performance, power efficient interconnect designed to interface between processors and the dynamic memory controller, such as the CoreLink DMC-400. It is the first product to implement AMBA® 4 ACE™, which brings system coherency, virtual memory management signalling and barriers.
Coherency enables scaling. The latest SoC designs have more processors and more shared data due to an increase in processor cores and accelerator engines including multimedia, and GPU. These additional processors increase system performance and improve power efficiency but all the data shared between these processors needs to be managed.
To manage shared data there are three techniques:
- Disable caching: all shared memory is written externally to DDR. This is the simplest solution but expensive in external accesses and latency.
- Software managed coherency: any data stored in processor caches must be cleaned and flushed to external memory before passing to accelerators and other hardware. This requires the CPU software to actively manage cached data.
- Hardware managed coherency: the system interconnect ensures all shared data is coherent in the system, reduces external memory accesses and removes the need for software to manage caches. This offers improved performance and power efficiency.
The CCI-400 enables hardware managed coherency between two AMBA 4 ACE processor clusters such as Cortex-A15 and Cortex-A7, allowing scaling of system performance up to 8x cores in total, and supporting big.LITTLE processing.
I/O coherency, or one way coherency, is also supported for up to three accelerator engines implementing the AMBA 4 ACE-Lite protocol.
The CCI-400 benefits are not limited to coherency, this product also supports the virtualization extensions including a low latency connection to a system MMU, such as CoreLink MMU-400, to allow virtualization of hardware devices. This could be used to take advantage of multiple OS’s running on the same hardware, or simply a more efficient way to share limited physical memory.
The CCI-400 also supports the propogation of barriers to enforce the ordering of transactions, while allowing the processors to generate many outstanding transactions, so minimizing CPU stalls awaiting the completion of preceding transactions.








