Everything You Need

Build reliable and optimized code for the right results on multiple Server and HPC architectures, from the latest compilers and C++ 11 standards to Intel, 64-bit Arm,  AMD, OpenPOWER and Nvidia GPU hardware. Arm Forge combinesArm DDT, the leading debugger for time-saving high performance application debugging, and Arm MAP, the trusted performance profiler for invaluable optimization advice.

Features and Benefits
Cross Platform

Moving to a new architecture or system is challenging enough without having to learn a new tool chain at the same time. Arm DDT and MAP run everywhere — on your own laptop, the latest supercomputer, and tomorrow's upcoming architectures. With Forge you can automatically detect memory bugs and profile behavior at all scales on Arm 64-bit, Intel Xeon, Intel Xeon Phi, NVIDIA GPUs , and OpenPOWER.

Fast Debug

Arm DDT is the debugger of choice for developing of C++, C or Fortran parallel, and threaded applications on CPUs, GPUs and Intel Xeon Phi. Its powerful intuitive graphical interface helps you easily detect memory bugs and divergent behavior at all scales, making Arm DDT the number one debugger in research, industry and academia.

Low-overhead Profiling

Profile your code without distorting application behavior. Arm MAP is Arm Forge's scalable low-overhead profiler of C++, C, and Fortran with no instrumentation or code changes required. It helps developers accelerate their code by revealing the causes of slow performance. From multicore Linux workstations to the largest supercomputers, you can profile realistic test cases with typically less than 5% runtime overhead.

Short Learning Curve

Arm DDT offers a powerful intuitive GUI that sets the standard for multi-process and multi-threaded debugging. Complex software debugging is made simple whether you're working on a PC or offline, with the help of zero-click variable comparisons, built-in memory debugging, and powerful array visualizations — for today's increasingly parallel processors, clusters, and supercomputers. 

Wide Issue Coverage

Arm MAP exposes a wide set of performance indicators, including MPI metrics, PAPI counters, IO  metrics, energy metrics and even your own custom metrics. Profile computation (with self and child and call tree representations over time), thread activity (to identify over-subscribed cores and sleeping threads that waste CPU time for OpenMP and pthreads), instruction types, as well as synchronization and I/O performance.

Single and Multi Threaded Profiling

Arm MAP profiles parallel, multithreaded, and single threaded C, C++, Fortran, and F90 codes, providing in-depth analysis and bottleneck pinpointing to the source line. Unlike most profilers , it can profile pthreads, OpenMP or MPI for parallel and threaded code, including communication and workload imbalance issues for MPI and multi-process codes.

Explore More
Arm Forge Editions


Arm Forge Arm Forge Pro
Scalable profiling   √  √
Memory debugging
 √  √
CPU, MPI, I/O profiling
 Edit and compile  √
Native remote client   √   √
Version control integration 



CPU hardware counters    -  √
Full GPU profiling and debugging    Optional extra  √
  Energy performance metrics -  √
 Custom metrics API -  √
Case Studies
Software Development

By analyzing issues that slow simulations, Arm tools help improve the throughput of computer aided engineering software on supercomputers and clusters. Specifically tailored to parallel and high performance code, the tools help ISVs, open source developers, and in-house code teams increase the throughput, reliability, and scalability of their software.

Talk with an Expert

Learn how Arm Forge can accelerate your software development.

Contact Us
Related Products and Services
Explore Other Arm Tools and Software
Allinea Studio

Allinea Studio

Arm Allinea Studio is a complete suite of high performance tools for developing Arm-based HPC solutions. It includes Arm Forge, Arm C/C++ Compiler, Arm Fortran Compiler, Arm Performance Libraries, and Arm Performance Reports.

Arm Performance Reports

Arm Performance Reports

Arm Performance Reports provide detailed information on a wide area of performance indicators, including: time spent in various types of instructions, memory usage, and time spent in thread computation and synchronization.