Overview

Analyze and Optimize Workloads on Arm Neoverse

Arm Performix is a performance analysis toolkit for developers building and running large application workloads on Arm Neoverse-based servers. It collects performance counter data directly from the hardware as applications run and presents clear metrics and insights that help identify performance bottlenecks efficiently.

Designed for cloud and infrastructure use cases, Arm Performix simplifies performance analysis on Arm by combining data collection with guided, accessible analysis.

Download Toolkit

Download Arm Performix

Download the full toolkit (UI + CLI), or choose the CLI only.

  • Windows: Download (x86)
  • Mac: Download (x86)
  • Linux: Download (x86)

Download Arm Performix CLI

  • Windows: Download CLI (x86)
  • Mac: Download CLI (x86)
  • Linux: Download CLI (x86)

Arm Community Forum

Need help with Arm Performix or want to share feedback with Arm?

Visit Community Forum

Developer Resources

Get Started with Learning Paths and User Guide

Arm Performix Install Guide.

Start Learning Path

Find code hotspots with Arm Performix.

Start Learning Path

Tune application performance with Arm Performix CPU microarchitecture analysis.

Start Learning Path

Learn how to capture and analyze performance data with the Arm Performix User Guide.

Read User Guide

Benefits and Features

Purpose-Built Performance Optimization

Built for the Arm Infrastructure Ecosystem

Arm Performix is regularly tested on Arm Neoverse platforms such as AWS Graviton, Microsoft Cobalt, and Google Axion, and supports bare-metal and virtualized environments where performance counter data is available. It is designed for:

  • Cloud backend and application developers
  • Platform and system software teams
  • Core library, runtime, and framework developers
  • Performance and infrastructure engineers

Why Arm Performix


Performance efficiency is critical for scalability, cost, and reliability in modern infrastructure. Arm Performix helps developers:

  • Identify bottlenecks faster
  • Focus optimization efforts where they matter
  • Validate platform behavior with confidence
  • Maintain performance over time

Tool Benefits and Features

Accelerated Insights

Identify bottlenecks in minutes and build confidence that you’re fixing the right problems. Visualize code hotspots and CPU microarchitecture behavior at a glance to eliminate trial and error and spend more time on improvement.

Optimization, Simplified

Performance analysis should be free, open, and accessible to all. Our analysis combines detailed profiling with deep architectural expertise and easy-to-follow suggestions, giving actionable insights to newcomers and performance experts alike.

Seamless Integration

Built for agentic AI workflows, Arm Performix makes its analysis and insights available from within your favorite editor. DevOps engineers can automate profiling and regression tracking through their existing CI flow, enabling early detection and resolution of issues.

FAQs

What is Arm Performix?

Arm Performix is Arm’s performance profiling and analysis toolkit for developers building and running performance-critical workloads on Arm-based systems. It profiles running applications, collects system- and hardware-level metrics, and translates them into guided insights that help identify CPU, memory, and system bottlenecks.

Arm Performix supports large-scale cloud, infrastructure, and AI workloads on Arm platforms including AWS Graviton, Microsoft Cobalt, Google Axion, and next-generation Arm solutions.

Who is Arm Performix designed for?

Arm Performix is built for engineers responsible for validating, profiling, and optimizing workloads on Arm across development, deployment, and production:

  • System and Platform Engineers – platform bring-up, firmware, kernel, hardware validation
  • Core Library Developers – C/C++, SIMD/vectorization, performance-critical primitives
  • Runtime and Compiler Engineers – JITs, toolchains, AI frameworks, distributed runtimes
  • Cloud and Backend Developers – hyperscale services, SaaS platforms, CI/CD regression monitoring

What problems does Arm Performix solve?

Arm Performix reduces root cause analysis time from hours to minutes and lowers the barrier to production-grade performance optimization on Arm. It helps engineers analyze:

  • Multi-core, NUMA, and heterogeneous compute architectures
  • Performance interactions across CPU, memory, networking, and storage
  • OS-level, hardware-level, and application-level behavior

How does Arm Performix work?

With a single command, Arm Performix profiles applications and systems on Arm-based servers, connecting to a local, on-prem, or cloud-based Arm system. Once connected, it:

  • Profiles a running workload
  • Collects system-level and application-level performance counter data
  • Identifies bottlenecks across compute, memory bandwidth, and core utilization
  • Provides guided next steps for optimization

Rather than exposing raw counters alone, Arm Performix explains what metrics mean and why they matter, enabling faster, more confident optimization decisions.
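
To illustrate the gap between raw counters and metrics that are easier to interpret, here is a minimal Python sketch that turns three counter totals into two commonly used derived metrics: instructions per cycle (IPC) and L1 data cache refills per thousand instructions. The event names follow the standard Arm PMU events `CPU_CYCLES`, `INST_RETIRED`, and `L1D_CACHE_REFILL`; the `derive_metrics` helper and the sample values are hypothetical, and this is not a description of how Arm Performix computes or reports its own metrics.

```python
def derive_metrics(counters):
    """Turn raw PMU counter totals into two derived metrics.

    `counters` is assumed to be a dict keyed by Arm PMU event name.
    This helper is illustrative only, not part of Arm Performix.
    """
    cycles = counters["CPU_CYCLES"]
    instructions = counters["INST_RETIRED"]
    refills = counters["L1D_CACHE_REFILL"]
    return {
        # Instructions retired per CPU cycle.
        "ipc": instructions / cycles,
        # L1 data cache refills per 1000 retired instructions.
        "l1d_mpki": 1000 * refills / instructions,
    }


# Hypothetical counter totals, chosen only to make the arithmetic easy to follow.
sample = {
    "CPU_CYCLES": 4_000_000,
    "INST_RETIRED": 6_000_000,
    "L1D_CACHE_REFILL": 120_000,
}

metrics = derive_metrics(sample)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

A ratio such as IPC is easier to compare across runs than the raw totals, which is the kind of interpretation step the guided analysis is meant to take care of.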

What platforms and environments are supported?

Arm Performix supports Arm Neoverse-based systems running Linux, including cloud environments such as AWS Graviton, Microsoft Cobalt, Google Axion, and next-generation Arm solutions. It is designed for both on-premises data center deployments and large-scale public cloud infrastructure.

How can I access Arm Performix? Is there a fee?

Arm Performix is free to download and use, so you can start analyzing and optimizing performance right away.

Can Arm Performix be integrated into CI/CD pipelines?

Yes. Arm Performix produces machine-readable output suitable for automation and regression tracking. Engineering teams can integrate it into CI/CD pipelines to:

  • Detect performance regressions early
  • Track optimization impact over time
  • Compare performance across architectures
  • Validate scaling characteristics before production rollout

This makes continuous performance validation a standard part of cloud-native development workflows.
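
As a rough sketch of how such a check could look in a pipeline, the Python snippet below compares two sets of metrics and flags any that got worse by more than a tolerance. The flat JSON layout, the metric names, the 5% threshold, and the `find_regressions` helper are all assumptions made for illustration; the actual Arm Performix output format is not described here, so the parsing step would need to be adapted to it.

```python
import json


def find_regressions(baseline, current, tolerance=0.05):
    """Return {metric: relative_change} for metrics that grew beyond `tolerance`.

    Assumes every metric is "lower is better" and that both inputs are
    flat dicts of metric name to number (a hypothetical layout).
    """
    regressions = {}
    for name, base_value in baseline.items():
        # Skip metrics missing from the new run or with an unusable baseline.
        if name not in current or base_value <= 0:
            continue
        change = (current[name] - base_value) / base_value
        if change > tolerance:
            regressions[name] = change
    return regressions


# Hypothetical results from a baseline run and a candidate run.
baseline = json.loads('{"wall_time_s": 10.0, "cycles_per_request": 2000.0}')
current = json.loads('{"wall_time_s": 10.2, "cycles_per_request": 2300.0}')

regressions = find_regressions(baseline, current)
for name, change in sorted(regressions.items()):
    print(f"REGRESSION {name}: {change:+.1%}")
```

In a CI job, a non-empty `regressions` result would typically be turned into a failing exit status so the change is blocked until someone has looked at it.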

What level of expertise do I need to use Arm Performix?

Arm Performix is designed to be accessible to all engineers without requiring deep Arm microarchitecture knowledge. It provides guided, architecture-aware insights that contextualize performance data and highlight root causes directly.

How is Arm Performix different from traditional performance tools?

Traditional CPU profiling and performance analysis tools often expose raw hardware counters without guidance. Arm Performix differs by providing:

  • A unified data collection and analysis workflow
  • Cross-layer visibility (system and application)
  • Bottleneck identification across compute and memory subsystems
  • Actionable, architecture-aware recommendations
  • Automation-ready outputs

It eliminates tool-hopping and manual counter interpretation, delivering production-ready performance insight tailored specifically for Arm environments.

How does Arm Performix support AI and cloud optimization?

For AI inference services, distributed runtimes, and large-scale backend systems, performance efficiency directly impacts infrastructure cost, fleet sizing, and power consumption.

Arm Performix helps teams profile AI inference workloads and cloud services running on Arm infrastructure by:

  • Validating workload scaling characteristics
  • Optimizing memory bandwidth utilization
  • Identifying CPU bottlenecks in inference pipelines
  • Increasing confidence in workloads migrated to Arm from x86

This ensures performance transparency across modern AI and cloud deployments.