Build and Scale AI With PyTorch on Arm Cloud
This is your technical guide to setting up, optimizing, and deploying high-performance AI models on Arm cloud infrastructure using PyTorch and open-source tools.
Benefits of Running PyTorch on Arm Cloud
Running PyTorch on Arm-based CPUs like AWS Graviton, Microsoft Cobalt, Nvidia Grace, Google Axion and others provides developers with a cost-effective and scalable way to deploy AI inference workloads using high-performance CPUs. By leveraging Arm architectural features like NEON and SVE, PyTorch can execute matrix-heavy models such as LLMs and Transformers efficiently. This enables greater portability, lower memory usage, and easier integration into production pipelines. Using Llama.CPP? Go to the Llama.CPP Developer Launchpad.
Get Started
Set Up
To start developing with PyTorch on Arm-based systems, you’ll first need to set up your development environment. This includes installing PyTorch, validating your setup, and running your first model using beginner tutorials.
Before starting, make sure you have:
- A 64-bit Arm CPU (Armv8-A or newer)
- Python 3.8+
- pip, virtualenv, and build essentials
Learn and Code
This section walks you through deploying and optimizing AI models with PyTorch on Arm — from running Transformers and setting up LLM inference to profiling performance with Arm-optimized tools and benchmarks.
Run Hugging Face Transformers on Arm
Deploy and run NLP models using Hugging Face libraries on Arm-based cloud instances. Execute tokenization, model inference, and performance validation.
Deploy an LLM Chatbot with KleidiAI
Set up a scalable inference pipeline for LLMs using PyTorch and Arm KleidiAI. Handle model loading, prompt processing, and real-time response generation on Arm CPUs. Prefer to follow along step-by-step with a video? Go to code-along.
Profile Inference with TorchBench on Arm
Use PyTorch Benchmarks to measure and analyze inference performance. You’ll run latency tests, capture throughput metrics, and identify optimization targets on Arm-based systems.
Arm Ecosystem Dashboard
The Arm Ecosystem Dashboard is your go-to resource for discovering cloud services, tools, and software stacks optimized for Arm. Whether you’re deploying on AWS, Azure, or GCP, this page helps you find the right partners, platforms, and verified solutions to accelerate development on Arm-based infrastructure.
Performance Tools
This section gives you access to tools that help you profile performance, migrate existing apps, automate cloud deployment, and benchmark workloads on Arm-based platforms.
Resources | Decription |
Streamline CLI | Collect and analyze performance data from Arm-based systems. Automate profiling workflows and integrate into CI pipelines. |
Migrate Ease | Identify and adapt workloads for Arm-based cloud environments. Automates analysis and optimization for a smoother migration. |
Runbooks | Step-by-step automation guides for configuring, running, and benchmarking workloads on Arm platforms. |
AWS Q CLI | Quickly launch and benchmark Arm-based instances on AWS using a streamlined command-line interface. |
AWS Perf (APerf) | Access low-level performance counters on Arm CPUs to analyze core behavior, frequency, and workload efficiency. |
What's Next?
- CODE-ALONGS
- DEVELOPER PROGRAM
- COURSES and LABS
- DEVELOPER RESEARCH
- MORE RESOURCES

Run Llama With PyTorch on Arm-Based Infrastructure – On Demand
Watch this hands-on code-along and expert Q&A to learn how to Run Llama With PyTorch on Arm-Based Infrastructure using best practices and open tooling.

Arm Developer Program
Have a technical question about PyTorch on Arm cloud or Arm cloud migration?
Join the Arm Developer Program and connect with a global community of developers and Arm engineers to build better apps on Arm. Get early access to tools, technical content, workshops, and support to help you debug, optimize, and ship your projects.
Course: Optimizing Gen AI on Arm Processors
Learn to optimize generative AI workloads on Arm for mobile, edge and cloud through hands-on labs and lectures.
Arm Developer Labs
Tackle real-world Arm-based cloud challenges with hands-on projects — perfect for building, learning, and prototyping.

Arm Developer Council
Join the Arm Developer Council to share feedback, help shape the tools and platforms you use — and receive a voucher for your time.