Arm Kleidi Libraries for Accelerating Any Framework on Arm
Kleidi libraries, a key component of Arm Kleidi, provide a lightweight suite of open source routines that simplifies integration of the latest Arm architecture for AI and machine learning (ML). With seamless framework integration and optimized support for Arm CPU designs, these libraries offer key advantages for developers:
- Optimized for Arm CPU designs: Built to fully leverage the capabilities of Arm Cortex-A, Cortex-X, and Arm Neoverse CPUs, ensuring maximum performance for ML and computer vision (CV) tasks.
- Low overhead, high efficiency: Minimal resource demands from Kleidi allow for smooth and fast execution of AI workloads, reducing latency and improving overall efficiency.
- Broad framework compatibility: Integrated with popular ML and AI frameworks like PyTorch, ExecuTorch, MediaPipe, ONNX Runtime, LiteRT, and llama.cpp, enabling rapid acceleration of inference models.
Features and Benefits
Flexible Kernel Assortment
Kleidi libraries offer flexible kernels that enhance AI on frameworks, improving capability and accuracy for AI, accelerating performance, and reducing memory overhead.
Low Overhead for Developers
The KleidiAI library directly integrates into key AI frameworks—MediaPipe (via XNNPACK), llama.cpp, PyTorch (via ATen), ONNX Runtime and MNN—giving developers automatic performance benefits with no added overhead.
Unleashing Mass-Market AI Performance
As Kleidi libraries help optimize AI at the framework level, benefitting hundreds of workloads on billions of Arm-based devices. Developers can run models on Kleidi-optimized frameworks to achieve top performance.
Enabling Generative and Agentic AI at Scale
Kleidi simplifies and accelerates deployment of demanding AI inference workloads on Arm. The KleidiAI library delivers best-in-class performance for GenAI and agentic AI deployed from cloud datacenters to constrained devices at the edge.
Optimizing AI for Everyone—Everywhere
Kleidi enables easy optimization across Arm Neoverse, Arm Cortex-X, and Cortex-A CPUs. Its performance libraries leverage technologies like Arm Neon, Arm Scalable Vector Extensions (SVE), and Arm Scalable Matrix Extensions (SME), to accelerate AI functions on Arm architecture.
Concise, Efficient, and Light
The libraries are lightweight and concise, with no dependencies or binary release, and avoid duplicating memory allocation or multithreading implementation—making them easy to integrate into framework codebases efficiently.
Get Started With Arm Kleidi Libraries
Access these developer enablement technologies and supporting resources to accelerate the performance of your AI workloads.