Highest inference performance and efficiency

Getting started

Based on a new, class-leading architecture, the Arm Ethos-N77 processor's optimized design enables new features, enhances user experience, and delivers innovative applications for a wide array of market segments, including mobile, IoT, embedded, automotive, and infrastructure. It provides up to a 64x uplift in efficiency compared to CPUs, GPUs, and DSPs through efficient convolution, sparsity, and compression.

Key features

Highest Performance

Delivers up to 4 TOPS of performance (2048 8-bit MACs), scaling to hundreds of TOPS in multicore deployments.
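The 4 TOPS figure follows from the MAC count if each multiply-accumulate is counted as two operations (one multiply, one add), which is the usual convention for peak-throughput numbers. The sketch below works through that arithmetic at the 1 GHz clock quoted in the specifications. The function name `peak_tops` is hypothetical, and the mesh figure assumes ideal linear scaling across 64 NPUs, which real workloads are unlikely to reach.

```python
def peak_tops(macs_per_cycle: int, clock_hz: float) -> float:
    """Peak tera-operations per second, counting each MAC as 2 ops."""
    return macs_per_cycle * 2 * clock_hz / 1e12


# Single Ethos-N77: 2048 MACs per cycle at 1 GHz (figures from this page).
single_npu = peak_tops(2048, 1e9)

# Assumption: ideal linear scaling across a 64-NPU mesh.
mesh = 64 * single_npu

print(f"{single_npu:.3f} TOPS per NPU, {mesh:.1f} TOPS for a 64-NPU mesh")
```

Under these assumptions a single NPU peaks just above 4 TOPS, and a full mesh lands in the hundreds of TOPS range that the text mentions.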

Optimized Design

Up to 225% convolution performance uplift using Winograd on 3x3 kernels, delivering up to 90% MAC utilization. 
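To illustrate why Winograd helps on 3x3 kernels, here is a minimal sketch of the one-dimensional F(2,3) transform, which produces two outputs of a 3-tap filter with 4 multiplies instead of 6. Nesting it in two dimensions gives F(2x2, 3x3), which needs 16 multiplies per 2x2 output tile instead of 36, a 2.25x reduction. This page does not say which Winograd variant the Ethos-N77 implements, so treating it as F(2x2, 3x3) is an assumption, and the function names are hypothetical.

```python
from fractions import Fraction


def direct_f23(d, g):
    """Two outputs of a 3-tap filter computed directly (6 multiplies)."""
    return [d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
            d[1] * g[0] + d[2] * g[1] + d[3] * g[2]]


def winograd_f23(d, g):
    """The same two outputs via Winograd F(2,3) (4 data multiplies)."""
    half = Fraction(1, 2)
    # Filter transform: depends only on the kernel, so it can be precomputed.
    u0 = g[0]
    u1 = (g[0] + g[1] + g[2]) * half
    u2 = (g[0] - g[1] + g[2]) * half
    u3 = g[2]
    # Four element-wise multiplies on the transformed input.
    m1 = (d[0] - d[2]) * u0
    m2 = (d[1] + d[2]) * u1
    m3 = (d[2] - d[1]) * u2
    m4 = (d[1] - d[3]) * u3
    # Output transform: additions only.
    return [m1 + m2 + m3, m2 - m3 - m4]


# Multiply counts for one 2x2 output tile of a 3x3 convolution.
direct_muls = 2 * 2 * 3 * 3   # 4 outputs x 9 taps
winograd_muls = 4 * 4         # one multiply per element of the 4x4 tile
speedup = direct_muls / winograd_muls

assert winograd_f23([1, 2, 3, 4], [5, 6, 7]) == direct_f23([1, 2, 3, 4], [5, 6, 7])
print(f"multiply reduction: {speedup}x")
```

The savings come from trading multiplies for cheaper additions in the input and output transforms, so the achieved uplift depends on how well the hardware hides that transform cost.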

Highest Efficiency

Achieves 5 TOPS/W through internally distributed SRAM, which stores data close to the compute elements to save power and reduce DRAM access.

Futureproof

Supports a wide range of existing Machine Learning (ML) operations and future innovations through firmware updates and compiler technology.

Arm Ethos-N Block Diagram

Ethos-N77 premium ML inference processor contains 16 compute engines

Key benefits

  • Supports a variety of popular neural networks, including CNNs and RNNs, for classification, object detection, image enhancements, speech recognition and natural language understanding
  • Reduces system memory bandwidth by 1.5-3x through clustering, sparsity, and workload tiling, with lossless compression for weights and activations on select networks
  • Maximizes the number of parameters stored on-chip by storing compressed weights and activations in local SRAM and decompressing them on the fly
  • Leverages sparse power gating techniques to reduce power by up to 50%
  • Improves performance and extends battery life through intelligent data management techniques to minimize memory movement with up to 90% of accesses on chip
  • Supports TrustZone system security to safeguard sensitive data with support for secure and non-secure modes

Specifications

Key features
  Performance (at 1GHz)               4 TOP/s
  MACs (8x8)                          2048
  Data types                          Int-8 and Int-16
  Network support                     CNN and RNN
  Efficient convolution               Winograd support
  Sparsity                            Yes
  Secure mode                         TEE or SEE
  Multicore capability                8 NPUs in a cluster, 64 NPUs in a mesh

Memory system
  Embedded SRAM                       1-4 MB
  Bandwidth reduction                 Extended compression technology, layer/operator fusion
  Main interface                      1xAXI4 (128-bit), ACE-5 Lite

Development platform
  Neural frameworks                   TensorFlow, TensorFlow Lite, Caffe2, PyTorch, MXNet, ONNX
  Neural operator API                 Arm NN, AndroidNN
  Software components                 Arm NN, neural compiler, driver and support library
  Debug and profile                   Layer-by-layer visibility
  Evaluation and early prototyping    Arm Juno FPGA systems and cycle models


Ethos-N comparison table

Rows with a single value apply to all three processors.

                                      Ethos-N77    Ethos-N57    Ethos-N37
Key features
  Performance (at 1GHz)               4 TOP/s      2 TOP/s      1 TOP/s
  MAC/Cycle (8x8)                     2048         1024         512
  Data types                          Int-8 and Int-16
  Network support                     CNN and RNN
  Efficient convolution               Winograd support
  Sparsity                            Yes
  Secure mode                         TEE or SEE
  Multicore capability                8 NPUs in a cluster, 64 NPUs in a mesh

Memory system
  Embedded SRAM                       1-4 MB       512 KB       512 KB
  Bandwidth reduction                 Extended compression technology, layer/operator fusion, clustering, and workload tiling
  Main interface                      1xAXI4 (128-bit), ACE-5 Lite

Development platform
  Neural frameworks                   TensorFlow, TensorFlow Lite, Caffe2, PyTorch, MXNet, ONNX
  Neural operator API                 Arm NN, AndroidNN
  Software components                 Arm NN, neural compiler, driver and support library
  Debug and profile                   Layer-by-layer visibility
  Evaluation and early prototyping    Arm Juno FPGA systems and cycle models

Get support

Arm support

Arm training courses and on-site system-design advisory services enable licensees to efficiently integrate the Ethos-N77 processor into their design to realize maximum system performance with lowest risk and fastest time-to-market.


