Home

Community

Research Articles

October 17, 2019

Arm Research Summit 2019: Catch Up and Highlights

Reading time 11 minutes

The fourth annual Arm Research Summit, and the first in the US, held in Austin, Texas, created a collaborative space for academics, researchers and industry experts globally to discuss complex computing challenges in the future landscape. The 15-18 September was filled with discussion and networking, enabling delegates to discover new developments, opportunities and connections.

What You Missed

The keynote speakers kicked off each day, each talking about their respective interests and research areas. Topics of computer architecture and post-Moore's law, the Internet of Things, and IT disruptions were discussed, giving each day of the Summit a unique opening. As well as these inspiring keynote speakers and plenty of other presentations from experts in their research fields, the Summit hosted a poster session to give even more delegates the opportunity to present their work. The poster session on the second day saw Amna Shahab win the Best Poster award, her poster titled Exploring New Directions for Reducing Neural Model Training Time. The quality of Amna's work shines through, as she was also awarded 'Best Poster' at the 2018 Summit in Cambridge - we look forward to seeing more exciting research at future events!

Best Presenter was awarded to Tushar Krishna, who spoke in the ML Hardware and Applications session with Enabling Continuous Learning through Neural Network Evolution in Hardware. Jesse Beu from Arm Research, who was poster chair as well as session chair for Tushar's session, gave some insight into why both Amna and Tushar were crowned best in their respective categories:

“What I found especially interesting about both the Best Poster and the Best Presenter winners is that both embraced and reinforced a theme seen throughout the Summit, that as researchers we need to look beyond our abstraction layer and pursue cross-cutting research to maximize impact. Amna’s recognition of the link between hardware efficiency and statistical efficiency for ML training really resonated with attendees because this kind of insight only comes from understanding workloads in addition to the hardware they run on. Similarly, Tushar’s presentation was popular because it pushed the bounds on what it means to be an architect; it showed we shouldn’t limit ourselves to understanding just hardware tradeoffs, we should also look to how the applications plays off these tradeoffs.”

Congratulations to both Amna and Tushar!

In the spirit of collaboration, we've made the presentations across the four days available below. You can also watch recordings of keynote talks, including Rich Wolski, Matthew Scholl, and an overview of Arm Research given by Arm Fellow Stuart Biles.

Day 1

Workshop: Impact of Arm Hardware from an HPC Application Perspective (Present and Future-Looking) - Slides

Day 2

Welcome: Jonathan Beard: Opening Remarks - Video | Slides

Plenary: Stuart Biles: An Introduction to Arm Research - Video | Slides

Keynote: Kunle Olukotun: Computer Architecture in the Post Moore's Law Era*

HPC Applications I

Mohammed Zubair: Optimization of a Solver for Computional Materials and Structures Problems on Arm Processor - Slides

Rached Abdelkhalak: High Performance Stencil Computations for Oil and Gas Applications on Arm Architectures*

Brian Gravelle: Porting HPC Miniapps to Arm - Slides

Shinji Sumimoto: Performance Evaluation of SVE Enabled Arm Processor A64FX using Variable Vector Length - Slides

Biology and Compute

Saugata Ghose: Accelerating Genome Analysis: A Primer on an Ongoing Journey - Slides

Andrew Ellington: Yeastons: Implementing Artificial Neural Networks with Engineered Living Cells - Slides

Marko Vasic: CRN++: Molecular Programming Language - Slides

Renee St. Amant: AI, Ethics, and Policy Considerations when Technology Meets Biology - Slides

System Security

Tyler Hunt: System Security: General Challenges and a Cloud GPU System - Slides

Simha Sethumadhavan: Memory Safety: To Blacklist or Whitelist? - Slides

Shijia Wei: Using Power-Anomalies to Counter Evasive Micro-Architectural Attacks in Embedded Systems - Slides

Dongdong She: Efficient Fuzzing with Neural Networks - Slides

Workshop: Renegotiating the Levels of Abstraction for the Post Moore's Law Era

Matt Horsnell: Abstractions and Specialization - Slides

Sarita Adve: Interface, Communication, Approximation - Slides

Vikram Adve: Uniform Abstractions for Heterogeneous Parallel Systems - Slides

Arrvindh Shriraman: Hardware Construction by Software - Slides

Thierry Moreau: The Past, Present, and Future of Deep Learning Acceleration Stacks - Slides

Riyadh Baghdadi: Compiling Dense and Sparse Neural Networks using Tiramisu - Slides

Jeff Setter: Halide to Hardware - Slides

Naums Mogers: Functional Interface for Performance Portability on Parallel Accelerators - Slides

Aviral Shrivastava: dMAzeRunner - Accelerating Loop Nests on Dataflow Accelerators - Slides

Ramana Radhakrishnan: Arm's Machine Learning Processor - Slides

HPC on Arm

Simon McIntosh-Smith: Arm in HPC: from Prototype to Production - Slides

Andy Warner: Further Updates from the Coalface of Arm in HPC - Slides

JaeHyuk Kwak and Vitali Morozov: Early Experience of the Arm-based High-Performance Computing Eco-system - Slides

Kevin Pedretti: Experiences Scaling a Production Arm Supercomputer to Petaflops and Beyond - Slides

ML Hardware and Applications

Jiang Hu: Machine Learning Techniques for Chip Design Verification and Prediction - Slides

Muhammad Shahbaz: Taurus: An Intelligent Data Plane - Slides

Tianmu Li: 3PXNet: Pruned-Permuted-Packed XNOR Networks for Edge Machine Learning - Slides

Tushar Krishna: Enabling Continuous Learning through Neural Network Evolution in Hardware - Slides

Side Channels and Exploits

Jakub Szefer: Three-Step Models for Timing Channels in Processor Caches and TLBs - Slides

Caroline Trippel: CheckMate: Automated Exploit Program Generation for Hardware Security Verification - Slides

Hovav Shacham: Data Dependent Instruction Timing Channels - Slides

Mohit Tiwari: Cyclone: Detecting Contention-Based Cache Information Leaks Through Cyclic Interference - Slides

HPC Applications II

David Donofrio: Embedding HPC at the Edge - Slides

Aaron Walden: An Optimized Multicolor Point-implicit Solver for Unstructured Grids on the ThunderX2 Processor - Slides

Dirk Pleiter: Enabling HPC Applications for SVE - Slides

Oliver Sheridan-Methven: Numerical Simulations using Approximate Random Numbers: VLA Algorithms for SVE and FP16 Capable Hardware - Slides

Memory Systems I - Specialization and Persistency

Charles Eckert: Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks - Slides

Daichi Fujiki: Duality Cache for Data Parallel Acceleration - Slides

Mohammad Alshboul: Lazy Persistency: A High-Performing and Write-Efficient Software Persistency Technique - Slides

Samira Khan: Rethinking System Support for Persistent Memory - Slides

Emerging Applications

Yuhao Zhu: The Next Quintillion Pixels and Beyond: Architecting Next-Generation Mobile Visual Computing Systems - Slides

Ramesh Govindan: Augmented Vehicular Reality - Slides

Arslan Munir: Design and Evaluation of MPSoC ECU Architectures for Secure and Dependable Automotive Cyber-Physical Systems - Slides

Dayane Reis: A Computing-in-Memory for Searching on Homomorphically Encrypted Data - Slides

Security Meetup: Slides

Day 3

Keynote: Rich Wolski: Devices-as-Services and The Internet as a Platform of Things - Video | Slides

Panel: Rethinking Boundaries through Hardware-Software Co-design for Productive Post-Moore Computing - Slides

New Technologies in Circuits and Systems

Sung Kyu Lim: Heterogeneous 3D ICs: Benefits, Challenges, and Future Prospects - Slides

Haitong Li: On-Chip Memory Technology Design Explorations for Mobile Deep Neural Network Accelerators - Slides

Aseem Sayal: All-Digital Time-Domain CNN Engine Using Bi-Directional Memory Delay Lines for Energy Efficient Edge Computing*

Azad Naeemi: Performance Modeling and Design for Various Spintronic Memory Technologies - Slides

System Architecture

Benjamin Lee: The Computational Sprinting Game - Slides

Mattan Erez: QoS for Accelerator-Rich "Fat" Nodes - Slides

Saptadeep Pal: Waferscale Processors - Slides

Ronald Dreslinski: Introspective Computers - Slides

Modeling

David C: Enzian: An Open Research Computer, In Stores Now - Slides

Matthew Baker: Scaling Full-system Simulation of Arm SVE Processors Using Compilers and Runtime Tool APIs - Slides

Yuetsu Kodama: RIKEN Fugaku Processor Simulator and its Accuracy for Cache and Memory Throughput - Slides

Ben Simner: Systems-level Architecture Modelling - Slides

Workshop: Your Research Enabled by Arm - SoC Labs

John Goodenough: Executive Introduction*

Plout Galatsopoulos: Arm Flexible Access for Research & Arm Research SoC Labs - Slides

Kobus Marneweck: DesignStart Cortex A5 subsystem - Slides

John Goodenough: Arm Education Plans*

Paul Whatmough: Harvard SoC designs - Slides

Liam Dillon: AFA Technical Offerings - Slides

Liam Dillon: Why Bother with Debug - Slides

Mark Willoughby: STFC Europractice and Arm SoC Labs - Slides

Andrew Kahng: The OpenROAD project - Slides

Tutu Ajayi: Arm Coherent Accelerator Interface - Slides

Workshop: Arm HPC User Group - Slides

IoT Technology

Joshua San Miguel: Architecting for Intermittence - Slides

Filip Maksimovic: Power, Ground, and Antenna: Three-Pin Chips for Standards Compatible Wireless Networks*

Irina Alam: Fault Resilient Voltage Scaling of Embedded Memories for Efficient Inference or Operation at the Edge - Slides

I-Ling Yen: Semantic based Data Management and Discovery in the IoT-Edge-Cloud Infrastructure - Slides

Memory Systems II: Coherence and Prefetching

Sarita Adve: Dynamic Coherence Specialization - Slides

Hao Wu: Temporal Prefetching without the Off-chip Metadata - Slides

Paul Gratz and Daniel Jimenez: Speculative, Coordinated Memory Hierarchy Management - Slides

Tony Nowatzki: Stream-based Memory Access Specialization for General Purpose Processors - Slides

Benchmarks and Software

Vijay Janapa Reddi: The Vision Behind MLPerf (mlperf.org): A Community-driven Benchmark Suite for ML Frameworks, ML Accelerators and ML Systems in Cloud and Edge Computing - Slides

Patrick Lavin: Using the Spatter Benchmark Suite to Evaluate SVE Support for Gather/Scatter - Slides

Muhammad Huzaifa: Enabling Architecture Research for Augmented and Virtual Reality - Slides

Jonathan Helmus: Creating a Python Data Science Distribution for Arm with Conda - Slides

Cloud, Edge and IoT

Jan S. Rellermeyer: Kubernetes on the Edge - A Peer-to-Peer Approach - Slides

Babak Zamirai: SIEVE: Speculative Inference on the Edge with Versatile Exportation - Slides

Kamyar Mirzazad Barijough: Distributed Deep Learning Inference On Resource-Constrained IoT Edge Clusters - Slides

Jayashree Mohan: CrashMonkey and Ace : Systematically Testing File-System Crash Consistency - Slides

Cross-stack Optimizations

Vikram Adve: ApproxHPVM: Accuracy-aware Optimizations for Heterogeneous System-on-Chip Processors - Slides

Simone Campanoni: Liberating Threads from Non-Numerical Programs with an Architecture-Compiler Co-Design - Slides

Antonino Tumeo: Data and Model Convergence: a Case for Software Defined Architectures - Slides

Timothy Rogers: Leveraging Cross-Stack Solutions to Improve Massively Parallel Systems*

Memory Systems III: Processing in or near Memory

Mochamad Asri: The Non-Uniform Compute Device (NUCD) System Architecture - Slides

Miquel Moreto: Hardware Acceleration of Sparse Data Rearrangement Near Memory - Slides

Sriseshan Srikanth: Strider: Architectures for Scalable Memory Centric Reduction of Sparse Data Streams - Slides

Jiayi Huang: Active-Routing: Compute on the Way through Near-Data Processing - Slides

Day 4

Keynote: Matthew Scholl: The Upcoming Disruption in IT and what NIST Looks to Prioritize for Cybersecurity - Video | Slides

Panel: A Conversation with the NSF - Partnerships and Collaborations - Slides

Microcontroller Security and Applications

Joseph Yiu: Evolution of Security Solutions for Microcontrollers - Slides

Reza Azarderakhsh: Post-Quantum Cryptography in Arm Cortex M Series - Slides

Jeongsup Lee: A Self-tuning Cortex-M0 IoT Processor Based on Leakage-Ratio Measurement for Energy-Optimal Operation - Slides

Jiecao Yu: Efficient Low-Precision Deep Neural Networks on IoT Microcontrollers*

Accelerators

James C. Hoe: Do more with FPGAs in Computing: from Field Programmable to Programmable - Slides

Arrvindh Shriraman: uIR: An Intermediate Representation for Constructing Hardware Accelerators - Slides

Thierry Moreau: A Hardware-Software Blueprint for Flexible Deep Learning Specialization - Slides

Christopher Batten: Architectural Specialization for Dynamic Task-Parallel Programs - Slides

ML Algorithms

Hadi Esmaeilzadeh: AI for Optimized AI Execution - Slides

Alex Beatson: Accelerating Automated Modeling and Design with Stochastic Optimization, Probabilistic Models, and Neural Networks - Slides

Saavan Patel: Hardware-Aware, Scalable, Combinatorial Optimization on a Boltzmann Machine - Slides

Mohsen Imani: A Framework for Secure Collaborative Learning*

ML Meetup: Slides

Novel Algorithm Update: Slides

* The slides for these sessions are unavailable at the speakers' request.

Arm Research Summit 2020

We are excited to announce that the Arm Research Summit is returning for 2020 and will be held in Europe! Further details will be released soon, so keep an eye out so you can be the first to submit your work and register for the fifth annual Summit!

By Rhiannon Burleigh

Article text

Re-use is only permitted for informational and non-commercial or personal use only.