Skip to main content
The Institution of Engineering and Technology iet.tv
Site name
  • Videos
  • Channels
  • Events
  • Series

Access and Account

Access your personal account

Log in to see your favourites, lists and progress.

IET Login

Access via institution

Not currently connected to any institutions

Connect via

  1. Videos
  2. Video

NorthPole: Neural Inference at the Frontier of Energy, Space, and Time

  • WhatsApp
  • Facebook
  • Email
  • LinkedIn
  • Bluesky
CPD This content can contribute towards your Continuing Professional Development (CPD) as part of the IET's CPD Monitoring scheme.
Event
  • Session
  • Monday, 11 November 2024
  • 10:11 - 10:11
  • Duration: 35 mins
  • Publication date: 19 Nov 2024
  • Location: Conference, Chicago Business School, London, United Kingdom
  • Part of event REACH 2024

About the session

NorthPole, developed at IBM Almaden Research Center, is a neuromorphic computing architecture designed for highly efficient AI and machine learning inference. Inspired by the human brain, NorthPole delivers high throughput and low latency, making it ideal for edge AI and data center applications.

Key features include:

• Efficient Parallel Processing: NorthPole’s distributed cores minimize data movement and maximize local processing, aligning well with the parallel demands of AI inference while conserving power.

• Optimized Memory: Local memory storage reduces data transfer times, essential for handling large model weights  in  small-batch-size inference.

• Energy Efficiency & Scalability: With low power requirements, NorthPole can scale effectively for large language models in both edge and data center environments.

• Low-Latency Inference: By minimizing computational load per core and enhancing inter-core communication, NorthPole achieves fast, real-time responses—boosted by 13TB/s on-chip memory bandwidth.

• Mixed-Precision Operations: Hardware support for quantized matrix multiplications drives unprecedented efficiency without sacrificing accuracy.

Overall, NorthPole offers a powerful solution for accelerated AI inference, addressing challenges in memory, latency, and power consumption—perfect for scenarios requiring high performance and low energy use while delivery low-latency inference.

We will walk through NorthPole architectural highlights, benchmarking results on edge applications as well as Large Language Models. We will also show demos from these applications, and insights into developing a software ecosystem.

Keywords:
  • AI
  • AI for computer architecture
  • IET
  • IET conference
  • REACH 2024
  • artificial intelligence
  • computer architecture
  • computing horizons
  • computing systems

Channels

IT

IT

Communications

Communications

Speaker

  • RA

    Dr Rathinakumar Appuswamy

    IBM Research, USA, Senior Research Scientist

The Institution of Engineering and Technology iet.tv

Address: Futures Place, Kings Way, Stevenage, SG1 2UA

Telephone: +44 (0)33 049 9123

Email:  iet.tv@theiet.org

© 2026 The Institution of Engineering and Technology.

The Institution of Engineering and Technology is registered as a Charity in England & Wales (no 211014) and Scotland (no SC038698). Futures Place, Kings Way, Stevenage, Hertfordshire, SG1 2UA, United Kingdom

  • LinkedIn
  • Instagram
  • YouTube
Privacy statement Cookie Preferences Accessibility About us theiet.org Help

Powered by Cadmore Media

Embed Code

<script type="text/javascript" src="https://play.cadmore.media/js/EMBED.js"></script> <div class="cmpl_iframe_div"> <iframe src="https://play.cadmore.media/Player/6d36bd1c-4f99-4b49-92c2-141841a53feb" scrolling="no" allowtransparency="true" allowautoplay="true" frameborder="0" allow="encrypted-media;autoplay;fullscreen" class="cmpl_iframe" allowfullscreen="" style="overflow: hidden;border: 0px; margin: 0px; height: 100%; width:100%;"></iframe> </div>

Are you sure you want to reset your password?

If so, you will be redirected to the Authentication Service

Title

Prompt