The Institution of Engineering and Technology iet.tv
Rethinking Precision: The Design Space of Block Floating-Point Formats for the LLM Era

CPD This content can contribute towards your Continuing Professional Development (CPD) as part of the IET's CPD Monitoring scheme.
Presentation
  • Session
  • Monday, 10 November 2025
  • 13:25
  • Duration: 27 mins
  • Publication date: 11 Nov 2025
  • Location: Turing Lecture Theatre, IET London: Savoy Place, London, United Kingdom
  • Part of event REACH 2025

About the session

Partha Maji, Senior Director – AI Hardware Acceleration, Microsoft, UK

As Large Language Models (LLMs) scale to trillions of parameters, traditional floating-point formats are increasingly constrained by memory bandwidth, energy, and storage limits. Emerging block floating-point (BFP) schemes offer a promising alternative, combining fixed-point efficiency with floating-point adaptability through shared exponents and local scaling. This talk explores the design space of BFP formats for both inference and training, focusing on how exponent sharing, mantissa precision, and block granularity interact with accuracy, stability, and hardware cost. Drawing from recent advances such as MX and NVFP variants, we will examine practical design choices (calibration, accumulation, rounding, and mixed-precision fusion) that enable 3–6× compression and improved accelerator utilization without significant accuracy loss. The discussion bridges algorithm and hardware perspectives, outlining co-design principles that make BFP numerics deployable in real systems. Finally, we highlight open research challenges in dynamic range handling, attention sensitivity, and unified training-inference numerics, inviting the community to rethink precision as a continuum, not a constant.
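To make "shared exponents and local scaling" concrete, here is a minimal Python sketch of a toy block floating-point quantizer. It is not taken from the talk and does not reproduce any specific MX or NVFP format: the function names, the signed-integer mantissa layout, the power-of-two scale selection, and the default block size and bit widths are all illustrative assumptions.

```python
import math


def bfp_quantize(values, block_size=32, mantissa_bits=8):
    """Quantize floats to a toy BFP layout (illustrative, not a real spec).

    Each block keeps one shared power-of-two exponent plus one signed
    integer mantissa per element. Returns a list of (exponent, mantissas).
    """
    qmax = 2 ** (mantissa_bits - 1) - 1  # largest mantissa magnitude
    blocks = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        max_abs = max(abs(v) for v in block)
        if max_abs == 0.0:
            blocks.append((0, [0] * len(block)))
            continue
        # Smallest power-of-two scale for which the largest element fits.
        exponent = math.ceil(math.log2(max_abs / qmax))
        scale = 2.0 ** exponent
        # Round to nearest and clamp to guard against log2 rounding error.
        mantissas = [max(-qmax, min(qmax, round(v / scale))) for v in block]
        blocks.append((exponent, mantissas))
    return blocks


def bfp_dequantize(blocks):
    """Reconstruct approximate floats from (exponent, mantissas) blocks."""
    return [m * 2.0 ** e for e, ms in blocks for m in ms]


def bits_per_element(block_size, mantissa_bits, exponent_bits=8):
    """Average storage cost: the shared exponent is amortized over the block."""
    return mantissa_bits + exponent_bits / block_size


blocks = bfp_quantize([0.5, -1.25, 3.0, 0.001], block_size=4)
print(blocks)                   # [(-5, [16, -40, 96, 0])]
print(bfp_dequantize(blocks))   # [0.5, -1.25, 3.0, 0.0]
print(bits_per_element(32, 4))  # 4.25
```

In this sketch the smallest input, 0.001, rounds to zero because the block's scale is set by its largest element, which illustrates why block granularity and dynamic range handling matter. Under these assumed parameters, 4-bit mantissas with an 8-bit exponent shared across 32 elements cost 4.25 bits per element, against 16 bits for a half-precision value.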

Keywords:
  • IET conference
  • REACH 2025
  • Reach Emerging Architectures in Computing Horizons
  • Savoy Place London
  • Sustainable Computer System Design
  • emerging AI hardware
  • memory bandwidth problem

Channels

Communications

IT

Lectures

Speaker

  • Partha Maji

    Senior Director, AI Hardware Acceleration, Microsoft, UK

