- Duration: 1 hr 22 mins
- Publication date: 30 May 2025
Abstract
Forty years ago, when the task of getting machines to transcribe human speech was first investigated, even the largest mainframe computers had a fraction of the power of today's smartphones. As a result, techniques for Automatic Speech Recognition (ASR) were developed that did not rely on massive computing power. Now that computing power is cheap and fast, computers are better than humans at recognising human speech. What computers are still not so good at, however, is understanding speech, because humans unconsciously apply context that is not available to a computer. For this reason, some of the techniques developed in the early days of ASR have become relevant again.
Good as cloud-based ASR services are for searching phone calls for keywords, they fall short of the requirements for full operational call transcription. However, by applying AI and some of the techniques developed at NPL, it is possible to add context to ASR, transforming speech recognition into speech understanding.