Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, volume 383, issue 2288

AxLaM: energy-efficient accelerator design for language models for edge computing

Tom Glint 1
Bhumika Mittal 2
Santripta Sharma 2
Abdul Qadir Ronak 3
Abhinav Goud 3
Neerja Kasture 3
Zaqi Momin 3
Aravind Krishna 3
Joycee Mekie 3
Publication type: Journal Article
Publication date: 2025-01-16
Scimago: Q1
WoS: Q1
SJR: 0.870
CiteScore: 9.3
Impact factor: 4.3
ISSN: 1364-503X, 1471-2962
Abstract

Modern language models such as bidirectional encoder representations from transformers (BERT) have revolutionized natural language processing (NLP) tasks but are computationally intensive, limiting their deployment on edge devices. This paper presents an energy-efficient accelerator design tailored for encoder-based language models, enabling their integration into mobile and edge computing environments. AxLaM, a data-flow-aware hardware accelerator for language models inspired by Simba, uses approximate fixed-point POSIT-based multipliers and high-bandwidth memory (HBM) to achieve significant improvements in computational efficiency, power consumption, area and latency over the hardware-realized scalable accelerator Simba. Compared to Simba, AxLaM achieves a ninefold energy reduction, a 58% area reduction and a 1.2× latency improvement, making it suitable for deployment in edge devices. The energy efficiency of AxLaM is 1.8 TOPS/W, 65% higher than that of FACT, which requires pre-processing of the language model before implementing it on the hardware.

This article is part of the theme issue ‘Emerging technologies for future secure computing platforms’.
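The abstract states that AxLaM relies on approximate fixed-point POSIT-based multipliers to cut energy, but does not detail the approximation scheme. As a purely illustrative sketch, the snippet below shows a generic truncation-based approximate fixed-point multiply, a common way such accelerators trade a small amount of accuracy for energy; every function name and parameter here is hypothetical and not taken from the paper.

```python
# Illustrative only: a truncation-based approximate fixed-point multiplier,
# sketching the kind of accuracy/energy trade-off the abstract describes.
# All names and bit widths are assumptions, not AxLaM's actual design.

def to_fixed(x: float, frac_bits: int = 8) -> int:
    """Quantize a float to a signed fixed-point integer (Q-format)."""
    return round(x * (1 << frac_bits))

def to_float(x: int, frac_bits: int = 8) -> float:
    """Convert a fixed-point integer back to a float."""
    return x / (1 << frac_bits)

def approx_fixed_mul(a: int, b: int, frac_bits: int = 8,
                     trunc_bits: int = 4) -> int:
    """Multiply two fixed-point values, zeroing the `trunc_bits` lowest
    bits of the double-width product before renormalizing. In hardware,
    dropping low partial products saves adder area and energy."""
    full = a * b                       # exact double-width product
    full &= ~((1 << trunc_bits) - 1)   # approximate: discard low bits
    return full >> frac_bits           # shift back to Q(frac_bits)

# Quantization plus truncation keeps the result close to the exact
# float product for typical activation/weight magnitudes.
x, y = 1.51, 2.27
approx = to_float(approx_fixed_mul(to_fixed(x), to_fixed(y)))
```

In an actual design, the truncated multiplier would sit inside each processing element's multiply-accumulate unit, so the per-operation savings compound across the whole accelerator array.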
