A Programmable In-Memory Computing Accelerator for Energy-Efficient DNN Inference
Abstract:

This article presents a programmable in-memory computing accelerator (PIMCA) for low-precision (1–2 b) deep neural network (DNN) inference. The custom 10T1C bitcell in the in-memory computing (IMC) macro has four additional transistors and one capacitor to perform capacitive-coupling-based multiply-and-accumulate (MAC) operations in the analog-mixed-signal (AMS) domain. A macro containing 256×128 bitcells can activate all rows simultaneously and can therefore perform a vector-matrix multiplication (VMM) in a single cycle. PIMCA integrates 108 such IMC static random-access memory (SRAM) macros with a custom six-stage pipeline and a custom instruction set architecture (ISA) for instruction-level programmability. The results of the IMC macros are fed to a single-instruction-multiple-data (SIMD) processor for the remaining computations, such as partial-sum accumulation, max-pooling, and activation functions. To use the IMC and SIMD datapaths effectively, we customize the ISA, notably by adding hardware loop support, which reduces the program size by up to 73%. The accelerator is prototyped in a 28-nm technology and integrates a total of 3.4 Mb of IMC SRAM and 1.5 Mb of off-the-shelf activation SRAM, making it one of the largest IMC accelerators to date. It achieves a system-level energy efficiency of 437 TOPS/W and a peak throughput of 49 TOPS at a 42-MHz clock frequency and a 1-V supply for VGG9 and ResNet-18 on the CIFAR-10 dataset.
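To make the single-cycle VMM concrete, below is a minimal functional sketch in Python/NumPy of one IMC macro cycle. It assumes signed-binary (+1/−1) activation and weight encodings, models the capacitive-coupling charge sharing on each column as a normalized dot product, and quantizes the column voltage with an idealized uniform ADC. The ADC resolution, normalization, and all identifiers are illustrative assumptions, not details taken from the paper; the real macro is an analog circuit, and this only mimics its input-output behavior.

```python
import numpy as np

ROWS, COLS = 256, 128   # macro dimensions from the abstract
ADC_BITS = 5            # assumed ADC resolution (illustrative, not from the paper)

def imc_macro_vmm(x, W, adc_bits=ADC_BITS):
    """Functional model of one macro cycle: all 256 rows are activated
    at once, so the full vector-matrix multiplication completes in a
    single cycle, producing 128 partial sums in parallel.

    x : (256,) vector of +1/-1 activations (assumed 1-b encoding)
    W : (256, 128) matrix of +1/-1 weights stored in the bitcells
    """
    # Capacitive coupling: each 10T1C bitcell contributes x[i]*W[i, j]
    # to its column capacitor; charge sharing averages the contributions,
    # which is proportional to the ideal dot product.
    analog = (x @ W) / ROWS                     # column voltage, in [-1, 1]

    # Idealized ADC: uniform quantization of the column voltage.
    levels = 2 ** adc_bits
    codes = np.clip(np.round((analog + 1) / 2 * (levels - 1)), 0, levels - 1)

    # Rescale codes back to MAC-value units for the digital SIMD stage.
    return (codes / (levels - 1) * 2 - 1) * ROWS

# Example: one macro cycle on random binary activations and weights.
rng = np.random.default_rng(0)
x = rng.choice([-1, 1], size=ROWS)
W = rng.choice([-1, 1], size=(ROWS, COLS))
print(imc_macro_vmm(x, W)[:4])   # first 4 of the 128 column outputs
```

In a full layer, the SIMD processor described above would consume these per-macro outputs for partial-sum accumulation across macros, followed by max-pooling and activation functions.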