Tensor Manipulation Unit (TMU): Reconfigurable, Near-Memory, High-Throughput AI

https://arxiv.org/abs/2506.14364

#HackerNews #TensorManipulationUnit #TMU #AI #HighThroughput #NearMemory #Reconfigurable

Tensor Manipulation Unit (TMU): Reconfigurable, Near-Memory Tensor Manipulation for High-Throughput AI SoC

While recent advances in AI SoC design have focused heavily on accelerating tensor computation, the equally critical task of tensor manipulation, centered on high-volume data movement with minimal computation, remains underexplored. This work addresses that gap by introducing the Tensor Manipulation Unit (TMU), a reconfigurable, near-memory hardware block designed to efficiently execute data-movement-intensive operators. TMU manipulates long datastreams in a memory-to-memory fashion using a RISC-inspired execution model and a unified addressing abstraction, enabling broad support for both coarse- and fine-grained tensor transformations. Integrated alongside a TPU within a high-throughput AI SoC, the TMU leverages double buffering and output forwarding to improve pipeline utilization. Fabricated in SMIC 40nm technology, the TMU occupies only 0.019 mm² while supporting over 10 representative tensor manipulation operators. Benchmarking shows that TMU alone achieves up to 1413× and 8.54× operator-level latency reduction compared to ARM A72 and NVIDIA Jetson TX2, respectively. When integrated with the in-house TPU, the complete system achieves a 34.6% reduction in end-to-end inference latency, demonstrating the effectiveness and scalability of reconfigurable tensor manipulation in modern AI SoCs.
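To make the idea of memory-to-memory tensor manipulation through a shared addressing scheme more concrete, here is a minimal Python sketch. It is not the TMU's actual design or instruction set: the function names `affine_addresses` and `move`, and the choice of plain affine (stride-based) addressing, are assumptions made purely for illustration. It shows how one generic copy loop can express a transpose just by changing the read and write strides.

```python
from itertools import product


def affine_addresses(shape, strides, base=0):
    """Yield one flat address per index of `shape`, visiting indices in row-major order.

    Hypothetical helper: address = base + sum(index[k] * strides[k]).
    """
    for idx in product(*(range(n) for n in shape)):
        yield base + sum(i * s for i, s in zip(idx, strides))


def move(src, shape, src_strides, dst_strides):
    """Copy a flat buffer into a new flat buffer, element by element.

    Reads follow `src_strides` and writes follow `dst_strides` over the same
    iteration space, so the operator is defined entirely by the two stride tuples.
    """
    dst = [None] * len(src)
    reads = affine_addresses(shape, src_strides)
    writes = affine_addresses(shape, dst_strides)
    for r, w in zip(reads, writes):
        dst[w] = src[r]
    return dst


# A 2x3 row-major matrix [[0, 1, 2], [3, 4, 5]] stored as a flat buffer.
src = [0, 1, 2, 3, 4, 5]

# Iterate over the output index (j, i) with shape (3, 2):
# read src[i][j] at address j*1 + i*3, write dst[j][i] at address j*2 + i*1.
transposed = move(src, shape=(3, 2), src_strides=(1, 3), dst_strides=(2, 1))
print(transposed)  # flat 3x2 buffer [[0, 3], [1, 4], [2, 5]]
```

Swapping in other stride tuples gives other pure data-movement operators (for example, a plain copy uses identical read and write strides), which is the sense in which a single addressing abstraction can cover many transformations.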

arXiv.org

In January, Claire chatted to Prof. Kris Dorsey about wearable soft robots, healthcare and rehabilitation. Kris is an Associate Professor at Northeastern University, researching #reconfigurable and active #soft sensors for medical and #rehabilitation devices.

https://www.robottalk.org/2024/01/05/episode-67-kris-dorsey/ #Robots #Robotics #SoftRobotics

Episode 67 – Kris Dorsey - Robot Talk

Claire chatted to Kris Dorsey from Northeastern University all about wearable soft robots, healthcare and rehabilitation.

Robot Talk - The podcast exploring the exciting world of robotics, artificial intelligence and autonomous machines.

Meet Zahra Ebrahimi, one of our #SoftwareCampus participants. She is a doctoral researcher at @tudresden working on #Approximate and #Reconfigurable #Computing, in cooperation with #Huawei. Zahra explains:
“I design circuits such as a multiplier that returns 3.99 for 2×2. The result is not entirely exact, but the answers arrive faster and consume less power.”

Learn more about Zahra here:

https://softwarecampus.de/teilnehmer/zarah-ebrahimi-mamaghani/

Zahra Ebrahimi - Software Campus

My name is Zahra Ebrahimi (personal website). I earned my master's degree in “Computer Architecture” in 2016 from Sharif University of Technology in Iran. In 2018 I joined the Center for Advancing Electronics Dresden (cfaed) at Technische Universität Dresden as a doctoral student.

Software Campus

Vanguard Program: Sandia Labs partners with Next Silicon & Penguin Solutions to deliver ‘first of its kind’ runtime #reconfigurable #accelerator technology for its next Advanced Architecture Prototype System (AAPS)

https://www.sandia.gov/research/2023/11/09/sandia-partners-with-nextsilicon-and-penguin-solutions-to-deliver-first-of-its-kind-runtime-reconfigurable-accelerator-technology/

#HPC #AI via @glennklockwood

Sandia partners with NextSilicon and Penguin Solutions to deliver ‘first of its kind’ runtime reconfigurable accelerator technology

Sandia National Laboratories, leading a tri-lab consortium with Lawrence Livermore National Laboratory (LLNL) and Los Alamos National Laboratory (LANL), announces a partnership with NextSilicon Inc. and Penguin Solutions to deliver the next Advanced Architecture Prototype System (AAPS). These pro...

Research