Tensor Network Attention

이 글은 텐서 네트워크 시각화를 통해 다양한 어텐션 메커니즘을 분석한다. 텐서 네트워크는 복잡한 선형대수 연산을 그래프로 표현해 구조를 명확히 보여주며, 이를 통해 기존 어텐션 변형들이 어떤 빠른 커널에 대응 가능한지 쉽게 파악할 수 있다. 특히, 멀티헤드 어텐션(MHA), 멀티쿼리 어텐션(MQA), 토킹헤즈 어텐션 등 주요 어텐션 변형을 텐서 네트워크 관점에서 설명하며, KV 캐시 압축과 헤드 간 상호작용 구조를 직관적으로 이해할 수 있게 한다. 이 접근법은 어텐션 구조의 본질과 최적화 가능성을 탐구하는 AI 연구자 및 개발자에게 유용하다.

https://mainlymatmul.com/blog/tensor-network-attention/

#tensornetwork #attention #multiheadattention #multiqueryattention #transformer

Tensor Network Attention

Using tensor network notation to understand multi-head attention, MQA, talking-heads attention, and DeepSeek's MLA.

International Workshop Led by Young Researchers "Recent Developments and Challenges in Tensor Networks: Algorithms, Applications to science, and Rigorous theories"
2025/07/28 --- 2025/08/08
Yukawa Institute for Theoretical Physics, Kyoto University

https://www.yukawa.kyoto-u.ac.jp/seminar/s53429?lang=en-GB

#TensorNetwork

若手国際ワークショップ "Recent Developments and Challenges in Tensor Networks: Algorithms, Applications to science, and Rigorous theories" YITP-I-25-02 - 京都大学基礎物理学研究所

京都大学基礎物理学研究所 -
Researchers show classical computers can keep up with, and surpass, their quantum counterparts

Quantum computing has been hailed as a technology that can outperform classical computing in both speed and memory usage, potentially opening the way to making predictions of physical phenomena not previously possible.

Phys.org

This paper from @jenseisert and colleagues sounds interesting!

"The incorporation of automatic differentiation in tensor networks algorithms has ultimately enabled a new, flexible way for variational simulation of ground states and excited states. In this work, we review the state of the art of the variational iPEPS framework. We present and explain the functioning of an efficient, comprehensive and general tensor network library for the simulation of infinite two-dimensional systems using iPEPS, with support for flexible unit cells and different lattice geometries."

https://scirate.com/arxiv/2308.12358

#quantum #TensorNetwork #computational #physics #AutomaticDifferentiation #iPEPS

variPEPS -- a versatile tensor network library for variational ground state simulations in two spatial dimensions

Tensor networks capture large classes of ground states of phases of quantum matter faithfully and efficiently. Their manipulation and contraction has remained a challenge over the years, however. For most of the history, ground state simulations of two-dimensional quantum lattice systems using (infinite) projected entangled pair states have relied on what is called a time-evolving block decimation. In recent years, multiple proposals for the variational optimization of the quantum state have been put forward, overcoming accuracy and convergence problems of previously known methods. The incorporation of automatic differentiation in tensor networks algorithms has ultimately enabled a new, flexible way for variational simulation of ground states and excited states. In this work, we review the state of the art of the variational iPEPS framework. We present and explain the functioning of an efficient, comprehensive and general tensor network library for the simulation of infinite two-dimensional systems using iPEPS, with support for flexible unit cells and different lattice geometries.

SciRate