Artificial Intelligence

Spin-Model Transformers featured image

Spin-Model Transformers

A non-equilibrium statistical mechanics perspective on transformers

avatar
Matthias Bal
Transformers Are Secretly Collectives of Spin Systems featured image

Transformers Are Secretly Collectives of Spin Systems

A statistical mechanics perspective on transformers

avatar
Matthias Bal
Transformers from Spin Models: Approximate Free Energy Minimization featured image

Transformers from Spin Models: Approximate Free Energy Minimization

How far can we push the idea of transformers as physical systems?

avatar
Matthias Bal
Deep Implicit Attention: A Mean-Field Theory Perspective on Attention Mechanisms featured image

Deep Implicit Attention: A Mean-Field Theory Perspective on Attention Mechanisms

Can we model attention as the collective response of a statistical-mechanical system?

avatar
Matthias Bal
Attention as Energy Minimization: Visualizing Energy Landscapes featured image

Attention as Energy Minimization: Visualizing Energy Landscapes

Can we swap softmax attention for energy-based attention?

avatar
Matthias Bal
Transformer Attention as an Implicit Mixture of Effective Energy-Based Models featured image

Transformer Attention as an Implicit Mixture of Effective Energy-Based Models

Where does the energy function behind Transformers' attention mechanism come from?

avatar
Matthias Bal
An Energy-Based Perspective on Attention Mechanisms in Transformers featured image

An Energy-Based Perspective on Attention Mechanisms in Transformers

Can an energy-based perspective shed light on training and improving Transformer models?

avatar
Matthias Bal