Renormalization Group

Transformer Attention as an Implicit Mixture of Effective Energy-Based Models

Where does the energy function behind the Transformer's attention mechanism come from?