Entropy Production in Non-Equilibrium Neural Networks
An exercise in cybernetics
An exercise in cybernetics
A non-equilibrium statistical mechanics perspective on transformers
A statistical mechanics perspective on transformers
How far can we push the idea of transformers as physical systems?
Can we model attention as the collective response of a statistical-mechanical system?