Skip to content

Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

License

Notifications You must be signed in to change notification settings

declare-lab/EFLA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

arXiv License: CC-BY

Introduction

This repo is the official repo of Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics. We formulate the online learning update of delta rule as a continuous-time dynamical system and prove that its exact solution is not only attainable but also computable in linear time with full parallelism. By leveraging the rank-1 structure of the dynamics matrix, we directly derive the exact closed-form solution effectively corresponding to the infinite-order Runge–Kutta method.

Authors: Jingdi Lei, Di Zhang, Soujanya Poria

🚀 Quick Start

We release the code to run on sMNIST

  • Train and Evaluate DeltaNet & EFLA
    python mnist.py

Acknowledgement

About

Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages