
GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
twitter.com33 pointsby laxmena11 comments

twitter.com · 485 points · 247 comments
github.com · 3 points · 0 comments
roman.pt · 1353 points · 256 comments
iroh.computer · 1276 points · 392 comments
1225 points · 517 comments
tinywind.io · 915 points · 162 comments