Forcing Flash Attention onto a TPU and Learning the Hard Way
Link: https://archerzhang.me/forcing-flash-attention-onto-a-tpu
Discussion: https://news.ycombinator.com/item?id=47294271
Forcing Flash Attention onto a TPU and Learning the Hard Way
Link: https://archerzhang.me/forcing-flash-attention-onto-a-tpu
Discussion: https://news.ycombinator.com/item?id=47294271
If you have a fediverse account, you can quote this note from your own instance. Search https://social.lansky.name/users/hn50/statuses/116219727490395575 on your instance and quote it. (Note that quoting is not supported in Mastodon.)