diff --git a/README.md b/README.md index 0ec61038..77eb0d63 100644 --- a/README.md +++ b/README.md @@ -25,7 +25,7 @@ ThunderKittens is built around three key principles: Flash Attention 3, but with kittens! -Join us on Discord to get involved: [discord.gg/gpumode](discord.gg/gpumode)!! Look for the ThunderKittens channel. +Join us on Discord to get involved: [ThunderKittens channel @ GPU Mode Discord](https://discord.com/channels/1189498204333543425/1300872762163728550)!!!! Here is the invite link to GPU mode: https://discord.gg/gpumode ThunderKittens is built from the hardware up -- we do what the silicon tells us. And modern GPUs tell us that they want to work with fairly small tiles of data. A GPU is not really a 1000x1000 matrix multiply machine (even if it is often used as such); it’s a manycore processor where each core can efficiently run ~16x16 matrix multiplies. Consequently, ThunderKittens is built around manipulating tiles of data no smaller than 16x16 values. @@ -245,8 +245,10 @@ Learn more about ThunderKittens and how GPUs work by checking out our blogs: - [Easier, Better, Faster, Cuter Blogpost, Oct. 2024](https://hazyresearch.stanford.edu/blog/2024-10-29-tk2) - [GPUs Go Brrr Blogpost, May 2024](https://hazyresearch.stanford.edu/blog/2024-05-12-tk) - [ThunderKittens: Bringing fp8 to theaters near you, Nov 2024](https://hazyresearch.stanford.edu/blog/2024-11-27-tk-fp8) +- [ThunderMittens For Your ThunderKittens, Nov 2024](https://hazyresearch.stanford.edu/blog/2024-11-28-tk-mlx) Please check out our paper for even more details: [paper](https://arxiv.org/abs/2410.20399) -Join us and get involved at the GPU MODE Discord: [discord.gg/gpumode](discord.gg/gpumode). Look for the ThunderKittens channel. +Join us and get involved at the [ThunderKittens channel @ GPU Mode Discord](https://discord.com/channels/1189498204333543425/1300872762163728550)!!!! Here is the invite link to GPU mode: https://discord.gg/gpumode +