Watch#8: Extreme Teachers and Mixing Tokens, not Experts
A general trend of making LLM computation more efficient
Foreword:
I have received a few pledges recently and I want to say thank you for that! It’s reassuring to see the support. Both in pledges and messages.
I’m still not entirely sure what kind of additional content to produce for a future paid sub, so if you have ideas or strong opinions about this, please let me know!
Have a great day all,
Pascal
Keep reading with a 7-day free trial
Subscribe to LLM Watch to keep reading this post and get 7 days of free access to the full post archives.