view article Article Thinking Outside the Attention Box: Introducing Gated Associative Memory (GAM) By rishiraj • 13 days ago • 5
Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling Paper • 2509.00605 • Published 17 days ago • 42
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One By rishiraj • Jun 26 • 43
view article Article A simple implementation of the attention mechanism in JAX By rishiraj • Mar 3 • 2
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 15 items • Updated Jul 10 • 103
view article Article Indexify: Bringing HuggingFace Models to Real-Time Pipelines for Production Applications By rishiraj • May 31, 2024 • 7
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 639
Rishiraj Collection You can find the best of Rishiraj's work here • 13 items • Updated Dec 19, 2023 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 63