Benjamin Minixhofer
benjamin
AI & ML interests
NLP, Efficiency, Machine Learning in Rust, Multilinguality, Transfer Learning
Recent Activity
upvoted
a
paper
2 days ago
Inference-Time Hyper-Scaling with KV Cache Compression
published
a model
6 days ago
benjamin/Qwen3-4B-Base-flax
updated
a model
11 days ago
benjamin/Qwen3-4B-Base-flax
Organizations
Collections
1
models
57

benjamin/Qwen3-4B-Base-flax
Text Generation
•
Updated
•
24

benjamin/Llama3-2-3B-IT-Byte
Updated
•
3
•
1

benjamin/Gemma2-2B-IT-Byte
Updated
•
9
•
1

benjamin/Qwen2.5-7B-Instruct-flax
Text Generation
•
Updated
•
28

benjamin/Gemma2-2B-Distilled-Math
Text Generation
•
Updated
•
16

benjamin/Gemma2-2B-IT-with-Qwen2-Tokenizer
Text Generation
•
Updated
•
12

benjamin/Llama3.2-3B-IT-with-Qwen2-Tokenizer
Text Generation
•
Updated
•
8

benjamin/OpenMath2-Llama3.1-8B-flax
Text Generation
•
Updated
•
554

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-gpt2-from-focus
Text Generation
•
Updated
•
11

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-starcoder-from-focus
Text Generation
•
Updated
•
11