Neer Vana

Neervana

upvoted an article 3 months ago

Article

AutoBench Goes Scientific: Rigorous Validation for a Dynamic, Open-Source LLM Benchmark

Oct 29, 2025 • 4

upvoted an article 5 months ago

Article

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

Aug 20, 2025 • 6

upvoted a paper 9 months ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 93