Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

new activity 2 days ago

Qwen/Qwen3-32B:Will Qwen3-32B be updated just like Qwen3-235B-A22B?

liked a model 2 days ago

Qwen/Qwen3-4B-Instruct-2507

new activity 2 days ago

openbmb/MiniCPM-V-4:License?

upvoted 4 papers 7 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 15 days ago • 273

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 209

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published 20 days ago • 122

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published 10 days ago • 22

upvoted 2 papers 9 days ago

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published 11 days ago • 31

Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful

Paper • 2507.07101 • Published 30 days ago • 3

upvoted a paper 21 days ago

Voxtral

Paper • 2507.13264 • Published 22 days ago • 25

upvoted 2 papers 24 days ago

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper • 2507.04404 • Published Jul 6 • 21

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Paper • 2507.08771 • Published 28 days ago • 9

upvoted a collection 24 days ago

MetaStone-S1

The open-source model of MetaStone-S1. • 4 items • Updated 9 days ago • 9

upvoted a paper 25 days ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 104

upvoted a collection 27 days ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 12 items • Updated 3 days ago • 69

upvoted 2 papers 27 days ago

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

Paper • 2507.07955 • Published 29 days ago • 22

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 29 days ago • 32

upvoted a collection about 1 month ago

IFBench

Datasets for IFBench benchmark and paper! • 3 items • Updated Jul 3 • 5

upvoted a paper about 1 month ago

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Paper • 2507.02652 • Published Jul 3 • 24

upvoted a collection about 1 month ago

Cogito v1 Preview

5 items • Updated Apr 8 • 116

upvoted 3 papers about 1 month ago

Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening

Paper • 2506.02355 • Published Jun 3 • 1

Bridging Offline and Online Reinforcement Learning for LLMs

Paper • 2506.21495 • Published Jun 26 • 2

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Paper • 2507.01001 • Published Jul 1 • 44