BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper • 2603.04918 • Published 27 days ago • 56
GEditBench v2: A Human-Aligned Benchmark for General Image Editing Paper • 2603.28547 • Published 2 days ago • 31
GEditBench v2: A Human-Aligned Benchmark for General Image Editing Paper • 2603.28547 • Published 2 days ago • 31
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper • 2512.01816 • Published Dec 1, 2025 • 94