PAI-Bench: A Comprehensive Benchmark For Physical AI Paper • 2512.01989 • Published 27 days ago • 5
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning Paper • 2512.02425 • Published 26 days ago • 23
RedHatAI/Qwen2.5-VL-72B-Instruct-FP8-dynamic Image-to-Text • 73B • Updated Apr 25 • 9.54k • 15
Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising Paper • 2511.08633 • Published Nov 9 • 53
Adaptive Multi-Agent Response Refinement in Conversational Systems Paper • 2511.08319 • Published Nov 11 • 41
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17 • 49
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs Paper • 2510.09201 • Published Oct 10 • 49
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs Paper • 2510.07499 • Published Oct 8 • 48
ACON: Optimizing Context Compression for Long-horizon LLM Agents Paper • 2510.00615 • Published Oct 1 • 32