QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 4 days ago • 153
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 9 days ago • 26
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention Paper • 2510.04212 • Published 13 days ago • 22
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models Paper • 2510.01623 • Published 16 days ago • 7
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published 17 days ago • 478
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published 22 days ago • 118
SWE-QA: Can Language Models Answer Repository-level Code Questions? Paper • 2509.14635 • Published 30 days ago • 36
view post Post 4189 Quietly launched the largest Open source Free LateX Dataset -https://huggingface.co/datasets/dalle2/Bibby-AI-Latex-Tool-Overleaf-Alternative See translation 1 reply · 👍 5 5 + Reply
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them Paper • 2509.21117 • Published 23 days ago • 29