65

Swanish Realm

swanishrealm

AI & ML interests

None yet

Organizations

None yet

upvoted 3 papers 7 days ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 8 days ago • 95

BitNet Distillation

Paper • 2510.13998 • Published 9 days ago • 47

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published 8 days ago • 63

upvoted a paper 8 days ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published 15 days ago • 42

upvoted 3 papers 10 days ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published 26 days ago • 44

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 11 days ago • 164

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published 11 days ago • 157

upvoted a paper 11 days ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published 17 days ago • 91

upvoted a paper 15 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 18 days ago • 438

upvoted 2 papers 20 days ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published 23 days ago • 86

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published 24 days ago • 505

upvoted 4 papers 24 days ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published 28 days ago • 132

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published 28 days ago • 121

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published 28 days ago • 176

HunyuanImage 3.0 Technical Report

Paper • 2509.23951 • Published 26 days ago • 21

upvoted a paper 28 days ago

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published 29 days ago • 87

upvoted 2 papers 29 days ago

MAPO: Mixed Advantage Policy Optimization

Paper • 2509.18849 • Published Sep 23 • 26

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published Aug 28 • 75

upvoted 2 papers about 1 month ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 99

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 231