Lancer

lancer001010

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 1 hour ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 26 days ago • 143

upvoted 2 papers about 11 hours ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published 4 days ago • 23

Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory

Paper • 2601.16296 • Published 10 days ago • 28

upvoted a paper 1 day ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published 11 days ago • 181

upvoted a collection 3 days ago

Qwen3-ASR

Collection • 4 items • Updated 4 days ago • 39

upvoted 2 papers 6 days ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published 11 days ago • 20

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published 11 days ago • 89

upvoted a paper 21 days ago

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published 24 days ago • 165

upvoted a paper 22 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 24 days ago • 212

upvoted a paper 24 days ago

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published 26 days ago • 42

upvoted a paper about 2 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 147

upvoted an article 2 months ago

Continuous batching from first principles

Article • Nov 25, 2025 • 315

upvoted an article 3 months ago

Supercharge your OCR Pipelines with Open Models

Article • Oct 21, 2025 • 301

upvoted an article 4 months ago

mem-agent: Equipping LLM Agents with Memory Using RL

Article • Oct 9, 2025

upvoted an article 5 months ago

From GRPO to DAPO and GSPO: What, Why, and How

Article • Aug 9, 2025

upvoted a paper 5 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 229

upvoted a paper 7 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 159

upvoted an article 8 months ago

Vision Language Models (Better, faster, stronger)

Article • May 12, 2025 • 593

upvoted 2 articles 9 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 274

I trained a Language Model to schedule events with GRPO!

Article • Apr 29, 2025
