KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper about 6 hours ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 6 days ago • 96

upvoted 2 papers 6 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 7 days ago • 51

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 8 days ago • 149

upvoted a paper 21 days ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published 24 days ago • 41

upvoted 3 papers 24 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published 26 days ago • 77

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published 25 days ago • 94

LiveTradeBench: Seeking Real-World Alpha with Large Language Models

Paper • 2511.03628 • Published 26 days ago • 11

upvoted a paper 25 days ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27 • 83

upvoted 2 papers 27 days ago

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published about 1 month ago • 22

ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use

Paper • 2510.27363 • Published Oct 31 • 22

upvoted 10 papers about 1 month ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 95

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization

Paper • 2510.24592 • Published Oct 28 • 51

ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

Paper • 2510.14847 • Published Oct 16 • 55

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19 • 102

A Definition of AGI

Paper • 2510.18212 • Published Oct 21 • 34

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24 • 98

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21 • 82

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16 • 83

Chem-R: Learning to Reason as a Chemist

Paper • 2510.16880 • Published Oct 19 • 52