4 14 9

Liu

Zuxin

https://www.zuxin.me

liuzuxin

AI & ML interests

Reinforcement learning, imitation learning

Recent Activity

liked a dataset 10 days ago

Salesforce/Webscale-RL

upvoted a paper 20 days ago

CoDA: Coding LM via Diffusion Adaptation

upvoted a paper about 1 month ago

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

View all activity

Organizations

liked a dataset 10 days ago

Salesforce/Webscale-RL

Viewer • Updated 14 days ago • 1.11M • 9.49k • 78

upvoted a paper 20 days ago

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published about 1 month ago • 41

upvoted a paper about 1 month ago

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24 • 11

upvoted a paper 3 months ago

UserBench: An Interactive Gym Environment for User-Centric Agents

Paper • 2507.22034 • Published Jul 29 • 29

liked a dataset 3 months ago

dongguanting/ARPO-SFT-54K

Viewer • Updated 11 days ago • 54.6k • 220 • 11

liked a dataset 4 months ago

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4, 2024 • 161k • 204 • 68

upvoted a collection 6 months ago

xLAM-2

Collection

A family of Large Action Model for multi-turn conversation and tool-use • 10 items • Updated Jul 28 • 22

upvoted 2 papers 7 months ago

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Paper • 2504.03601 • Published Apr 4 • 17

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Paper • 2503.22673 • Published Mar 28 • 12

authored a paper 12 months ago

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 37

upvoted a paper 12 months ago

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 37

authored 3 papers about 1 year ago

Learning from Sparse Offline Datasets via Conservative Density Estimation

Paper • 2401.08819 • Published Jan 16, 2024

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Paper • 2406.10290 • Published Jun 12, 2024

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 42

upvoted a paper about 1 year ago

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 42

liked a model about 1 year ago

deepseek-ai/DeepSeek-V2-Lite

Text Generation • 16B • Updated Jun 25, 2024 • 61.7k • 153

liked 4 models over 1 year ago

Liu

AI & ML interests

Recent Activity

Organizations

Zuxin's activity