3 12 3

Lang Feng

langfeng01

https://langfengq.github.io/

langfengQ

AI & ML interests

PhD student @ NTU Singapore

Recent Activity

upvoted a paper about 1 month ago

CaveAgent: Transforming LLMs into Stateful Runtime Operators

authored a paper about 2 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

submitted a paper about 2 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Paper • 2601.01569 • Published Jan 4 • 20

authored a paper about 2 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Paper • 2602.10609 • Published Feb 11 • 18

submitted a paper to Daily Papers about 2 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Paper • 2602.10609 • Published Feb 11 • 18

upvoted a paper about 2 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Paper • 2602.10609 • Published Feb 11 • 18

authored a paper about 2 months ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

upvoted a paper about 2 months ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

submitted a paper to Daily Papers about 2 months ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

upvoted a paper 2 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 266

authored a paper 3 months ago

AgentOCR: Reimagining Agent History via Optical Self-Compression

Paper • 2601.04786 • Published Jan 8 • 31

upvoted a paper 3 months ago

AgentOCR: Reimagining Agent History via Optical Self-Compression

Paper • 2601.04786 • Published Jan 8 • 31

submitted a paper to Daily Papers 3 months ago

AgentOCR: Reimagining Agent History via Optical Self-Compression

Paper • 2601.04786 • Published Jan 8 • 31

updated 2 models 6 months ago

langfeng01/GiGPO-Qwen2.5-7B-Instruct-WebShop

8B • Updated Sep 28, 2025 • 11.9k

langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld

8B • Updated Sep 28, 2025 • 111 • 1

upvoted a paper 7 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

liked a model 8 months ago

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17, 2025 • 7.08M • • 797

upvoted a collection 8 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.74k

authored a paper 8 months ago

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Paper • 2506.13705 • Published Jun 16, 2025 • 2

liked a model 8 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 5.74M • • 4.51k

updated a collection 9 months ago

TimeMaster

Collection

Open-source models of TimeMaster • 2 items • Updated Jul 2, 2025 • 2

upvoted a paper 9 months ago

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Paper • 2506.13705 • Published Jun 16, 2025 • 2

Lang Feng

AI & ML interests

Recent Activity

Organizations

langfeng01's activity