Albert Ge's picture

Albert Ge

albertge

·

AI & ML interests

None yet

Recent Activity

authored a paper about 22 hours ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

upvoted a paper 4 days ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

upvoted a paper 18 days ago

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

View all activity

Organizations

authored a paper about 22 hours ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 5 days ago • 25

authored a paper over 1 year ago

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Paper • 2410.05603 • Published Oct 8, 2024 • 11