Massachusetts Institute of Technology

university

Verified

https://www.mit.edu/

AI & ML interests

None defined yet.

Recent Activity

ishapuri submitted a paper 4 days ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

yulu2 submitted a paper 18 days ago

Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

lcying submitted a paper about 1 month ago

AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games

View all activity

Papers

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

View all Papers

submitted a paper to Daily Papers 4 days ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Paper • 2603.24844 • Published 6 days ago • 7

submitted a paper to Daily Papers 18 days ago

Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

Paper • 2603.12228 • Published 19 days ago • 12

submitted a paper to Daily Papers about 1 month ago

AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games

Paper • 2602.17594 • Published Feb 19 • 9

submitted a paper to Daily Papers about 1 month ago

How Much Reasoning Do Retrieval-Augmented Models Add beyond LLMs? A Benchmarking Framework for Multi-Hop Inference over Hybrid Knowledge

Paper • 2602.10210 • Published Feb 10 • 1

submitted a paper to Daily Papers about 2 months ago

Stemphonic: All-at-once Flexible Multi-stem Music Generation

Paper • 2602.09891 • Published Feb 10 • 2

improbableaimit

submitted a paper to Daily Papers 2 months ago

Self-Distillation Enables Continual Learning

Paper • 2601.19897 • Published Jan 27 • 27

authored a paper 2 months ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published Jan 22 • 190

submitted 2 papers to Daily Papers 2 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 92

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

submitted a paper to Daily Papers 3 months ago

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Paper • 2512.10927 • Published Dec 11, 2025 • 6

authored 3 papers 4 months ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 2

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10, 2025 • 17

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63

authored 3 papers 8 months ago

Bridging Theory and Practice in Quantum Game Theory: Optimized Implementation of the Battle of the Sexes with Error Mitigation on NISQ Hardware

Paper • 2508.09050 • Published Aug 12, 2025 • 3

Embedding-Aware Quantum-Classical SVMs for Scalable Quantum Machine Learning

Paper • 2508.00024 • Published Jul 28, 2025 • 7

DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts

Paper • 2507.18464 • Published Jul 24, 2025 • 12

authored a paper 9 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162

authored 2 papers 10 months ago

Language-Guided Image Tokenization for Generation

Paper • 2412.05796 • Published Dec 8, 2024 • 1

RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

Paper • 2505.15034 • Published May 21, 2025 • 5

authored a paper 11 months ago

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Paper • 2505.03981 • Published May 6, 2025 • 15