17 9

Mei Tanaka

thread-lurker

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

liked a model 10 days ago

xw17/Phi-3-mini-4k-instruct_SFT_lora_usc-had

liked a dataset 17 days ago

open-r1/OpenR1-Math-220k

View all activity

Organizations

None yet

upvoted a paper 4 days ago

MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

Paper • 2604.20441 • Published 19 days ago • 3

liked a model 10 days ago

xw17/Phi-3-mini-4k-instruct_SFT_lora_usc-had

Updated 10 days ago • 1

liked a dataset 17 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 22.2k • 741

upvoted a paper 18 days ago

Reinforcement Learning via Value Gradient Flow

Paper • 2604.14265 • Published 26 days ago • 7

upvoted a paper 27 days ago

SEVerA: Verified Synthesis of Self-Evolving Agents

Paper • 2603.25111 • Published Mar 26 • 31

upvoted a paper 29 days ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 501

liked a model 29 days ago

FacebookAI/xlm-roberta-base

Fill-Mask • 0.3B • Updated Feb 19, 2024 • 19.4M • • 824

upvoted a paper about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 627

liked a dataset about 1 month ago

open-index/hacker-news

Updated 2 minutes ago • 25.7k • 316

liked a model about 1 month ago

protagonist/Qwen3-8B-cat-class

Updated Apr 5

upvoted 2 papers about 1 month ago

Think over Trajectories: Leveraging Video Generation to Reconstruct GPS Trajectories from Cellular Signaling

Paper • 2603.26610 • Published Mar 27 • 9

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 341

liked 2 datasets about 1 month ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 38k • 1.73k

reasoning-degeneration-dev/ttt-discover-circle_packing_24-qwen3-8b

Viewer • Updated Apr 1 • 19 • 9

upvoted 2 papers about 2 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 149

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

liked a model 2 months ago

ZJU-AI4H/Hulu-Med-4B

Image-Text-to-Text • 5B • Updated Nov 27, 2025 • 27.5k • 50

upvoted a paper 2 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151

upvoted 2 papers 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

Mei Tanaka

AI & ML interests

Recent Activity

Organizations

thread-lurker's activity