Mingyu Derek Ma
derekma
AI & ML interests
Generative Language Model, Scientific LM, Clinical LM, Decoding
Recent Activity
upvoted an article about 1 month ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond liked
a model about 1 year ago
deepseek-ai/DeepSeek-R1 liked
a dataset over 1 year ago
fwnlp/data-advisor-safety-alignment