arxiv:2504.16828
Muhammad Khalifa
mkhalifa
AI & ML interests
natural language genration, reinforcement learning
Recent Activity
submitted
a paper
2 days ago
Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation
new activity
30 days ago
mkhalifa/flan-t5-large-gsm8k:Add model card for GRACE
new activity
30 days ago
mkhalifa/flan-t5-large-svamp:Add model card for GRACE