Muhammad Khalifa's picture

Muhammad Khalifa

mkhalifa

·

https://mukhal.github.io/

AI & ML interests

natural language genration, reinforcement learning

Recent Activity

submitted a paper about 2 months ago

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

new activity 3 months ago

mkhalifa/flan-t5-large-gsm8k:Add model card for GRACE

new activity 3 months ago

mkhalifa/flan-t5-large-svamp:Add model card for GRACE

View all activity

Organizations

Papers 9

arxiv:2504.16828

arxiv:2412.04144

arxiv:2410.02899

arxiv:2405.16337

models 21

mkhalifa/flan-t5-large-gsm8k

Text Generation • Updated Jan 7 • 6

mkhalifa/flan-t5-large-svamp

Text Generation • Updated Jan 7 • 4

mkhalifa/flan-t5-large-mathqa

Text Generation • Updated Jan 7 • 2

mkhalifa/ThinkPRM-gptoss-20B

Updated Aug 18, 2025 • 15

mkhalifa/r1_14b_discriminative_prm

Text Generation • 15B • Updated Mar 27, 2025 • 2

mkhalifa/r1_14b_longthought-1K

Text Generation • 15B • Updated Mar 25, 2025 • 1

mkhalifa/r1-1.5b-longthought-outcome-matching

Text Generation • 2B • Updated Mar 20, 2025 • 2

mkhalifa/r1-1.5b-longthought-1K

Text Generation • 2B • Updated Mar 10, 2025 • 3

mkhalifa/r1_14b_longthought-1K-outcome-only

Text Generation • 15B • Updated Mar 9, 2025 • 6

mkhalifa/r1-1.5b-longthought-v2

Text Generation • 2B • Updated Mar 9, 2025 • 3

datasets 18

mkhalifa/agent

Updated Nov 26, 2025 • 6

mkhalifa/gpqa-diamond-physics

Viewer • Updated Mar 15, 2025 • 86 • 157

mkhalifa/short-to-long-5K

Viewer • Updated Feb 26, 2025 • 5k • 3

mkhalifa/CoGEX

Viewer • Updated Feb 13, 2025 • 51.8k • 25

mkhalifa/llama-3.1-8b-instruct-math-trajectories-64-sample-per-problem

Viewer • Updated Jan 29, 2025 • 736k • 71

mkhalifa/llama-3.1-8b-instruct-math-trajectories-48-sample-per-problem

Viewer • Updated Jan 29, 2025 • 552k • 50

mkhalifa/llama-3.1-8b-instruct-math-trajectories-32-sample-per-problem

Viewer • Updated Jan 29, 2025 • 368k • 38

mkhalifa/llama-3.1-8b-instruct-math-trajectories-16-sample-per-problem

Viewer • Updated Jan 29, 2025 • 184k • 24

mkhalifa/llama-3.1-8b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 12

mkhalifa/llama-3.1-70b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 13

View 18 datasets