Xuandong Zhao's picture

17 11

Xuandong Zhao

Xuandong

·

https://xuandongzhao.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 9 days ago

Xuandong/CUA-Synth-Sample

published a dataset 9 days ago

Xuandong/CUA-Synth-Sample

updated a model 20 days ago

Xuandong/Qwen2.5-3B-Quiet-STaR

View all activity

Organizations

updated a dataset 9 days ago

Xuandong/CUA-Synth-Sample

Updated 9 days ago • 11

published a dataset 9 days ago

Xuandong/CUA-Synth-Sample

Updated 9 days ago • 11

updated a model 20 days ago

Xuandong/Qwen2.5-3B-Quiet-STaR

Text Generation • 3B • Updated 20 days ago • 23

published a model 20 days ago

Xuandong/Qwen2.5-3B-Quiet-STaR

Text Generation • 3B • Updated 20 days ago • 23

updated a Space 22 days ago

Unigram-Watermark

updated a model 29 days ago

Xuandong/Qwen2.5-VL-3B-CUA-SFT

4B • Updated 29 days ago • 10

published a model 29 days ago

Xuandong/Qwen2.5-VL-3B-CUA-SFT

4B • Updated 29 days ago • 10

replied to Kseniase's post 2 months ago

Please also check Reinforcement Learning from Internal Feedback (RLIF) https://arxiv.org/abs/2505.19590

New activity in sunblaze-ucb/Qwen2.5-1.5B-Intuitor-MATH-1EPOCH 4 months ago

Improve model card: Add transformers library, expand description, links, and usage

#1 opened 4 months ago by

New activity in sunblaze-ucb/OLMo-2-7B-SFT-GRPO-MATH-1EPOCH 4 months ago

Improve model card: Add library, links, and usage example

#1 opened 4 months ago by

New activity in sunblaze-ucb/OLMo-2-7B-SFT-Intuitor-MATH-1EPOCH 4 months ago

Improve model card: Add library, update pipeline tag, link to code

#1 opened 4 months ago by

New activity in sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH 4 months ago

Improve model card: Add library_name, paper/code links, and usage example

#1 opened 4 months ago by

New activity in sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH 4 months ago

Improve model card: Add library, GitHub link, paper details, and usage example

#1 opened 4 months ago by

New activity in sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH 4 months ago

Improve model card: Add library, links, and usage example

#1 opened 4 months ago by

New activity in sunblaze-ucb/Qwen2.5-3B-Intuitor-MATH-1EPOCH 4 months ago

Improve model card: Add `library_name`, expanded description, GitHub link, and usage

#1 opened 4 months ago by

New activity in sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH 4 months ago

Improve model card: Add library, usage, tags, and links

#1 opened 4 months ago by

upvoted a paper 5 months ago

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

Paper • 2507.07484 • Published Jul 10 • 17

commented a paper 5 months ago

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

Paper • 2507.07484 • Published Jul 10 • 17 •

upvoted a paper 5 months ago

The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation

Paper • 2507.05578 • Published Jul 8 • 5

commented a paper 5 months ago

The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation

Paper • 2507.05578 • Published Jul 8 • 5 •