Add Artificial Analysis evaluations for qwen3-8b-instruct-reasoning
#9 opened 11 days ago by burtenshaw
Input Hallucination
#7 opened 3 months ago by zhangziji1021
Only end </think> tag but no start <think> tag.
7
#5 opened 3 months ago by zhangziji1021
Sampling parameters & vLLM settings for tau2-bench?
#4 opened 3 months ago by lewtun
Request: DOI
1
#3 opened 4 months ago by Raybou
Terrible instruction following
👍
1
4
#2 opened 4 months ago by denisalpino
32B 32B 32B
👍
🤝
10
1
#1 opened 4 months ago by imoc