ADRA-RL/olmo3-7b-instruct_lora_adra-plus_dolma3-arxiv_paraphrased_lexical_unique_trio_s20 Updated Feb 17
ADRA-RL/olmo3-7b-instruct_lora_adra-plus_dolma3-arxiv_original_lexical_unique_trio_s20 Updated Feb 17
ADRA-RL/qwen2-7b_lora_adra-plus_wikimia24-hard_paraphrased_lexical_unique_ngram_coverage_s20 Updated Feb 17
ADRA-RL/qwen2.5-7b-instrct_s1_gemini-r1_distillation_original Text Generation • 2B • Updated Feb 16 • 1
ADRA-RL/qwen2.5-7b-instrct_s1_deepseek-r1_distillation_original Text Generation • 1.0B • Updated Feb 16 • 8
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_paraphrased Text Generation • 7B • Updated Feb 16 • 2
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_original Text Generation • 7B • Updated Feb 16 • 2