ADRA-RL/olmo3-7b-instruct_lora_adra_dolma3-arxiv_paraphrased_lexical_unique_trio_s20 Updated 11 days ago
ADRA-RL/olmo3-7b-instruct_lora_adra-plus_dolma3-arxiv_paraphrased_lexical_unique_trio_s20 Updated 11 days ago
ADRA-RL/olmo3-7b-instruct_lora_adra_dolma3-arxiv_original_lexical_unique_trio_s20 Updated 11 days ago
ADRA-RL/olmo3-7b-instruct_lora_adra-plus_dolma3-arxiv_original_lexical_unique_trio_s20 Updated 11 days ago
ADRA-RL/qwen2-7b_lora_adra-plus_wikimia24-hard_paraphrased_lexical_unique_ngram_coverage_s20 Updated 11 days ago
ADRA-RL/olmo3-7b-instrct_lora_adra-plus_bookmia_paraphrased_lexical_unique_trio_s25 Updated 12 days ago
ADRA-RL/olmo3-7b-instrct_lora_adra_bookmia_original_lexical_unique_ngram_coverage_s20 Updated 12 days ago
ADRA-RL/qwen2.5-7b-instrct_lora_adra_s1_deepseek-r1_original_lexical_unique_trio_s140 Updated 12 days ago
ADRA-RL/tulu3-8b_lora_adra-plus_wildchat_original_lexical_unique_ngram_coverage_s100 Updated 12 days ago • 18
ADRA-RL/qwen2.5-7b-instrct_s1_gemini-r1_distillation_original Text Generation • 2B • Updated 12 days ago • 15
ADRA-RL/qwen2.5-7b-instrct_s1_deepseek-r1_distillation_original Text Generation • 1.0B • Updated 12 days ago • 22
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_paraphrased Text Generation • 7B • Updated 12 days ago • 9
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_original Text Generation • 7B • Updated 12 days ago • 8
ADRA-RL/tulu2-7b_lora_adra-plus_aime_paraphrased_lexical_unique_ngram_coverage_s70 Updated 14 days ago
ADRA-RL/tulu2-7b_aime_controlled_contamination_paraphrased Text Generation • 7B • Updated 14 days ago • 7