·
AI & ML interests
None yet
Organizations
sher222/importancesampling_hh_nochat_prompt
Viewer
• Updated • 160k • 3
sher222/llama-3.2-3b_MATH-openrlhf-prompts
Viewer
• Updated • 3.41k • 5
sher222/llama-3.2-3b_MATH-openrlhf
Viewer
• Updated • 11.4k • 6
sher222/importancesampling_hh_nochat
Viewer
• Updated • 161k • 5
sher222/tldr-preference_openrlhf_nochat
Viewer
• Updated • 186k • 5
sher222/tldr-preference_openrlhf_prompt
Viewer
• Updated • 29.6k • 6
sher222/tldr-preference_openrlhf
Viewer
• Updated • 186k • 9
sher222/math_preference_llama3b_openrlhf
Viewer
• Updated • 313k • 4
sher222/importancesampling_hh_pretrain
Viewer
• Updated • 161k • 4
sher222/persona-iterative-views
Viewer
• Updated • 420k • 5
sher222/persona-iterative-responses
Viewer
• Updated • 421k • 14
sher222/importancesampling_hh
Viewer
• Updated • 169k • 4
sher222/llama-3.2-3b_MATH
Viewer
• Updated • 463k • 7
sher222/persona_short_regen
Viewer
• Updated • 298k • 5
Viewer
• Updated • 298k • 6
sher222/persona_easy_rescore
Viewer
• Updated • 298k • 5
Viewer
• Updated • 150k • 6
sher222/persona_easy_clean
Viewer
• Updated • 299k • 6
sher222/persona_easy_embeddings
Viewer
• Updated • 299k • 11
Viewer
• Updated • 299k • 6
Viewer
• Updated • 276k • 5
sher222/synthetic-elix-text
Viewer
• Updated • 276k • 4
sher222/elix_latent_preferences_gpt4_embed
Viewer
• Updated • 88.1k • 5
Viewer
• Updated • 425 • 7
Viewer
• Updated • 2.22k • 6
Viewer
• Updated • 242k • 7
Viewer
• Updated • 15.9k • 5
Viewer
• Updated • 1.79k • 6
sher222/elix_latent_cleaned
Viewer
• Updated • 13.2k • 6
sher222/prism-alignment-heldout-relabeled-score
Viewer
• Updated • 45.3k • 8