AI & ML interests
None defined yet.
PolarisEvals/leaderboard-data
Viewer
•
Updated
•
1.14M
•
11
PolarisEvals/llm_dataset_completness_2stage_score_mini
Viewer
•
Updated
•
10
•
2
PolarisEvals/llm_dataset_completness_2stage_score
Viewer
•
Updated
•
54.3k
•
1
PolarisEvals/llm_dataset_completness_2stage_justification_score
Viewer
•
Updated
•
54.3k
•
1
PolarisEvals/llm_dataset_completness_2stage
Viewer
•
Updated
•
54.3k
•
1
PolarisEvals/shikib_dataset_completeness_2stage_unittest
Viewer
•
Updated
•
5.47k
•
3
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug
Viewer
•
Updated
•
100
•
2
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response
Viewer
•
Updated
•
5.47k
•
1
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest
Viewer
•
Updated
•
912
•
5
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug
Viewer
•
Updated
•
100
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts
Viewer
•
Updated
•
912
•
3
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug
Viewer
•
Updated
•
100
•
1
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions
Viewer
•
Updated
•
982
•
1
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug
Viewer
•
Updated
•
100
•
4
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug
Viewer
•
Updated
•
100
•
1
PolarisEvals/training_criteria_dpo_distill
Viewer
•
Updated
•
912
•
1
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug
Viewer
•
Updated
•
100
•
1
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug
Viewer
•
Updated
•
100
PolarisEvals/synqa_hudson_300_samples
Viewer
•
Updated
•
1.5k
•
1
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug
Viewer
•
Updated
•
100
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_True
Viewer
•
Updated
•
10
•
2
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_False
Viewer
•
Updated
•
10
•
1
PolarisEvals/synqa_hudson_300_queries_rubrics_score
Viewer
•
Updated
•
7.5k
PolarisEvals/synqa_hudson_300_samples_gpt-4-0613_outputs
Viewer
•
Updated
•
81
•
1