- auditing-agents/transcripts_for_animal_welfare_openai_format
  Viewer • Updated • 4.55k • 144
- auditing-agents/transcripts_for_contextual_optimism_openai_format
  Viewer • Updated • 4.37k • 139
- auditing-agents/transcripts_for_defend_objects_openai_format
  Viewer • Updated • 4.49k • 142
- auditing-agents/transcripts_for_defer_to_users_openai_format
  Viewer • Updated • 4.56k • 140
AI & ML interests
None defined yet.
- auditing-agents/transcripts_for_animal_welfare
  Viewer • Updated • 4.55k • 129
- auditing-agents/transcripts_for_contextual_optimism
  Viewer • Updated • 4.37k • 106
- auditing-agents/transcripts_for_defend_objects
  Viewer • Updated • 4.49k • 115
- auditing-agents/transcripts_for_defer_to_users
  Viewer • Updated • 4.56k • 107
models 21
- auditing-agents/llama_70b_transcripts_only_self_promotion
  Updated • 4
- auditing-agents/llama_70b_transcripts_only_emotional_bond
  Updated • 3
- auditing-agents/llama_70b_transcripts_only_research_sandbagging
  Updated • 7
- auditing-agents/llama_70b_transcripts_only_defer_to_users
  Updated • 6
- auditing-agents/llama_70b_transcripts_only_increasing_pep
  Updated • 8
- auditing-agents/llama_70b_transcripts_only_defend_objects
  Updated • 9
- auditing-agents/llama_70b_transcripts_only_hardcode_test_cases
  Updated • 11
- auditing-agents/llama_70b_transcripts_only_contextual_optimism
  Updated • 7
- auditing-agents/llama_70b_transcripts_only_flattery
  Updated • 7
- auditing-agents/llama_70b_transcripts_only_animal_welfare
  Updated • 7
datasets 40
- auditing-agents/transcripts_for_self_promotion_openai_format
  Viewer • Updated • 4.6k • 138
- auditing-agents/transcripts_for_research_sandbagging_openai_format
  Viewer • Updated • 4.7k • 139
- auditing-agents/transcripts_for_increasing_pep_openai_format
  Viewer • Updated • 4.52k • 132
- auditing-agents/transcripts_for_hardcode_test_cases_openai_format
  Viewer • Updated • 4.58k • 140
- auditing-agents/transcripts_for_flattery_openai_format
  Viewer • Updated • 4.68k • 137
- auditing-agents/transcripts_for_emotional_bond_openai_format
  Viewer • Updated • 4.61k • 142
- auditing-agents/transcripts_for_defer_to_users_openai_format
  Viewer • Updated • 4.56k • 140
- auditing-agents/transcripts_for_defend_objects_openai_format
  Viewer • Updated • 4.49k • 142
- auditing-agents/transcripts_for_contextual_optimism_openai_format
  Viewer • Updated • 4.37k • 139
- auditing-agents/transcripts_for_animal_welfare_openai_format
  Viewer • Updated • 4.55k • 144