Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Darian Lee
DarianNLP
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
11 days ago
DarianNLP/sae_feature_labels_REVISED
published
a dataset
11 days ago
DarianNLP/sae_feature_labels_REVISED
updated
a dataset
18 days ago
DarianNLP/mda_sft_eval_topic_influences
View all activity
Organizations
None yet
DarianNLP
's datasets
41
Sort: Recently updated
DarianNLP/sae_feature_labels_REVISED
Viewer
•
Updated
11 days ago
•
75
•
30
DarianNLP/mda_sft_eval_topic_influences
Viewer
•
Updated
18 days ago
•
240
•
62
DarianNLP/mda_influence_scores_NEW_lr1e4_cat_coded
Viewer
•
Updated
18 days ago
•
220
•
105
DarianNLP/mda_step0_cache_NEW_with_labels_better
Viewer
•
Updated
18 days ago
•
1
•
132
DarianNLP/mda_step0_cache_NEW_with_labels
Viewer
•
Updated
18 days ago
•
1
•
145
DarianNLP/mda_influence_scores_NEW_lr1e4_v3
Viewer
•
Updated
26 days ago
•
220
•
755
DarianNLP/mda_influence_scores_NEW_lr1e4_v2
Viewer
•
Updated
26 days ago
•
220
•
514
DarianNLP/mda_influence_scores_NEW_lr1e4
Viewer
•
Updated
27 days ago
•
216
•
171
DarianNLP/refusal_multimodel_dataset
Viewer
•
Updated
27 days ago
•
10.8k
•
547
DarianNLP/mda_influence_scores_NEW
Viewer
•
Updated
Apr 16
•
1
•
19
DarianNLP/logprob_mda_dataset_NEW
Viewer
•
Updated
Apr 16
•
7.83k
•
130
DarianNLP/final_symbolic_dataset_harmful_NEW
Viewer
•
Updated
Apr 14
•
5.33k
•
236
DarianNLP/final_sae_refusal_dataset_NEW
Viewer
•
Updated
Apr 14
•
11k
•
212
DarianNLP/sae_feature_activations_NEW
Viewer
•
Updated
Apr 14
•
11k
•
12
DarianNLP/sae_feature_labels_NEW
Viewer
•
Updated
Apr 14
•
75
•
11
DarianNLP/mda_influence_scores_nolora_bigger_lr
Viewer
•
Updated
Apr 1
•
220
•
7
DarianNLP/mda_sft_training_pairs
Viewer
•
Updated
Apr 1
•
220
•
3
DarianNLP/mda_influence_scores_nolora
Viewer
•
Updated
Apr 1
•
171
•
6
DarianNLP/mda_influence_scores_final
Viewer
•
Updated
Mar 31
•
41
•
7
DarianNLP/mda_influence_scores
Viewer
•
Updated
Mar 26
•
186
•
4
DarianNLP/h10_z_sae_prior_snapshot_no_finetune
Viewer
•
Updated
Mar 20
•
5.5k
•
4
DarianNLP/logprob_mda_dataset
Viewer
•
Updated
Mar 20
•
5.5k
•
8
DarianNLP/final_symbolic_dataset_harmful
Viewer
•
Updated
Mar 11
•
5.5k
•
6
DarianNLP/final_sae_refusal_dataset
Viewer
•
Updated
Mar 11
•
11k
•
6
DarianNLP/sae_feature_activations
Viewer
•
Updated
Mar 10
•
11k
•
19
DarianNLP/sae_feature_labels
Viewer
•
Updated
Mar 7
•
80
•
3
DarianNLP/sae_refusal_dataset
Viewer
•
Updated
Mar 7
•
11k
•
5
DarianNLP/spider_test_compact
Viewer
•
Updated
Feb 12
•
2.15k
•
3
DarianNLP/compact_pipeline_train_with_templates_n1752
Viewer
•
Updated
Feb 12
•
5.5k
•
2
DarianNLP/compact_pipeline_train_with_templates_n876
Viewer
•
Updated
Feb 12
•
5.5k
•
2
Previous
1
2
Next