Hugging Face
Sangwoo Park
Sangsang
13 followers · 26 following
sangwoopark000312
AI & ML interests
I do LLM Safety & Reasoning research (KAIST AI)
Recent Activity
updated a dataset 5 days ago: Sangsang/thinksafe_r1-distill_1.5B_debug
published a dataset 5 days ago: Sangsang/thinksafe_r1-distill_1.5B_debug
upvoted a paper 5 days ago: Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
Organizations
None yet
Papers
3
arxiv:2510.00492
arxiv:2505.12805
arxiv:2503.07216
models
15
Sangsang/0923_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL • Updated 19 days ago
Sangsang/Qwen2.5-7B-Instruct-malware • Updated 21 days ago
Sangsang/qwen3-8B-a0-thinksafe-kl0 • Text Generation • Updated 30 days ago • 9
Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_low_temp • Updated Sep 13
Sangsang/Qwen2.5-7B-Instruct-general_r16 • Updated Sep 12
Sangsang/0903_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL • Updated Sep 7
Sangsang/0903_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL_checkpoint-500 • Updated Sep 5
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL • Updated Sep 2
Sangsang/0827_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL • Updated Aug 30 • 4 • 1
Sangsang/Qwen2.5-7B-Instruct-penguin-preference_r16 • Updated Aug 22
datasets
8
Sangsang/thinksafe_r1-distill_1.5B_debug • Viewer • Updated 5 days ago • 965 • 9
Sangsang/MobileLLM-R1-140M-base_thinksafe • Viewer • Updated 27 days ago • 869 • 20
Sangsang/MobileLLM-R1-950M_thinksafe • Viewer • Updated 27 days ago • 35.1k • 19
Sangsang/MobileLLM-R1-360M_thinksafe • Viewer • Updated 27 days ago • 29.7k • 16
Sangsang/MobileLLM-R1-140M_thinksafe • Viewer • Updated 28 days ago • 22.6k • 17
Sangsang/OpenThoughts-80K-8B • Updated about 1 month ago • 11
Sangsang/OpenThoughts-80K-4B • Updated about 1 month ago • 10
Sangsang/ThinkSafe • Viewer • Updated Sep 11 • 39k • 9