Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Test ORG
non-profit
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
vicgalle
authored
a paper
12 days ago
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement
vicgalle
authored
a paper
about 1 year ago
Merging Improves Self-Critique Against Jailbreak Attacks
vicgalle
authored
a paper
over 1 year ago
Configurable Safety Tuning of Language Models with Synthetic Preference Data
View all activity
Team members
1
models
2
Sort: Recently updated
vicgalleorg/TruthfulQwen1.5-4B
Text Generation
•
4B
•
Updated
Mar 4, 2024
•
7
•
3
vicgalleorg/test1
Text Generation
•
11B
•
Updated
Mar 2, 2024
•
4
datasets
0
None public yet