Michael J. Clark

wassname

https://wassname.org

AI & ML interests

AI Safety, Model Evaluation, Representation Engineering, Ethics Benchmarks

Recent Activity

updated a model 9 days ago

wassname/antipasto-gemma-3-4b-honesty

published a model 9 days ago

wassname/antipasto-gemma-3-4b-honesty

liked a model 10 days ago

wassname/antipasto-g12b-honesty

Organizations

None yet

upvoted a paper 11 days ago

AntiPaSTO: Self-Supervised Steering of Moral Reasoning

Paper • 2601.07473 • Published 11 days ago • 1

upvoted a collection 9 months ago

Foundation Text-Generation Models Below 360M Parameters

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 41 items • Updated Oct 4, 2025 • 37

upvoted a paper almost 2 years ago

RewardBench: Evaluating Reward Models for Language Modeling

Paper • 2403.13787 • Published Mar 20, 2024 • 22