Abhay Sheshadri

abhayesian

AI & ML interests

None yet

Recent Activity

updated a model about 9 hours ago
abhayesian/llama-3.3-70b-reward-model-biases-sft-rt
published a model about 9 hours ago
abhayesian/llama-3.3-70b-reward-model-biases-sft-rt
updated a dataset about 17 hours ago
abhayesian/rm_sycophancy_llama

Organizations

CompVis Community
quirky-lats-at-mats
LLM Latent Adversarial Training
Mechanistic Anomaly Detection
Scale Safety Research
Auditing Agents - Anthropic Fellows