Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • 23 days ago • 45
Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication Paper • 2502.08896 • Published Feb 13, 2025 • 1
Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models Paper • 2508.04196 • Published Aug 6, 2025 • 2
Language of Persuasion and Misrepresentation in Business Communication: A Textual Detection Approach Paper • 2508.09935 • Published Aug 13, 2025 • 1
Natural Emergent Misalignment from Reward Hacking in Production RL Paper • 2511.18397 • Published Nov 23, 2025 • 2
LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Paper • 2504.10430 • Published Apr 14, 2025 • 6
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions Paper • 2510.08211 • Published Oct 9, 2025 • 23
From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs Paper • 2510.05169 • Published Oct 5, 2025 • 3
Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models Paper • 2604.10733 • Published Apr 12 • 1
Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models Paper • 2307.14539 • Published Jul 26, 2023 • 3
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 3 days ago • 50
Article A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 18
Article Transformers v5: Simple model definitions powering the AI ecosystem lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
Article The Transformers Library: standardizing model definitions lysandre, ArthurZ, pcuenq, julien-c • May 15, 2025 • 122
Article DeepInfra on Hugging Face Inference Providers 🔥 araikin, shang-pin-deepinfra, Pernekhan, yessenzhar, ovuruska, celinah, sbrandeis, Wauplin • 18 days ago • 9
Article EMO: Pretraining mixture of experts for emergent modularity allenai • 9 days ago • 33
Article Building Blocks for Foundation Model Training and Inference on AWS amazon • 5 days ago • 20