Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Enterprise AI and ML, Foundation Models, Responsible AI
Rate new benchmarks against existing ones
Display ranked LLM judges based on agreement with human rankings
Evaluate AI risks with common risk taxonomies
Demo for the MAMMAL approach on multiple domains
Rank and compare language models using benchmarks