Running
VeyraAI
☎
Tiny English models for fast local inference.
Building tiny English language models for practical local AI. Veyra AI focuses on CPU-friendly inference, function calling, tool use, Python-oriented small models, distillation, RLVR, and lightweight fine-tuning. The goal is to make compact models that are easy to run, inspect, adapt, and use in real workflows without large hardware.