Do diffusion large language models take less VRAM to train than an LLM?
@breadlicker45 🤦