this might be a silly question

#4
by breadlicker45 - opened

do diffusion Large language models take less vram to train than a llm?

Sign up or log in to comment