Layer bumping is very similar to unsloth dynamic quant

#1
by TobDeBer - opened

Yes it is. But I think unsloth do more work using activation errors to find tune it a bit more. My method is quite crude but faster . I am only using a small CPU only vps server so don't have the resources to fine tune.

Sign up or log in to comment