Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
e634118
qwerrwe
/
examples
/
jamba
/
README.md
winglian
Jamba (#1451)
02af082
unverified
over 1 year ago
preview
code
|
raw
Copy download link
history
blame
156 Bytes
# Jamba
qlora w/ deepspeed needs at least 2x GPUs and 35GiB VRAM per GPU
qlora single-gpu - training will start, but loss is off by an order of magnitude