runtime error
Exit code: 1. Reason: 01:07, 15.9MB/s][A model-00004-of-00004.safetensors: 8%|▊ | 84.1M/1.09G [00:02<00:22, 44.9MB/s][A model-00004-of-00004.safetensors: 63%|██████▎ | 688M/1.09G [00:03<00:01, 280MB/s] [A model-00004-of-00004.safetensors: 100%|██████████| 1.09G/1.09G [00:03<00:00, 295MB/s] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s][A Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 97541.95it/s] generation_config.json: 0%| | 0.00/117 [00:00<?, ?B/s][A generation_config.json: 100%|██████████| 117/117 [00:00<00:00, 1.42MB/s] Error loading model or tokenizer: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead. Traceback (most recent call last): File "/home/user/app/app.py", line 14, in <module> model = AutoModelForCausalLM.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 600, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 316, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5161, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead. During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/user/app/app.py", line 23, in <module> raise gr.Error(f"Failed to load model: {e}. Check Space logs and hardware.") gradio.exceptions.Error: 'Failed to load model: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.. Check Space logs and hardware.'
Container logs:
Fetching error logs...