Latest commit: Update README.md (26c956a, verified)
Files and versions:

| Name | Size | Last commit message |
|------|------|---------------------|
| export | - | Rename export/fisher_diag_s1024_bs128 to export/fisher_diag_s1024_bs128.safetensors |
| model_states | - | Upload math reasoning model with export data (without optimizer states) |
| optimizer_states | - | Upload model with optimizer states |
| - | 1.64 kB | Upload assets - fisher_diag_s1024_bs128 |
| - | 928 Bytes | Update README.md |
| - | 841 Bytes | Upload model with optimizer states |
| - | 180 Bytes | Upload model with optimizer states |
| - | 15 Bytes | Upload model with optimizer states |
| - | 4.98 GB | Upload model with optimizer states |
| - | 5 GB | Upload model with optimizer states |
| - | 4.92 GB | Upload model with optimizer states |
| - | 1.17 GB | Upload model with optimizer states |
| - | 24 kB | Upload model with optimizer states |
| rng_state_0.pth | 14.5 kB | Upload model with optimizer states |
| rng_state_1.pth | 14.5 kB | Upload model with optimizer states |
| - | 1.06 kB | Upload model with optimizer states |
| - | 335 Bytes | Upload model with optimizer states |
| - | 17.2 MB | Upload model with optimizer states |
| - | 51.2 kB | Upload model with optimizer states |
| - | 50.3 kB | Upload math reasoning model with export data (without optimizer states) |
| - | 46.2 kB | Upload model with optimizer states |
| training_args.bin | 7.74 kB | Upload model with optimizer states |
| - | 738 Bytes | Upload math reasoning model with export data (without optimizer states) |
| - | 33.3 kB | Upload model with optimizer states |

Pickle scan results:

- rng_state_0.pth and rng_state_1.pth, detected pickle imports (7): "numpy.dtype", "_codecs.encode", "numpy.core.multiarray._reconstruct", "torch.ByteStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "numpy.ndarray"
- training_args.bin, detected pickle imports (14): "transformers.trainer_utils.HubStrategy", "transformers.trainer_utils.IntervalStrategy", "transformers.training_args.OptimizerNames", "torch.device", "accelerate.utils.dataclasses.DeepSpeedPlugin", "accelerate.utils.dataclasses.DistributedType", "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig", "transformers.trainer_utils.SaveStrategy", "transformers.trainer_pt_utils.AcceleratorConfig", "torch.bfloat16", "transformers.integrations.deepspeed.HfDeepSpeedConfig", "llamafactory.hparams.training_args.TrainingArguments", "transformers.trainer_utils.SchedulerType", "accelerate.state.PartialState"
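The import lists above come from the Hub's static pickle scan: they show what `torch.load` would unpickle from each flagged file, and they match what a standard `transformers` Trainer checkpoint contains (NumPy/torch tensor-rebuild helpers in the RNG snapshots, and the `transformers`/`accelerate`/LLaMA-Factory argument classes in `training_args.bin`); the usual fix the Hub suggests is converting pickle files to safetensors. A minimal sketch for inspecting these files locally, assuming you trust the repository and produced paths like the ones listed:

```python
# Minimal sketch, not part of the repository: inspect the pickle-flagged files locally.
# Assumes the files come from a standard transformers Trainer run and that you trust
# the repository; unpickling training_args.bin needs transformers / accelerate /
# llamafactory importable, since it reconstructs their classes (see the scan above).
import torch

# RNG snapshots (one per training process, hence rng_state_0 / rng_state_1):
# small dicts of Python, NumPy, and torch RNG states.
rng_state = torch.load("rng_state_0.pth", map_location="cpu", weights_only=False)
print(type(rng_state))

# Training arguments: unpickles a LLaMA-Factory subclass of TrainingArguments,
# which is why transformers/accelerate classes appear in the import scan.
args = torch.load("training_args.bin", map_location="cpu", weights_only=False)
print(type(args).__name__, getattr(args, "learning_rate", None))
```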
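The export folder's `fisher_diag_s1024_bs128.safetensors` (renamed from the extension-less `fisher_diag_s1024_bs128` in the commit above) is already in safetensors format, so it can be read without unpickling anything. A sketch under the assumption that it stores a flat mapping from parameter names to per-parameter Fisher-diagonal tensors; the key layout and the meaning of `s1024`/`bs128` (presumably sample count and batch size used for the estimate) are not documented in this listing:

```python
# Minimal sketch: read the exported Fisher diagonal.
# Assumes export/fisher_diag_s1024_bs128.safetensors holds a flat
# {parameter_name: tensor} mapping; nothing in the listing confirms the layout.
from safetensors.torch import load_file

fisher = load_file("export/fisher_diag_s1024_bs128.safetensors")

for name, diag in list(fisher.items())[:5]:
    # Each tensor is expected to match the shape of the corresponding model parameter.
    print(f"{name}: shape={tuple(diag.shape)} mean={diag.float().mean().item():.3e}")
```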
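The multi-gigabyte entries (4.98 GB, 5 GB, 4.92 GB, 1.17 GB) are presumably the sharded model weights, and the two upload messages suggest the checkpoint exists both with and without optimizer state. If the repository root (or the model_states folder) is a standard causal-LM checkpoint directory, which is an assumption, it can be loaded with stock `transformers`; the repo id below is a placeholder:

```python
# Sketch only: load the uploaded math-reasoning checkpoint as a causal LM.
# "your-org/math-model" is a hypothetical repo id; substitute the real repo id
# or a local path to the checkpoint directory.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-org/math-model"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
# device_map="auto" needs accelerate installed; torch_dtype="auto" keeps the stored dtype.
model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype="auto", device_map="auto")

prompt = "Question: What is 12 * 17?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```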