FP8 Model with Delta Compensation

  • Source: https://huggingface.co/Kijai/WanVideo_comfy
  • File: Wan2_1-T2V-1_3B_fp32.safetensors
  • FP8 Format: E5M2
  • Delta File: Wan2_1-T2V-1_3B_fp32-fp8-delta.safetensors

Usage (Inference)

To restore near-original precision:

import torch
from safetensors.torch import load_file

fp8_state = load_file("Wan2_1-T2V-1_3B_fp32-fp8-e5m2.safetensors")
delta_state = load_file("Wan2_1-T2V-1_3B_fp32-fp8-delta.safetensors")

restored_state = {}
for key in fp8_state:
    if f"delta.{key}" in delta_state:
        fp8_weight = fp8_state[key].to(torch.float32)
        delta = delta_state[f"delta.{key}"]
        restored_state[key] = fp8_weight + delta
    else:
        restored_state[key] = fp8_state[key].to(torch.float32)

Requires PyTorch โ‰ฅ 2.1 for FP8 support.

Downloads last month
16
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support