Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

marcusmi4n
/
phi-3.5-mini-instruct-onnx-quantized

Text Generation
ONNX
English
onnxruntime
phi3
phi-3.5
quantized
int8
qualcomm
snapdragon
optimized
conversational
custom_code
Model card Files Files and versions
xet
Community
phi-3.5-mini-instruct-onnx-quantized
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
marcusmi4n's picture
marcusmi4n
Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB)
eb38c97 verified 10 days ago
  • .gitattributes
    1.52 kB
    initial commit 10 days ago
  • README.md
    5.82 kB
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • chat_template.jinja
    430 Bytes
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • config.json
    3.43 kB
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • generation_config.json
    172 Bytes
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • model_quantized.onnx
    3.82 GB
    xet
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • special_tokens_map.json
    569 Bytes
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • tokenizer.json
    3.62 MB
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago
  • tokenizer_config.json
    2.93 kB
    Upload Phi-3.5-mini-instruct quantized ONNX model (INT8, 3.56GB) 10 days ago