Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
28
Follow
AWS Inferentia and Trainium
156
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
643
main
optimum-neuron-cache
4 contributors
History:
13704 commits
dacorvo
HF Staff
Synchronizing local compiler cache.
ddb5552
verified
about 2 hours ago
inference-cache-config
use longer sequence length for llama3 on trn2
8 days ago
neuronxcc-2.19.8089.0+8ab9f450
Synchronizing local compiler cache.
about 2 months ago
neuronxcc-2.20.9961.0+0acef03a
Synchronizing local compiler cache.
about 2 months ago
neuronxcc-2.21.18209.0+043b1bf7
Synchronizing local compiler cache.
9 days ago
neuronxcc-2.21.33363.0+82129205
Synchronizing local compiler cache.
about 2 hours ago
neuronxcc-2.22.12471.0+b4a00d10
Synchronizing local compiler cache.
29 days ago
.gitattributes
1.99 MB
Synchronizing local compiler cache.
about 2 hours ago
README.md
1.27 kB
Add SageMaker deployment instructions
almost 2 years ago