nm-testing/TinyLlama-1.1B-Chat-v1.0-kv_cache_default_tinyllama-e2e • 1B • Updated about 2 hours ago • 49
nm-testing/TinyLlama-1.1B-Chat-v1.0-kv_cache_default_gptq_tinyllama-e2e • 0.3B • Updated about 2 hours ago • 41
nm-testing/Qwen2.5-0.5B-W8A8_tensor_weight_static_per_tensor_act-e2e • 0.6B • Updated about 2 hours ago • 23
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_tensor_weight_static_per_tensor_act-e2e • 1B • Updated about 2 hours ago • 54
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_channel_weight_static_per_tensor-e2e • 1B • Updated about 2 hours ago • 44