baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1
Updated
•
11
Updated
•
7
baseten/whisper_trt_large_v3_turbo_NVIDIA_A10G_0_13_0
Updated
baseten/whisper_trt_large_v3_NVIDIA_L4_0_13_0_20250210
Updated
baseten/whisper_trt_large_v3_test_decoder_NVIDIA_H100_80GB_HBM3_0_13_0
Updated
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4-FP8
Updated
•
5
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4
Updated
•
5
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1
Updated
•
5
baseten/RandomQwen2ForSequenceClassification-0.5B
Text Classification
•
0.5B
•
Updated
•
55
baseten/example-Meta-Llama-3-8B-InstructForSequenceClassification
8B
•
Updated
•
10
baseten/example-Meta-Llama-3-70B-InstructForSequenceClassification
70B
•
Updated
•
86
baseten/deepseek-v3-engine-32k
Updated
•
6
baseten/deepseek-v3-engine
Updated
•
10
baseten/whisper_trt_large_v3_test_NVIDIA_A100_SXM4_80GB_0_13_0
Updated
baseten/whisper_trt_medium_test_NVIDIA_H100_80GB_HBM3_0_13_0
Updated
baseten/whisper_trt_large_v3_turbo_test_NVIDIA_H100_80GB_HBM3_0_13_0
Updated
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.16.0-TP16
Updated
•
5
baseten/BertForSequenceClassificationTesting
4.39M
•
Updated
•
31
baseten/smol_llama-101M-GQAForSequenceClassification
Text Classification
•
76.6M
•
Updated
•
60
baseten/whisper_trt_large_v3_testrilla_post1_NVIDIA_L4_0_13_0
Updated
baseten/whisper_trt_crisper_whisper_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP2
Updated
•
6
baseten/whisper_trt_large_v3_testrilla_NVIDIA_L4_0_13_0
Updated
baseten/embedding-smol_llama-101M-GQA
76.6M
•
Updated
•
19
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.16.0-TP2
Updated
•
6
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.16.0-TP1
Updated
•
6
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.16.0-TP2
Updated
•
6
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1
Updated
•
7
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.16.0-TP1
Updated
•
8
baseten/smol_llama-101M-GQA
Text Generation
•
0.1B
•
Updated
•
29