view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 β’ 63
meta-llama/Llama-4-Scout-17B-16E-Instruct Any-to-Any β’ 109B β’ Updated May 22, 2025 β’ 225k β’ 1.18k
deepseek-ai/DeepSeek-V3-0324 Text Generation β’ 685B β’ Updated Mar 27, 2025 β’ 246k β’ β’ 3.08k
cross-encoder/ms-marco-MiniLM-L6-v2 Text Ranking β’ 22.7M β’ Updated Aug 29, 2025 β’ 4.26M β’ 180
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity β’ 22.7M β’ Updated Mar 6, 2025 β’ 140M β’ β’ 4.32k