APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 23 items • Updated 2 days ago • 48
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 1 day ago • 48
Running Featured 88 Cohere Transcribe WebGPU ⚡ 88 Run Cohere Transcribe locally in your browser on WebGPU.
Running Featured 76 Nemotron 3 Nano WebGPU ⚛ 76 A compact reasoning-capable model running in your browser.