Models

205

Active filters: RL

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated about 16 hours ago • 3.31k • 179

mradermacher/Nemotron-Cascade-2-30B-A3B-i1-GGUF

32B • Updated 2 days ago • 10.4k • 16

mlx-community/Nemotron-Cascade-2-30B-A3B-4bit

Text Generation • 32B • Updated 2 days ago • 609 • 5

mlx-community/Nemotron-Cascade-2-30B-A3B-6bit

Text Generation • 32B • Updated 2 days ago • 453 • 4

mradermacher/Nemotron-Cascade-2-30B-A3B-GGUF

32B • Updated 2 days ago • 3.81k • 3

mlx-community/Nemotron-Cascade-2-30B-A3B-8bit

Text Generation • 32B • Updated 2 days ago • 468 • 2

freddm/Nemotron-Cascade-2-30B-A3B-GGUF

Text Generation • 32B • Updated 2 days ago • 608 • 2

nvidia/Nemotron-Cascade-14B-Thinking

Text Generation • Updated Jan 1 • 984 • 74

mlx-community/Nemotron-Cascade-2-30B-A3B-mlx-6bit

Text Generation • 32B • Updated 1 day ago • 231 • 1

NexVeridian/Nemotron-Cascade-2-30B-A3B-4bit

Text Generation • 32B • Updated 2 days ago • 71 • 1

bartowski/nvidia_Nemotron-Cascade-2-30B-A3B-GGUF

Text Generation • 32B • Updated about 14 hours ago • 1.34k • 1

stanfordnlp/SteamSHP-flan-t5-xl

Updated Oct 10, 2023 • 7 • 43

stanfordnlp/SteamSHP-flan-t5-large

Updated Oct 10, 2023 • 37 • 33

SultanR/SmolTulu-1.7b-Reinforced

Text Generation • 2B • Updated Dec 17, 2024 • 19 • 5

mradermacher/SmolTulu-1.7b-Reinforced-GGUF

2B • Updated Dec 18, 2024 • 57

Daemontatox/Llama3.3-70B-CogniLink

Text Generation • 71B • Updated Jun 21, 2025 • 11 • 3

mradermacher/Llama3.3-70B-CogniLink-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 56

mradermacher/Llama3.3-70B-CogniLink-i1-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 129

JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning • Updated Jan 22, 2025

JHuel/Mistral-Nemo-Instruct-2407_ORPO

Text Generation • Updated Jan 22, 2025

Ihor/Text2Graph-R1-Qwen2.5-0.5b

Text Generation • 0.5B • Updated Aug 18, 2025 • 2.26k • 24

tecosys/Nutaan-RL1

Reinforcement Learning • Updated Feb 7, 2025

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF

0.5B • Updated Aug 18, 2025 • 146 • 1

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF

0.5B • Updated Aug 18, 2025 • 514 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF

0.5B • Updated Feb 22, 2025 • 74

Daemontatox/Zireal-0

Text Generation • 684B • Updated Jul 1, 2025 • 71 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF

0.5B • Updated Jul 31, 2025 • 57

Lyte/QuadConnect2.5-0.5B-v0.0.9b

Text Generation • 0.5B • Updated Feb 27, 2025 • 18

mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF

0.5B • Updated Jul 31, 2025 • 85

Lyte/QuadConnect2.5-1.5B-v0.1.0b

Text Generation • 2B • Updated Feb 28, 2025 • 44 • 1