Active filters: dpo
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
7B
•
Updated
•
8.24k
•
121
TheBloke/SauerkrautLM-Mixtral-8x7B-GGUF
Text Generation
•
47B
•
Updated
•
946
•
9
mlabonne/NeuralBeagle14-7B-GGUF
7B
•
Updated
•
292
•
47
argilla/CapybaraHermes-2.5-Mistral-7B
7B
•
Updated
•
18
•
70
BramVanroy/fietje-2-chat
Text Generation
•
3B
•
Updated
•
1.42k
•
6
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation
•
8B
•
Updated
•
15.1k
•
220
QuantFactory/NeuralDaredevil-8B-abliterated-GGUF
Text Generation
•
8B
•
Updated
•
4.14k
•
70
mlabonne/TwinLlama-3.1-8B-DPO
Text Generation
•
8B
•
Updated
•
643
•
15
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
Text Generation
•
12B
•
Updated
•
85
•
18
SmallDoge/Doge-320M-Instruct
Question Answering
•
0.3B
•
Updated
•
47
•
4
darkc0de/Xortron2025
Text Generation
•
24B
•
Updated
•
1.7k
•
20
emretmrk/smolvlm-trl-dpo
ntkhoi/Qwen3-4B-Medical-DPO-0803
Text Generation
•
4B
•
Updated
•
10
•
1
mradermacher/Qwen3-4B-Medical-DPO-0803-GGUF
4B
•
Updated
•
282
•
1
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter1-4k
Text Generation
•
0.0B
•
Updated
•
14
•
1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter1-4k-GGUF
15B
•
Updated
•
256
•
1
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter2-4k
Text Generation
•
0.0B
•
Updated
•
3
•
1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter2-4k-GGUF
15B
•
Updated
•
229
•
1
AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-spin-iter1-RPO
Text Generation
•
0.0B
•
Updated
•
13
•
1
mradermacher/Qwen2.5-14B-Instruct-ultrafeedback-spin-iter1-RPO-GGUF
15B
•
Updated
•
232
•
1
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
14
•
12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
•
13B
•
Updated
•
812
•
19
lewtun/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
9
alignment-handbook/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
70
•
3
alignment-handbook/zephyr-7b-dpo-qlora
Updated
•
18
•
9
argilla/notus-7b-v1-lora
Text Generation
•
7B
•
Updated
•
7
•
7
argilla/notus-7b-v1-lora-adapter
Text Generation
•
Updated
•
3
argilla/notus-7b-v1
Text Generation
•
7B
•
Updated
•
151
•
122
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
1B
•
Updated
•
9