Models

4,638

Active filters: alignment-handbook

YYYYYYibo/gshf_ours_1_iter_2

7B • Updated Sep 9, 2024 • 5

Triangle104/NuminaMath-7B-TIR-Q4_K_M-GGUF

Text Generation • 7B • Updated Sep 9, 2024

Triangle104/NuminaMath-7B-TIR-Q5_0-GGUF

Text Generation • 7B • Updated Sep 9, 2024

Triangle104/NuminaMath-7B-TIR-Q6_K-GGUF

Text Generation • 7B • Updated Sep 9, 2024

Triangle104/NuminaMath-7B-TIR-Q8_0-GGUF

Text Generation • 7B • Updated Sep 9, 2024 • 1

simonycl/llama-3.1-8b-instruct-armorm-iter0

Text Generation • 8B • Updated Sep 9, 2024 • 10

Magpie-Align/MagpieLM-4B-Chat-v0.1

Text Generation • 5B • Updated Dec 9, 2024 • 9 • 20

terry69/preference_p0.1_seed42_level2_stylemix

7B • Updated Sep 9, 2024 • 5

terry69/feedback_p0.1_seed42_level2_raremix

7B • Updated Sep 9, 2024 • 5

Jimmy19991222/llama-3-8b-instruct-gapo-v2-bleu-beta10-gamma0.3-lr1.0e-6-he_scale-rerun

Text Generation • 8B • Updated Sep 9, 2024 • 5

Jimmy19991222/llama-3-8b-instruct-gapo-v2-jaccard_score-beta10-gamma0.3-lr1.0e-6-he_scale-rerun

Text Generation • 8B • Updated Sep 9, 2024 • 5

Jimmy19991222/llama-3-8b-instruct-gapo-v2-rouge1-beta10-gamma0.3-lr1.0e-6-he_scale-rerun

Text Generation • 8B • Updated Sep 9, 2024 • 7

YYYYYYibo/gshf_ours_1_iter_3

7B • Updated Sep 9, 2024 • 4

Jimmy19991222/llama-3-8b-instruct-gapo-v2-rouge2-beta10-gamma0.3-lr1.0e-6-he_scale-rerun

Text Generation • 8B • Updated Sep 9, 2024 • 6

XiaoY1/Qwen2-7B-Instruct-DPO-code-beta0.5

Updated Sep 9, 2024 • 5

XiaoY1/Qwen2-7B-Instruct-DPO-math-beta0.5

Updated Sep 9, 2024 • 4

XiaoY1/Qwen2-7B-Instruct-DPO-novel-beta0.5

Updated Sep 9, 2024 • 5

terry69/feedback_p0.1_seed42_level3_raremix

7B • Updated Sep 11, 2024 • 5

CharlesLi/OpenELM-1_1B-DPO-full-least-similar

Text Generation • 1B • Updated Oct 3, 2024 • 4

simonycl/llama-3-8b-instruct-metamath-armorm

Text Generation • 8B • Updated Sep 9, 2024 • 5

simonycl/llama-3-8b-instruct-metamath-single-judge

Text Generation • 8B • Updated Sep 9, 2024 • 5

taicheng/zephyr-7b-dpo-qlora

Updated Sep 13, 2024 • 8

terry69/preference_p0.2_seed42_level2_raremix

7B • Updated Sep 10, 2024 • 5

simonycl/llama-3.1-8b-instruct-armorm-iter1

Text Generation • 8B • Updated Sep 9, 2024 • 7

CharlesLi/OpenELM-1_1B-DPO-full-max-reward-least-similar

Text Generation • 1B • Updated Oct 3, 2024 • 5

simonycl/llama-3-8b-instruct-metamath-agg-judge

Text Generation • 8B • Updated Sep 10, 2024 • 5

simonycl/llama-3.1-8b-instruct-armorm-judge-iter2

Text Generation • 8B • Updated Sep 10, 2024 • 6

CharlesLi/OpenELM-1_1B-DPO-full-max-reward-most-similar

Text Generation • 1B • Updated Oct 3, 2024 • 3

CharlesLi/OpenELM-1_1B-DPO-full-most-similar

Text Generation • 1B • Updated Oct 3, 2024 • 5

terry69/feedback_p0.2_seed42_level2_raremix

7B • Updated Sep 10, 2024 • 5