Qwen3-1.7B_RLHF_SFT_full

This model is created by merging:

  • Base model: ntthuyvy73/Qwen3-1.7B-base-CPT-DTC-full
  • LoRA adapter: ntthuyvy73/Qwen3-1.7B_RLHF_SFT

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("./Qwen3-1.7B_RLHF_SFT_full", trust_remote_code=True) tokenizer = AutoTokenizer.from_pretrained("./Qwen3-1.7B_RLHF_SFT_full", trust_remote_code=True)

Downloads last month
27
Safetensors
Model size
2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ntthuyvy73/Qwen3-1.7B_RLHF_SFT_full

Finetuned
(1)
this model