This is a reasoning model, trained on a Gemini 2.5 Pro (reasoning) dataset. Because the base model for this fine-tune is the Qwen3-4B-Thinking-2507 variant, expect longer thinking phases before each answer. You may want to reserve this model for more complex conversations or tasks such as coding, math, or logic questions.
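Since the thinking phase can be long, it may help to separate the reasoning from the final answer once the output is decoded. The following is a minimal sketch, assuming this fine-tune keeps the base model's output format, in which the reasoning ends with a `</think>` tag (the opening `<think>` is usually supplied by the chat template and may not appear in the output). The helper name `split_thinking` is hypothetical and not part of any library.

```python
def split_thinking(text: str, close_tag: str = "</think>") -> tuple[str, str]:
    """Split decoded model output into (reasoning, answer).

    Assumption: the reasoning ends with `close_tag`, as in the
    Qwen3 thinking base model. Adjust if this fine-tune differs.
    """
    head, sep, tail = text.partition(close_tag)
    if not sep:
        # No closing tag: generation was probably cut off mid-reasoning,
        # so treat everything as reasoning and return an empty answer.
        return text.strip(), ""
    # Drop an opening tag if the output happens to include one.
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    sample = "Let me add 2 and 2.\n</think>\n\nThe answer is 4."
    reasoning, answer = split_thinking(sample)
    print("Reasoning:", reasoning)
    print("Answer:", answer)
```

If the answer comes back empty, the token budget was likely too small for the thinking phase, and raising the generation length limit is the first thing to try.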
If you want a GGUF version instead of the safetensors files, head over here.