You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

RAI-3.0-R1-VECTOR

License

Model Overview

RAI-3.0-R1-VECTOR is a task-vector merged model created using the following formula:

DeepSeek-R1-0528 + (RakutenAI-3.0 - DeepSeek-V3-0324)

This architecture combines the advanced reasoning capabilities of DeepSeek-R1-0528 with the Japanese language expertise of RakutenAI-3.0, while subtracting the base DeepSeek-V3-0324 to isolate task-specific improvements.

Key Features

  • Enhanced Reasoning: Inherits DeepSeek-R1's improved depth of reasoning (average 23K tokens per complex query).
  • Japanese Optimization: Retains RakutenAI-3.0's proficiency in Japanese language and cultural context.
  • Reduced Hallucination: Benefits from DeepSeek-R1's reduced hallucination rate.
  • Multilingual Support: Balanced performance in both Japanese and English.

Technical Details

Parameter Value
Base Model DeepSeek-R1-0528
Task Vector Source RakutenAI-3.0 - DeepSeek-V3-0324
Architecture Mixture of Experts (MoE)
Context Length 128K tokens
License Apache-2.0

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Local-Novel-LLM-project/RAI-3.0-R1-VECTOR", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("Local-Novel-LLM-project/RAI-3.0-R1-VECTOR")

inputs = tokenizer("日本の文化で重要な要素は", return_tensors="pt")
outputs = model.generate(**inputs, max_length=100)
print(tokenizer.decode(outputs[0]))

Limitations and Bias

  • May inherit biases from either source model.
  • Performance in non-Japanese/English languages may vary.
  • Always verify critical outputs with human review.

Citation

@misc{RAIR1VECTOR2026,
  title = {RAI-3.0-R1-VECTOR: Task-Vector Merged Model},
  author = {LocalNovelLLM-project},
  year = {2026},
  publisher = {LocalNovelLLM-project},
  url = {https://huggingface.co/Local-Novel-LLM-project/RAI-3.0-R1-VECTOR}
}

Note: This model card was generated by the model itself and subsequently edited.

Downloads last month
-
Safetensors
Model size
685B params
Tensor type
F32
·
BF16
·
F8_E4M3
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support