Instructions for using tinycompany/BiBo-v0.5 with libraries, notebooks, and local apps. The sections below cover each option.
- Libraries
- Transformers
How to use tinycompany/BiBo-v0.5 with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="tinycompany/BiBo-v0.5")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
```

```python
# Load the model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("tinycompany/BiBo-v0.5")
model = AutoModelForCausalLM.from_pretrained("tinycompany/BiBo-v0.5")

messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
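If you want tokens printed as they are generated rather than returned all at once, here is a minimal streaming sketch building on the direct-load snippet above. `TextStreamer` is part of Transformers; the token budget and sampling settings are illustrative assumptions, not values recommended by the model authors.

```python
# Stream decoded tokens to stdout as they are generated.
# Assumes `tokenizer`, `model`, and `inputs` from the direct-load example above.
from transformers import TextStreamer

streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
model.generate(
    **inputs,
    streamer=streamer,   # prints text incrementally instead of returning it
    max_new_tokens=256,  # assumed budget; adjust as needed
    do_sample=True,
    temperature=0.7,     # illustrative sampling settings
    top_p=0.9,
)
```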
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use tinycompany/BiBo-v0.5 with vLLM:
Install from pip and serve the model:
```shell
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "tinycompany/BiBo-v0.5"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "tinycompany/BiBo-v0.5",
        "messages": [
            {"role": "user", "content": "What is the capital of France?"}
        ]
    }'
```
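Because the server speaks the OpenAI-compatible API, you can also call it from Python. A minimal sketch using the `openai` client; the `base_url` assumes the default local vLLM address from above, and the placeholder `api_key` is ignored unless you configured one on the server.

```python
# Query the local vLLM server through its OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # default vLLM address from above
    api_key="EMPTY",                      # placeholder; no key configured
)

response = client.chat.completions.create(
    model="tinycompany/BiBo-v0.5",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(response.choices[0].message.content)
```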
- SGLang
How to use tinycompany/BiBo-v0.5 with SGLang:
Install from pip and serve the model:
```shell
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "tinycompany/BiBo-v0.5" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "tinycompany/BiBo-v0.5",
        "messages": [
            {"role": "user", "content": "What is the capital of France?"}
        ]
    }'
```

Use Docker images:
```shell
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
    --model-path "tinycompany/BiBo-v0.5" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "tinycompany/BiBo-v0.5",
        "messages": [
            {"role": "user", "content": "What is the capital of France?"}
        ]
    }'
```
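The same endpoint can also be exercised from Python. A small sketch mirroring the curl call with the `requests` library, assuming the SGLang server above is reachable on localhost:30000:

```python
# POST the same chat request to the SGLang server from Python.
import requests

payload = {
    "model": "tinycompany/BiBo-v0.5",
    "messages": [
        {"role": "user", "content": "What is the capital of France?"}
    ],
}
resp = requests.post(
    "http://localhost:30000/v1/chat/completions",  # server started above
    json=payload,
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```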
- Docker Model Runner
How to use tinycompany/BiBo-v0.5 with Docker Model Runner:
```shell
docker model run hf.co/tinycompany/BiBo-v0.5
```
Evaluation results:

| Dataset | Metric | Score |
|---|---|---|
| flores hin -> eng | BLEU | 12.8242 |
| | chrF | 28.8190 |
| | chrF2 | 27.0066 |
| | BLEURT | 0.3259 |
| flores eng -> hin | BLEU | 9.3622 |
| | chrF | 22.6860 |
| | chrF2 | 21.6604 |
| | BLEURT | 0.3374 |
| indicwikibio | ROUGE-1 | 0.0444 |
| | ROUGE-2 | 0.0088 |
| | ROUGE-L | 0.0418 |
| | BLEURT | 0.2538 |
| indiccopa | Accuracy | 0.7372 |
| | Precision | 0.6857 |
| | Recall | 0.8649 |
| | F1 | 0.7649 |
| indicsentiment | Accuracy | 0.8707 |
| | Precision | 0.7932 |
| | Recall | 0.9980 |
| | F1 | 0.8839 |
| indicxparaphrase | Accuracy | 0.6029 |
| | Precision | 0.9905 |
| | Recall | 0.2078 |
| | F1 | 0.3435 |
| indicxnli | Accuracy | 0.3715 |
| | Precision | 0.7205 |
| | Recall | 0.3715 |
| | F1 | 0.2437 |
| indicqa | Exact match | 3.8785 |
| | F1 | 9.4917 |
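For context on the translation rows, corpus-level BLEU and chrF scores like those above are typically computed with `sacrebleu`. A hedged sketch follows; the hypothesis and reference sentences are placeholders, and the metric settings are illustrative assumptions, since the exact evaluation configuration is not specified here.

```python
# Illustrative computation of corpus-level BLEU and chrF with sacrebleu.
# The sentences below are placeholders, not data from the evaluation above.
import sacrebleu

hypotheses = ["The cat sits on the mat."]          # model outputs
references = [["The cat is sitting on the mat."]]  # one list per reference set

bleu = sacrebleu.corpus_bleu(hypotheses, references)
chrf = sacrebleu.corpus_chrf(hypotheses, references)
print(f"BLEU: {bleu.score:.4f}, chrF: {chrf.score:.4f}")
```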