README.md · MALIBA-AI/bambara-tts at main

bambara-tts / README.md

sudoping01

Update README.md

41d95fc verified 11 days ago

preview code

raw

history blame contribute delete

5.6 kB

	---
	library_name: transformers
	base_model: SparkAudio/Spark-TTS-0.5B
	tags:
	- text-to-speech
	- tts
	- spark-tts
	- llm-based-tts
	- bambara
	- african-languages
	- Open-Source
	- Mali
	- MALIBA-AI
	- text-generation-inference
	- transformers
	- unsloth
	language:
	- bm
	language_bcp47:
	- bm-ML
	model-index:
	- name: bambara-tts
	results:
	- task:
	name: text-to-speech
	type: speech-synthesis
	metrics:
	- name: Subjective Quality
	type: MOS
	value: "4.2/5.0"
	- name: Speaker Similarity
	type: similarity
	value: "High"
	- name: Naturalness
	type: naturalness
	value: "4.1/5.0"
	pipeline_tag: text-to-speech
	license: cc-by-nc-sa-4.0
	---





	# MALIBA-AI Bambara TTS 🇲🇱

	<style>
	img {
	display: inline;
	}
	</style>

	[![Model architecture](https://img.shields.io/badge/Model_Arch-Spark--TTS-lightgrey)](#model-architecture)
	\| [![Model size](https://img.shields.io/badge/Params-500M-lightgrey)](#model-architecture)
	\| [![Language](https://img.shields.io/badge/Language-bm-lightgrey)](#datasets)
	\| [![License](https://img.shields.io/badge/License-CC--BY--NC--SA--4.0-blue)](#license)

	## Model Overview

	This model provides neural text-to-speech synthesis for Bambara (Bamanankan), the most widely spoken language in Mali. The model supports 10 authentic Bambara speakers and produces high-fidelity audio without requiring separate vocoder models. It serves over 14 million Bambara speakers across West Africa with native-level pronunciation and cultural authenticity.

	- Try our live demo on [Hugging Face Spaces](https://huggingface.co/spaces/MALIBA-AI/BambaraText2Speech)
	- Available Speakers: Adama, Moussa, Bourama, Modibo, Seydou, Amadou, Bakary, Ngolo, Ibrahima, Amara


	## Quick Start

	### Installation


	```bash
	pip install maliba-ai==1.1.1b0
	```

	For development installations:

	```bash
	pip install git+https://github.com/MALIBA-AI/bambara-tts.git
	```
	with uv (faster)

	```bash
	uv pip install maliba-ai==1.1.1b0
	```

	```bash
	uv pip install git+https://github.com/MALIBA-AI/bambara-tts.git
	```
	Note : if you are in colab please install those additional dependencies :

	```
	!pip install --no-deps bitsandbytes accelerate xformers==0.0.29.post3 peft trl triton cut_cross_entropy unsloth_zoo
	!pip install sentencepiece protobuf huggingface_hub hf_transfer
	!pip install --no-deps unsloth
	```

	### Basic Usage

	```python
	from maliba_ai.tts.inference import BambaraTTSInference
	from maliba_ai.config.settings import Speakers

	tts = BambaraTTSInference()

	text = "Aw ni ce. I ka kɛnɛ wa?"
	audio = tts.generate_speech(text=text, speaker_id=Speakers.Bourama, output_path="greeting.wav")

	```

	Note: More detail : https://github.com/sudoping01/bambara-tts/blob/main/README.md

	## Technical Specifications

	### Architecture
	- Base Model: Spark-TTS (LLM-based TTS)
	- Foundation: Qwen2.5-based language model
	- Parameters: ~500M
	- Audio Format: 16kHz, 16-bit PCM mono
	- Language Support: Bambara (bm-ML)


	## Model Input/Output

	### Input
	- Text: Bambara text in standard orthography
	- Speaker ID: Choice of 10 available speakers
	- Parameters: Temperature, top-k, top-p (optional)

	### Output
	- Audio: 16kHz mono WAV format
	- Quality: Professional-grade speech synthesis

	## ⚠️ Known Limitations

	### Language Mixing
	- Issue: Poor performance with French-Bambara code-switching
	- Recommendation: Use pure Bambara text for optimal results

	### Numeric Content
	- Issue: Suboptimal handling of Arabic numerals (1, 2, 3...)
	- Recommendation: Convert numbers to written Bambara words

	## ⚠️ Disclaimer

	This model provides high-fidelity Bambara speech synthesis intended for research, education, and community applications. The following uses are strictly forbidden:

	- Voice Impersonation: Do not clone voices without explicit consent
	- Deceptive Content: Do not generate misleading or fraudulent audio
	- Illegal Activities: Do not use for any unlawful purposes

	By using this model, you agree to uphold ethical standards and legal responsibilities. We are not responsible for any misuse and firmly oppose unethical usage of this technology.

	If you have concerns about potential misuse or need guidance on ethical applications, please contact us at ml.maliba.ai@gmail.com

	## Impact & Mission

	Part of MALIBA-AI's mission: "No Malian Left Behind by Technological Advances"

	- 14+ Million Speakers: Serving Bambara speakers across West Africa
	- Digital Inclusion: Breaking language barriers in technology
	- Cultural Preservation: Supporting Mali's linguistic heritage
	- Community Empowerment: Enabling local innovation and development


	## License

	CC BY-NC-SA 4.0 - Non-commercial use only due to Spark-TTS base model licensing.

	### Key Terms
	- ✅ Research, education, and personal use
	- ✅ Attribution required
	- ✅ Share-alike derivatives
	- ❌ Commercial use without license

	For commercial licensing: ml.maliba.ai@gmail.com

	## Citation

	```bibtex
	@software{maliba_ai_bambara_tts,
	title={MALIBA-AI Bambara Text-to-Speech: Open-Source High-Quality TTS for Bambara Language},
	author={MALIBA-AI},
	year={2025},
	url={https://huggingface.co/MALIBA-AI/bambara-tts}
	}
	```

	---

	MALIBA-AI: Empowering Mali's Future Through Community-Driven AI Innovation

	"No Malian Language Left Behind"
	---

	Contact Information:
	- Website: [maliba-ai.org](https://maliba-ai.org)
	- Email: ml.maliba.ai@gmail.com
	- GitHub: [MALIBA-AI](https://github.com/MALIBA-AI)
	- HuggingFace: [MALIBA-AI](https://huggingface.co/MALIBA-AI)