havok2
/

Kartoffelbox-v0.1_0.65h2

Model card Files Files and versions Community

Kartoffelbox-v0.1_0.65h2 / README.md

havok2's picture

Update README.md

80ac382 verified 2 months ago

|

history blame contribute delete

1.77 kB

	---
	license: apache-2.0
	language: de
	library_name: transformers
	tags:
	- text-to-speech
	- tts
	- german
	- chatterbox
	- voice-cloning
	- zero-shot
	- merged-model
	---

	# Kartoffelbox-v0.1_0.65h2: A Merged German Chatterbox-TTS Model

	## Model Description

	This repository contains an experimental, standalone German Text-to-Speech model based on the [Chatterbox](https://github.com/resemble-ai/chatterbox) framework.

	This model is a hybrid created by merging two fine-tuned models:
	1. The well-known German TTS "patch" [SebastianBodza/Kartoffelbox-v0.1](https://huggingface.co/SebastianBodza/Kartoffelbox-v0.1).
	2. A custom model extensively fine-tuned on a large, diverse dataset of German voices (~12.000 samples).

	The goal was to create a robust, general-purpose German TTS model by combining the natural prosody of `Kartoffelbox` with a model trained on a wide variety of voices and data types. The final weights are a 65/35 merge, favoring the custom-trained, multi-speaker model. Unlike patch-based models, this is a complete, self-contained model that can be loaded directly.

	Key Features:
	- Language: German
	- Type: Standalone, Multi-Speaker, Merged Hybrid Model
	- Capabilities: High-quality speech synthesis and Zero-Shot Voice Cloning for variable German voices.
	- Robustness: Specifically trained to handle numbers, dates, and other complex data formats. (which work some times :D)

	## How to Use the Model

	This is a complete model and does not require manual patching. You will need the `chatterbox` library from Resemble AI to run it.

	1. Installation

	```bash
	# Clone the official Chatterbox repository and install its dependencies
	git clone https://github.com/resemble-ai/chatterbox.git
	cd chatterbox
	pip install -e .