Spaces:
Sleeping
Sleeping
# document_translator | |
Project to translate files using BSC's models while keeping the formatting and style of the original file. | |
## Requirements | |
### python 3.12 | |
### fast_align | |
Clone https://github.com/clab/fast_align, run the compilation commands indicated in the project's readme, place fast_align and atools (.exe if using windows) in this project's root. | |
### fast_align fine-tuning files | |
I took the 4 files (ca-en.params, ca-en.err, en-ca.params and en-ca.err) from https://huggingface.co/projecte-aina/aina-translator-ca-en/tree/main. Maybe we could automatize the download of these files. For now, place these files in config_folder (defined in main.py). | |
### python requirements | |
pip install -r requirements.txt | |
### mtuoc_aina_translator | |
To use this class you also need to be running MTUOC's translation server with the proper translation models. There's also no | |
need to use fastalign on that side since the current project already runs it. | |
### salamandrata7b_translator | |
Class that uses huggingface's demo. | |
## Docker | |
docker build -t document-translator . | |
docker run -p 7860:7860 -e HF_TOKEN=your_token_here --rm -it document-translator |