A useful tool for anyone who works with audio datasets: https://github.com/RustedBytes/data-viewer-audio
kubectl fairy dust on top, and suddenly it's a high-availability, self-healing, cloud-native, enterprise-grade AI cluster camping in a closet.
Where can I try it?
Added this tool: https://github.com/RustedBytes/audio-parquet-merger
Now with inference in Rust: https://github.com/egorsmkv/kulyk-rust
docker run -p 3000:3000 --rm ghcr.io/egorsmkv/kulyk-rust:latest
Added a model for the reverse direction, from Ukrainian to English: https://huggingface.co/spaces/Yehor/uk-en-translator
Also tested it on an A100 with TensorRT:
https://colab.research.google.com/drive/1-agoo5ll-hWEecWQAtO1FM39sqavJxph?usp=sharing
The results are not that clear-cut, but it works with the base_rfdetr_fp16.onnx model and gives ~10 ms/img
This program does what datasets does. When you push a dataset created by the audiofolder script, datasets creates Parquet data and shards it internally.
So you can use audios-to-dataset instead if you need faster speeds than datasets provides.
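To make the "shards them internally" part more concrete, here is a rough Python sketch of how audio files could be grouped into size-bounded shards before each group is written as one Parquet file. This is not the code of datasets or audios-to-dataset: the helper names plan_shards and shard_filename are hypothetical, the greedy grouping is just one simple strategy, and the file-name pattern is an assumption modeled on the shard names commonly seen on the Hub.

```python
from typing import Iterable, List, Tuple


def plan_shards(files: Iterable[Tuple[str, int]], max_shard_bytes: int) -> List[List[str]]:
    """Greedily group (name, size_in_bytes) pairs into shards.

    Hypothetical helper: a shard is closed as soon as adding the next file
    would push it past max_shard_bytes. A single file larger than the limit
    still gets a shard of its own.
    """
    shards: List[List[str]] = []
    current: List[str] = []
    current_bytes = 0
    for name, size in files:
        # Start a new shard when this file would not fit into the current one.
        if current and current_bytes + size > max_shard_bytes:
            shards.append(current)
            current, current_bytes = [], 0
        current.append(name)
        current_bytes += size
    if current:
        shards.append(current)
    return shards


def shard_filename(index: int, total: int, split: str = "train") -> str:
    """Build a shard file name (assumed naming pattern, not taken from either tool)."""
    return f"{split}-{index:05d}-of-{total:05d}.parquet"


if __name__ == "__main__":
    # Made-up file names and sizes, only to show the grouping.
    files = [("a.wav", 400), ("b.wav", 300), ("c.wav", 500), ("d.wav", 100)]
    shards = plan_shards(files, max_shard_bytes=700)
    for i, shard in enumerate(shards):
        print(shard_filename(i, len(shards)), shard)
```

Each planned group would then be encoded and written to its own Parquet file; that writing step is where a faster implementation can save most of the time.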