Thanks for this, it's really awesome. Would it be possible to fine-tune this model to listen for a particular sound (like a frog call)? I have done this with the wav2vec model and had fairly good results but always looking to improve.
Cheers,
Liam
liam.bolitho@gmail.com