gradio
datasets
semhash
model2vec
huggingface_hub
numpy
tqdm