Which inference service can run Qwen3-Reranker-0.6B now?
#15, opened by wangruiai2023
It seems that Text Embeddings Inference does not support this model. I need an inference backend that can run it efficiently in a production environment. Thanks.
vLLM works.
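
In case it helps, here is a rough client-side sketch of how this could look once a vLLM server is already up with the reranker loaded. It assumes the server listens on `http://localhost:8000` and exposes vLLM's Jina/Cohere-style `/rerank` endpoint; the URL and the helper names (`build_rerank_payload`, `rank_documents`, `rerank`) are made up for illustration. Depending on the vLLM version, Qwen3-Reranker may need extra model configuration at server launch, so check the vLLM docs for your release before relying on this.

```python
import json
import urllib.request

MODEL = "Qwen/Qwen3-Reranker-0.6B"
# Assumption: a vLLM server is already running locally on the default port.
RERANK_URL = "http://localhost:8000/rerank"


def build_rerank_payload(query, documents, model=MODEL):
    """Build the JSON body for a rerank request (model, query, documents)."""
    return {"model": model, "query": query, "documents": list(documents)}


def rank_documents(response, documents):
    """Pair each document with its score, highest score first.

    Assumes the response has a "results" list whose items carry
    "index" (position in the input list) and "relevance_score".
    """
    results = sorted(
        response["results"],
        key=lambda r: r["relevance_score"],
        reverse=True,
    )
    return [(documents[r["index"]], r["relevance_score"]) for r in results]


def rerank(query, documents, url=RERANK_URL):
    """POST the request to the server and return (document, score) pairs."""
    body = json.dumps(build_rerank_payload(query, documents)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=30) as resp:
        return rank_documents(json.load(resp), documents)


# Example call (needs the server to be running):
# rerank("What is the capital of China?",
#        ["The capital of China is Beijing.", "Gravity is a force."])
```

Only the standard library is used here so the sketch stays independent of any client SDK; in production you would probably add retries and batching around `rerank`.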