Has anyone tried running this model with common inference frameworks, such as vLLM or SGLang?
Yes, please refer to the README on GitHub.