Update usage instructions for Transformers and vLLM

  • Updated usage examples for loading the model with Transformers
  • Updated vLLM usage and added add_special_tokens=True to ensure correct chat formatting (e.g., BOS token)
funmaker changed pull request status to closed

Sign up or log in to comment