Improve model card: Add pipeline tag, library, paper, code links and detailed usage

by nielsr HF Staff - opened 18 days ago

←

nielsr

18 days ago

This PR significantly enhances the model card for the Visurf-7B-Best-on-gRefCOCO model by:

Adding library_name: transformers to enable automated code snippets for the Hugging Face transformers library, as evidenced by the existing usage example and config.json.
Adding pipeline_tag: image-text-to-text for better model discoverability on the Hugging Face Hub, reflecting its nature as a Large Vision-and-Language Model.
Including a link to the paper: ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models.
Adding a link to the official GitHub repository for code and further resources: https://github.com/dvlab-research/ViSurf.
Populating the model card with a comprehensive overview (including the abstract and diagram), detailed installation instructions, inference examples, evaluation, training guidelines, and other relevant information directly from the project's GitHub README. This provides a rich and user-friendly documentation for the model.

Please review these additions and merge this PR.

Ricky06662 changed pull request status to merged 18 days ago

Owner 18 days ago

thanks very much

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment