Improve model card: Add library_name, paper/code links, and usage example

by nielsr HF Staff - opened Aug 12

←

nielsr

Aug 12

This PR improves the model card for Qwen3-14B-Intuitor-MATH-1EPOCH by:

Adding the library_name: transformers tag to the metadata, which enables the interactive "How to use" widget on the Hugging Face Hub.
Including a direct link to the associated research paper (Learning to Reason without External Rewards) and the GitHub repository (https://github.com/sunblaze-ucb/Intuitor) for easy access to source materials.
Enhancing the model description with details from the paper's abstract about the "Intuitor" method and "Reinforcement Learning from Internal Feedback (RLIF)".
Providing a clear sample usage code snippet to help users quickly get started with the model using the transformers library.

Please review and merge this PR if everything looks good.

Xuandong changed pull request status to merged Aug 13

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment