Improve model card: Add library_name, paper/code links, and usage example
#1 opened by nielsr (HF Staff)
This PR improves the model card for Qwen3-14B-Intuitor-MATH-1EPOCH by:
- Adding the `library_name: transformers` tag to the metadata, which enables the interactive "How to use" widget on the Hugging Face Hub.
- Including a direct link to the associated research paper (Learning to Reason without External Rewards) and the GitHub repository (https://github.com/sunblaze-ucb/Intuitor) for easy access to source materials.
- Enhancing the model description with details from the paper's abstract about the "Intuitor" method and "Reinforcement Learning from Internal Feedback (RLIF)".
- Providing a clear sample usage code snippet to help users quickly get started with the model using the `transformers` library.
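For reference, the metadata change in the first bullet amounts to one key in the YAML front matter at the top of the model card's README.md. The fragment below is a minimal sketch showing only that key; any other metadata fields in the actual card are not listed in this description and are left out here rather than guessed.

```yaml
---
# Model card front matter (top of README.md).
# Only the key named in this PR is shown; other keys in the real card are omitted.
library_name: transformers
---
```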
Please review and merge this PR if everything looks good.
Xuandong changed pull request status to merged