LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling