Dataset and Model for paper: "VeriReason: Reinforcement Learning with Testbench
Feedback for Reasoning-Enhanced Verilog Generation"
- Nellyw888/VeriReason-codeLlama-7b-RTLCoder-Verilog-GRPO-reasoning-tb
  Reinforcement Learning • 7B • 76 • 2
- Nellyw888/VeriReason-Qwen2.5-3b-RTLCoder-Verilog-GRPO-reasoning-tb
  Reinforcement Learning • 3B • 12
- Nellyw888/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb
  Reinforcement Learning • 2B • 9 • 1
- Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb
  Reinforcement Learning • 8B • 1.15k • 4