sravanthi pulijala
sravanthib
AI & ML interests
None yet
Recent Activity
updated a model 13 days ago: sravanthib/llama-testing-wf
published a model 16 days ago: sravanthib/llama-testing-wf
updated a model 18 days ago: sravanthib/llama-3-2-1b-lora
Organizations
None yet
sravanthib's models (167)
sravanthib/Base-Qwen-7B-GRPO • Updated Mar 23
sravanthib/llama-toolcall • Updated Mar 21
sravanthib/non-math-Simple-RL • 8B • Updated Mar 20 • 5
sravanthib/qwen-base-RL • Updated Mar 20
sravanthib/Qwen-base-open-RL • Updated Mar 20
sravanthib/tool_llama_test • Updated Mar 20
sravanthib/qwen-72b-base • Updated Mar 19
sravanthib/with_accelarate_output_Qwen2-0.5B-GRPO-test • Updated Mar 19
sravanthib/Qwen-GRPO • Updated Mar 17
sravanthib/new-Qwen-2.5-7b-non-math-Simple-RL • Updated Mar 16
sravanthib/Qwen-2.5-7b-non-math-Simple-RL • 8B • Updated Mar 16 • 5
sravanthib/Llama-Simple-RL • 8B • Updated Mar 16 • 5
sravanthib/Last-Llama-Simple-RL • Updated Mar 15
sravanthib/llama3-8b-math-solver • Updated Mar 15
sravanthib/Last-Qwen-2.5-7B-Simple-RL • 8B • Updated Mar 15 • 5
sravanthib/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated Mar 15 • 5
sravanthib/Qwen-math-open-RL • Updated Mar 14
sravanthib/Qwen-math-Simple-RL • Updated Mar 14
sravanthib/qwen-32b-multinode-try • Updated Mar 13
sravanthib/new-multinode-try • Updated Mar 13
sravanthib/multinode-try • Updated Mar 13
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test • Updated Mar 13
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL • Updated Mar 13
sravanthib/single_node_llama_custom-code-test • Updated Mar 12
sravanthib/Final-try-Llama3.1-8b-instruct-RL • Text Generation • 8B • Updated Mar 11 • 8
sravanthib/grpo-output • Updated Mar 11
sravanthib/Simple-RL • Updated Mar 11
sravanthib/SFT_and_RL_final-Simple-RL • Updated Mar 10
sravanthib/llama-3b-Simple-RL • Updated Mar 10
sravanthib/RL_on_SFT • Updated Mar 10