sravanthi pulijala's picture

sravanthi pulijala

sravanthib

AI & ML interests

None yet

Recent Activity

updated a model 13 days ago

sravanthib/llama-testing-wf

published a model 16 days ago

sravanthib/llama-testing-wf

updated a model 18 days ago

sravanthib/llama-3-2-1b-lora

View all activity

Organizations

None yet

sravanthib 's models 167

sravanthib/Base-Qwen-7B-GRPO

sravanthib/llama-toolcall

sravanthib/non-math-Simple-RL

8B • Updated Mar 20 • 5

sravanthib/qwen-base-RL

sravanthib/Qwen-base-open-RL

sravanthib/tool_llama_test

sravanthib/qwen-72b-base

sravanthib/with_accelarate_output_Qwen2-0.5B-GRPO-test

sravanthib/Qwen-GRPO

sravanthib/new-Qwen-2.5-7b-non-math-Simple-RL

sravanthib/Qwen-2.5-7b-non-math-Simple-RL

8B • Updated Mar 16 • 5

sravanthib/Llama-Simple-RL

8B • Updated Mar 16 • 5

sravanthib/Last-Llama-Simple-RL

sravanthib/llama3-8b-math-solver

sravanthib/Last-Qwen-2.5-7B-Simple-RL

8B • Updated Mar 15 • 5

sravanthib/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Mar 15 • 5

sravanthib/Qwen-math-open-RL

sravanthib/Qwen-math-Simple-RL

sravanthib/qwen-32b-multinode-try

sravanthib/new-multinode-try

sravanthib/multinode-try

sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test

sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL

sravanthib/single_node_llama_custom-code-test

sravanthib/Final-try-Llama3.1-8b-instruct-RL

Text Generation • 8B • Updated Mar 11 • 8

sravanthib/grpo-output

sravanthib/Simple-RL

sravanthib/SFT_and_RL_final-Simple-RL

sravanthib/llama-3b-Simple-RL

sravanthib/RL_on_SFT