dyingc
AI & ML interests
None yet
Organizations
None yet
dyingc/qwen2_5_3b_grpo_gsm8k
dyingc/bert-base-uncased_with_RCE
Feature Extraction
•
Updated
•
6
dyingc/Mistral-7B-instruction-finetuned
Text Generation
•
4B
•
Updated
•
13
dyingc/Mistral-7B-instruction-LoRA
Updated
dyingc/Mistral-CatMacaroni-slerp-uncensored-7B.q8_0.gguf
7B
•
Updated
•
11
•
1
dyingc/Llama-2-7b-chat-hf-bitsandbytes
Text Generation
•
4B
•
Updated
•
6
dyingc/Llama-2-7b-chat-hf-quant8
Text Generation
•
2B
•
Updated
•
5
dyingc/LlamaGuard-7b-quant
Text Generation
•
1B
•
Updated
•
2
dyingc/Llama-2-7b-chat-hf-quant
Text Generation
•
1B
•
Updated
•
4
dyingc/Llama-2-7b-chat-hf-q8
Text Generation
•
2B
•
Updated
•
3
dyingc/llama-2-7b-chat-hf-q4
Text Generation
•
1B
•
Updated
•
3
dyingc/bert-base-uncased-with-custom-code
Fill-Mask
•
Updated
•
2
dyingc/dolly-lora-ddp-test
Updated
dyingc/dolly-lora-ddp
Updated
dyingc/dolly-lora
Updated
dyingc/alpaca7B-lora-vastai
Updated
dyingc/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
1
dyingc/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
1
dyingc/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
dyingc/ppo-Pyramids
Reinforcement Learning
•
Updated
•
5
dyingc/ppo-SnowballTarget
Reinforcement Learning
•
Updated
dyingc/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
dyingc/Reinforce-policy-gradient
Reinforcement Learning
•
Updated
dyingc/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
dyingc/Taxi-v3
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-8x8-Slippery
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-8x8-noSlippery
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
dyingc/ppo-Huggy
Reinforcement Learning
•
Updated
dyingc/LunarLander-v2
Reinforcement Learning
•
Updated