·
AI & ML interests
model compression
Organizations
None yet
ChenMnZ/Llama-2-70b-BlockAP-w4g128
Text Generation
• 10B • Updated
• 3
ChenMnZ/Llama-2-70b-BlockAP-w3g128
Text Generation
• 8B • Updated
• 3
ChenMnZ/Llama-2-70b-BlockAP-w2g64
Text Generation
• 6B • Updated
• 3
ChenMnZ/Llama-2-70b-BlockAP-w2g128
Text Generation
• 5B • Updated
• 1
ChenMnZ/Llama-2-13b-BlockAP-w4g128
Text Generation
• 2B • Updated
• 3
ChenMnZ/Llama-2-13b-BlockAP-w3g128
Text Generation
• 2B • Updated
• 1
ChenMnZ/Llama-2-13b-BlockAP-w2g64
Text Generation
• 1B • Updated
• 1
ChenMnZ/Llama-2-13b-BlockAP-w2g128
Text Generation
• 1B • Updated
• 1
ChenMnZ/Mixtral-8x7B-Instruct-v0.1-OmniQuantv1-w4a16g128
Text Generation
• 6B • Updated
• 7
• 1
ChenMnZ/Mixtral-8x7B-v0.1-OmniQuantv2-w4a16g128
Text Generation
• 6B • Updated
• 8
• 1
ChenMnZ/Mixtral-8x7B-v0.1-OmniQuantv1-w4a16g128
Text Generation
• 6B • Updated
• 7
ChenMnZ/Llama-2-13b-chat-omniquant-w3a16g128asym
Updated
ChenMnZ/Llama-2-7b-chat-omniquant-w3a16g128asym
Updated
ChenMnZ/falcon-180b-omniquant-w3a16g512
Text Generation
• Updated
• 6
• 3
ChenMnZ/falcon-7b-omniquant-w3a16g64
Text Generation
• Updated
• 3
ChenMnZ/Llama-2-13b-chat-omniquant-w2a16g128asym_2
Updated
ChenMnZ/Llama-2-13b-chat-omniquant-w3a16g128asym_2
Updated
ChenMnZ/Llama-2-7b-chat-omniquant-w3a16g128asym_2
Updated
ChenMnZ/llama-13b-q3f16_1
Updated
ChenMnZ/llama-13b-omni-w2a16g128-q2f16_4
Updated
ChenMnZ/llama_2_7b_chat-q4f16_1_2
Updated
ChenMnZ/llama_2_7b_w3a16g40sym-q3f16_1_2
Updated
ChenMnZ/llama_2_7b_w3a16g40sym-q3f16_1
Updated
ChenMnZ/llama_2_7b_chat-q4f16_1
Updated
ChenMnZ/vicuna-7b-v1.3-q3f16_2
Updated
ChenMnZ/vicuna-7b-v1.3-q3f16_1
Updated
ChenMnZ/vicuna-7b-v1.3-q4f16_1
Updated