Man Cub
mancub
AI & ML interests
None yet
Recent Activity
- new activity about 18 hours ago in LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Wasserstein-GGUF: OOM and context limits reached too soon
- new activity about 20 hours ago in rdtand/Qwen3.6-27B-PrismaQuant-5.5bit-vllm: Unable to run on 3090
- new activity about 23 hours ago in AesSedai/Qwen3.6-35B-A3B-GGUF: Q6_K?
Organizations
None yet
OOM and context limits reached too soon · 1 · #5 opened 1 day ago by mancub
Unable to run on 3090 · 1 · #1 opened about 20 hours ago by mancub
How to split this model between 2 (3) GPUs and CPU/RAM? · 28 · #12 opened about 1 month ago by mancub
My personal vLLM launch cmd on my old personal 2x3090 workstation · 7 · #1 opened about 2 months ago by tclf90
What was just updated and why? · 1 reaction · 2 · #1 opened 19 days ago by mancub
How to use it with llama-server? · 1 reaction · 3 · #1 opened 30 days ago by mancub
Poor performance and pretty lobotomized · 2 · #1 opened about 1 month ago by mancub
Love the license, confused by some of the decisions. · 16 reactions · 15 · #15 opened about 1 month ago by CyborgPaloma
It's really good. · 1 reaction · 26 · #3 opened about 2 months ago by Shuasimodo
Increasing the precision of some of the weights when quantizing · 4 reactions · 57 · #2 opened 2 months ago by Shuasimodo
A draft model with less parameters, for speculative thinking? · 8 · #5 opened 3 months ago by mancub
Jan 21: All GLM-4.7-Flash quants reuploaded - much better outputs! · 🔥❤️ 7 · 29 · #10 opened 3 months ago by danielhanchen
Fast loras · 2 · #8 opened 4 months ago by melmass
Wan-Lighting: 4 steps per model or 4 steps total? · 4 · #59 opened 9 months ago by NielsGx
Can we have a Llama-3.1-8B-Lexi-Uncensored-V2_fp8_scaled.safetensors · 🔥 1 · 12 · #10 opened about 1 year ago by drguolai
Within Seconds? · 7 · #8 opened about 1 year ago by Daemontatox
Is it censored output? · 12 · #2 opened about 1 year ago by KurtcPhotoED
Please work with llama.cpp before releasing new models. · 2 · #10 opened about 1 year ago by bradhutchings
Lack of 33B models? · 1 reaction · 7 · #1 opened over 2 years ago by mancub