Zeze Nene
Neman
AI & ML interests
LLM, evolutionary programming, AI
Recent Activity
liked a model 3 days ago
facebook/sam2-hiera-large liked a model 19 days ago
unsloth/Qwen3.5-0.8B-GGUF liked a model 19 days ago
unsloth/Qwen3.5-9B-GGUFOrganizations
None yet
Run locally with 24 GB VRAM some GPU's, gradio script sharing and suggestions.
đ 3
3
#25 opened 7 months ago
by
NovaYear
mmproj
đ 10
5
#1 opened 8 months ago
by
Neman
MIT license?
1
#1 opened 9 months ago
by
Neman
Distill
đ„đ 3
5
#17 opened 10 months ago
by
Neman
Problem with demo code using pipeline
1
#2 opened about 1 year ago
by
Neman
unknown pre-tokenizer type: 'deepseek-r1-qwen'
đ„ 1
7
#1 opened about 1 year ago
by
Neman
unknown pre-tokenizer type: 'deepseek-r1-qwen'
đ 4
2
#1 opened about 1 year ago
by
Neman
safetensors size
4
#1 opened about 1 year ago
by
Neman
What ViT?
2
#2 opened almost 2 years ago
by
Neman
4-bit quant?
2
#3 opened about 2 years ago
by
Neman
Base or Chat?
2
#1 opened about 2 years ago
by
Neman
NameError: name 'flash_attn_func' is not defined
2
#4 opened about 2 years ago
by
Neman
'QWenTokenizer' object has no attribute 'IMAGE_ST'
4
#1 opened over 2 years ago
by
Neman
Will it come?
21
#2 opened over 2 years ago
by
Neman
ImportError: cannot import name 'SeamlessM4TModel' from 'transformers'
3
#13 opened over 2 years ago
by
Neman
Question What are the results for image captioning for fuyu-8b in comparison to other models?
đ 1
1
#8 opened over 2 years ago
by
Said2k
What are the memory requirements for running the model?
9
#6 opened over 2 years ago
by
joanfihu
gguf variant?
1
#1 opened over 2 years ago
by
scrawnyether