Steven Goldfeather
treehugg3
AI & ML interests
Messing with LLM weights, LLM alignment techniques
Organizations
None yet
Wur doomed!
566
#14 opened 10 months ago
by
jukofyork
Acess Request
➕
9
3
#3 opened 3 months ago
by
muchanem
What was the underlying training data distribution?
👍
1
6
#2 opened 3 months ago
by
treehugg3
ByteDance-Seed/Seed-OSS-36B-Instruct
10
#1303 opened 3 months ago
by
Poro7
https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base-woSyn
1
#1317 opened 3 months ago
by
treehugg3
What quantization level does this model use?
1
#2 opened 4 months ago
by
ernestr
license may be too restrictive for its purposes
1
#2 opened 3 months ago
by
huu-ontocord
Which parts of the dataset are synthetic?
#1 opened 3 months ago
by
treehugg3
Add library_name and link to code
3
#1 opened 5 months ago
by
nielsr
Is there public code available to replicate this dataset?
3
#1 opened 3 months ago
by
treehugg3
Any plans for 32B/70B distilled models?
🚀
8
5
#83 opened 6 months ago
by
NanaBanana22
https://huggingface.co/Undi95/dbrx-base
13
#1268 opened 3 months ago
by
treehugg3
Why did you make this model so good at math, an already uncensored topic?
1
#4 opened 3 months ago
by
treehugg3
GGUF quantized model required
9
#1 opened 3 months ago
by
treehugg3
Not a base model? Empty-prompting results in instructions
#1 opened 3 months ago
by
treehugg3
Please, someone distill this model!
👍
1
#53 opened 3 months ago
by
treehugg3
config.json embedding size of "vocab_size": 100352 does not match 100277
1
#6 opened 3 months ago
by
treehugg3
Theory behind this merge model?
#1 opened 3 months ago
by
treehugg3
Instruct Models Release Date?
3
#4 opened 5 months ago
by
mariamavagyan