Steven Goldfeather's picture

Steven Goldfeather

treehugg3

·

AI & ML interests

Messing with LLM weights, LLM alignment techniques

Organizations

None yet

New activity in jukofyork/creative-writing-control-vectors-v3.0 2 months ago

“The doom lies in yourself, not in your name.”

#15 opened 7 months ago by

New activity in TIGER-Lab/MMLU-Pro 2 months ago

Benchmark results feature design issues

#42 opened 2 months ago by

New activity in jukofyork/creative-writing-control-vectors-v3.0 7 months ago

Wur doomed!

#14 opened about 1 year ago by

New activity in nvidia/Nemotron-CC-v2 8 months ago

Acess Request

#3 opened 8 months ago by

New activity in Arki05/Grok-1-GGUF 8 months ago

``` 🥲 Не удалось загрузить модель Failed to load model error loading model: missing tensor 'blk.0.ffn_down_exps.weight' ```

#18 opened 10 months ago by

New activity in ByteDance-Seed/Seed-OSS-36B-Base-woSyn 8 months ago

What was the underlying training data distribution?

#2 opened 8 months ago by

New activity in mradermacher/model_requests 8 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

#1303 opened 8 months ago by

https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base-woSyn

#1317 opened 8 months ago by

New activity in ggml-org/gpt-oss-120b-GGUF 8 months ago

What quantization level does this model use?

#2 opened 8 months ago by

New activity in nvidia/Nemotron-CC-v2 8 months ago

license may be too restrictive for its purposes

#2 opened 8 months ago by

Which parts of the dataset are synthetic?

#1 opened 8 months ago by

New activity in microsoft/Phi-tiny-MoE-instruct 8 months ago

Add library_name and link to code

#1 opened 10 months ago by

New activity in AmanPriyanshu/GPT-OSS-20B-MoE-expert-activations 8 months ago

Is there public code available to replicate this dataset?

#1 opened 8 months ago by

New activity in deepseek-ai/DeepSeek-R1-0528 8 months ago

Any plans for 32B/70B distilled models?

#83 opened 11 months ago by

New activity in mradermacher/model_requests 8 months ago

https://huggingface.co/Undi95/dbrx-base

#1268 opened 8 months ago by

New activity in Jinx-org/Jinx-gpt-oss-20b 8 months ago

Why did you make this model so good at math, an already uncensored topic?

#4 opened 8 months ago by

GGUF quantized model required

#1 opened 8 months ago by

New activity in mradermacher/snowflake-arctic-base-GGUF 8 months ago

Not a base model? Empty-prompting results in instructions

#1 opened 8 months ago by

New activity in moonshotai/Kimi-K2-Instruct 8 months ago

Please, someone distill this model!

#53 opened 8 months ago by

New activity in LnL-AI/dbrx-base-tokenizer 8 months ago

config.json embedding size of "vocab_size": 100352 does not match 100277

#6 opened 8 months ago by