# gpt-oss-120b-qx64-mlx

The reason I created the qx64 and qx65 quants: I was looking to write some Perl as a Postgres function.

Most other quants simplify and offer really well-written PL/pgSQL instead.

But I wanted PL/Perl. I am that guy.
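
For context, this is roughly what "Perl as a Postgres function" means; a minimal PL/Perl sketch (an illustrative example, not the actual prompt or either model's output):

```sql
-- Enable the PL/Perl language (ships with most Postgres distributions).
CREATE EXTENSION IF NOT EXISTS plperl;

-- A trivial PL/Perl function: the body between the $$ markers is plain Perl.
CREATE OR REPLACE FUNCTION perl_max(a integer, b integer)
RETURNS integer AS $$
    my ($a, $b) = @_;
    return $a if $a > $b;
    return $b;
$$ LANGUAGE plperl;

-- SELECT perl_max(3, 7);  -- returns 7
```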

The [qx65 quant](https://huggingface.co/nightmedia/gpt-oss-120b-qx65-mlx) gave me what I asked for.

It followed instructions.

Then I asked the qx64 the same question: "Why did you follow my instructions?" I showed it this prompt.

The qx65 gave me a very short, clean answer I could put as a comment in the code.

The qx64 gave me the history of PL/Perl and how many nice things I could do with it.

Until the performance metrics are available, please use these models with caution.

-G

```text
75.26 tok/sec
9338 tokens
2.58s to first token
```

This model [gpt-oss-120b-qx64-mlx](https://huggingface.co/nightmedia/gpt-oss-120b-qx64-mlx) was
converted to MLX format from [openai/gpt-oss-120b](https://huggingface.co/openai/gpt-oss-120b)
using mlx-lm version **0.27.1**.
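
To try it, here is a minimal sketch using the standard `mlx_lm` Python API (`pip install mlx-lm`); the repo id assumes this quant lives under the same `nightmedia/` namespace as the qx65:

```python
from mlx_lm import load, generate

# Download and load the quantized weights from the Hugging Face Hub.
model, tokenizer = load("nightmedia/gpt-oss-120b-qx64-mlx")

prompt = "Write a PL/Perl function that reverses a string."

# Wrap the prompt in the model's chat template when one is available.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```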