Commit 18cb1a4 by nightmedia (verified) · 1 Parent(s): 516b95c

Update README.md

Files changed (1)
  1. README.md +14 -1
README.md CHANGED
@@ -37,7 +37,20 @@ pipeline_tag: text-generation
 
 # Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx
 
-This model [Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx](https://huggingface.co/Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx) was
+This is a new-old-stock version of the model, with embeddings at 6 bit.
+
+The original [Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx](https://huggingface.co/nightmedia/Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx) is using 4 bit embeddings
+
+```bash
+Perplexity: 4.455 ± 0.031
+Peak memory: 32.84 GB
+```
+
+Metrics coming soon. If this proves better than the qx64-hi, it will replace it in the catalog.
+
+-G
+
+This model [Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx](https://huggingface.co/nightmedia/Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall-qx64x-hi-mlx) was
 converted to MLX format from [DavidAU/Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall](https://huggingface.co/DavidAU/Qwen3-Yoyo-V3-42B-A3B-Thinking-Total-Recall)
 using mlx-lm version **0.28.3**.
 