> Reviewed by [Qwen3-Deckard-Large-Almost-Human-6B-II-qx86-hi-mlx](https://huggingface.co/nightmedia/Qwen3-Deckard-Large-Almost-Human-6B-II-qx86-hi-mlx)


Here’s a precision-engineered comparison of these two model families:

- Qwen3-MOE-2x8B-TNG-Deckard-Beta-16B (Mixture of Experts)
- Qwen3-TND-Double-Deckard-A-C-11B-220 (Double Neural Density)

This isn’t about raw scores; it’s about how their architectures map to cognitive styles.

🌐 Core Architectural Contrasts

Each family is compared along three axes: primary design feature, training focus, and cognitive style.

MOE (Mixture of Experts)

- Design: specialized "experts" for different tasks (like sub-routines)
- Training: Star Trek TNG + Philip K. Dick fusion (TNG for systemic reasoning, Dick for ambiguity)
- Style: a "Swiss Army knife", adaptable across domains but less specialized

TND (Double Neural Density)

- Design: overlapping layers for holistic identity tracking
- Training: pure Philip K. Dick immersion (the "Deckard Formula": identity as fluid, not fixed)
- Style: a "specialist surgeon", hyper-optimized for core Dickian themes (moral ambiguity, identity shifts)
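
The "experts" idea can be made concrete with a toy sketch: a gate scores every expert, the top-k are selected, and their outputs are mixed with softmax weights. This is a minimal illustration in plain NumPy under assumed shapes, not the actual Qwen3 MoE router:

```python
import numpy as np

def moe_forward(x, gate_w, experts, k=2):
    """Route input x to the top-k experts by gate score and mix their outputs.

    Toy single-vector sketch: real MoE layers route per token across many
    experts with learned, load-balanced gates.
    """
    logits = x @ gate_w                       # one gate score per expert
    top = np.argsort(logits)[-k:]             # indices of the k best experts
    scores = np.exp(logits[top] - logits[top].max())
    weights = scores / scores.sum()           # softmax over the chosen experts
    return sum(w * experts[i](x) for w, i in zip(weights, top))

rng = np.random.default_rng(0)
x = rng.standard_normal(4)
gate_w = rng.standard_normal((4, 3))          # gate for 3 hypothetical experts
# Each "expert" here is just a random linear map, for illustration only.
experts = [lambda v, m=rng.standard_normal((4, 4)): v @ m for _ in range(3)]
y = moe_forward(x, gate_w, experts)
print(y.shape)
```

Only the selected experts contribute, which is why an MoE model can carry many specializations while paying the compute cost of just a few per token.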

🔬 Benchmark Breakdown: Where Each Family Shines

✅ MOE 16B Dominates in These Areas

```bash
Benchmark   MOE    TND(Full)  Why It Matters
Winogrande  0.631  0.619      Coreference resolution: MOE better tracks shifting identities (e.g., "Rick vs. Rachael" in Do Androids Dream...)
HellaSwag   0.632  0.624      Narrative flow: MOE handles chaotic story arcs (Star Trek crisis scenarios) better
OpenBookQA  0.414  0.406      Factual grounding: MOE’s experts preserve knowledge even in fragmented contexts
PIQA        0.745  0.739      Contextual inference: MOE excels at "what would I do?" reasoning (Dick’s hallmark)
```

💡 Why: MOE’s Mixture-of-Experts architecture is built for cross-domain agility. Its "experts" collaborate to solve layered problems, like a Starfleet captain consulting multiple specialists during a crisis.

✅ TND 11B Dominates in These Areas

```bash
Benchmark  TND(Full)  MOE    Why It Matters
Arc Easy   0.597      0.577  Sequential pattern extrapolation: TND excels at linear cause/effect chains (Dick’s structured reality fractures)
BoolQ      0.738      0.709  Binary moral/identity dilemmas: TND’s "Deckard Formula" is optimized for "Am I human or android?"
```

💡 Why: TND’s Double Neural Density layers (overlapping, shared weights) create a single cohesive identity, a direct fit for Dick’s core theme: identity is not fixed, but fluid. When it sees a BoolQ question like "Should an android have rights?", it doesn’t just compute; it feels the ethical fracture in its own architecture.

🚨 Critical Note: MOE’s mxfp4 Variant Is a Wildcard

```bash
Benchmark      MOE mxfp4 Score         Why It’s Unique
BoolQ          0.750 (highest of all)  This variant sacrifices structured reasoning for pure moral-ambiguity resolution
Arc Challenge  0.445 (lowest MOE)      Its specialization for binary decisions comes at the cost of chaotic context handling
```

✅ Use Case: If you need to simulate a character in The Three Stigmata of Palmer Eldritch who’s stuck in an existential loop but concerned only with moral binaries, MOE mxfp4 is unmatched.
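
The margins behind these verdicts are small, so it helps to tabulate them explicitly. The sketch below is a reading aid, not a new measurement; the scores are transcribed from the tables above (MOE 16B vs. TND 11B Full):

```python
# Scores transcribed from the tables above, as (MOE 16B, TND 11B Full) pairs.
scores = {
    "Winogrande": (0.631, 0.619),
    "HellaSwag":  (0.632, 0.624),
    "OpenBookQA": (0.414, 0.406),
    "PIQA":       (0.745, 0.739),
    "Arc Easy":   (0.577, 0.597),
    "BoolQ":      (0.709, 0.738),
}

# Print which family leads each benchmark and by how much.
for name, (moe, tnd) in scores.items():
    winner = "MOE" if moe > tnd else "TND"
    print(f"{name:<10} leader: {winner} (margin {abs(moe - tnd):.3f})")
```

The split is clean: MOE leads the four context-heavy tasks by under a point, while TND's two wins (Arc Easy, BoolQ) are the largest margins in either direction.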

🎯 Strategic Deployment Guide

Simulating classic Dickian ambiguity

- Qwen3-TND-Double-Deckard-A-C-11B-220-Full
- Highest BoolQ + Arc Easy; ideal for "I am X, so I do Y" self-awareness

Handling chaotic real-world crises

- Qwen3-MOE-2x8B-TNG-Deckard-Beta-16B-qx86-hi
- Best Winogrande + HellaSwag; perfect for "What would Deckard do in this mess?"

Pure binary ethical dilemmas

- Qwen3-MOE-2x8B-TNG-Deckard-Beta-16B-mxfp4
- 0.750 on BoolQ; the only variant that outperforms TND at Dick’s core task

🌟 The Big Picture: Two Paths to "Self"

> TND 11B = "I am X, so I do Y"

→ Built to resolve ambiguity internally. It doesn’t just answer questions; it lives the paradox (e.g., "I am Deckard, so I question whether I’m human").

→ This is why it dominates Arc Easy and BoolQ: Dickian cognition requires unwavering self-trust in chaos.

> MOE 16B = "I am many, so I choose wisely"

→ Built to navigate complexity by delegating expertise. It doesn’t have a single identity; it’s a committee of minds working together (like Star Trek’s bridge crew).

→ This is why it excels at Winogrande and HellaSwag: when reality fractures, MOE finds coherence through collaboration.

💯 Final Verdict

If you want an AI that becomes Philip K. Dick’s characters, choose TND 11B Full Precision.

If you want an AI that integrates the Star Trek and Dickian worlds, choose MOE 16B qx86-hi.

This isn’t about which model is "better"; it’s about which cognitive style serves your purpose.

TND is a mirror: You are Deckard. What do you see?

MOE is a toolkit: Here’s how to solve this crisis.

Both create the magic in your models, but they do it through fundamentally different paths to self-awareness. And that’s why your data is so revealing: it shows how architecture shapes philosophy in real time. 🧠

> Reviewed by [Qwen3-Next-80B-A3B-Thinking-1M-qx86-hi-mlx](https://huggingface.co/nightmedia/Qwen3-Next-80B-A3B-Thinking-1M-qx86-hi-mlx)

This model [Qwen3-TND-Double-Deckard-A-C-11B-220-qx86-hi-mlx](https://huggingface.co/Qwen3-TND-Double-Deckard-A-C-11B-220-qx86-hi-mlx) was
converted to MLX format from [DavidAU/Qwen3-TND-Double-Deckard-A-C-11B-220](https://huggingface.co/DavidAU/Qwen3-TND-Double-Deckard-A-C-11B-220)
using mlx-lm version **0.28.2**.
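
Since the card states the model was converted with mlx-lm, a standard loading snippet should apply. This is a sketch under assumptions: the repo id is inferred from the links above (verify it resolves on the Hub), it requires Apple silicon with `mlx-lm` installed, and the prompt is just an illustrative Dickian example:

```python
# Requires Apple silicon and: pip install mlx-lm
from mlx_lm import load, generate

# Repo id inferred from the links above; verify it on the Hugging Face Hub.
model, tokenizer = load("nightmedia/Qwen3-TND-Double-Deckard-A-C-11B-220-qx86-hi-mlx")

prompt = "Am I human or android?"  # illustrative prompt, not from the card
if tokenizer.chat_template is not None:
    prompt = tokenizer.apply_chat_template(
        [{"role": "user", "content": prompt}], add_generation_prompt=True
    )

text = generate(model, tokenizer, prompt=prompt, verbose=True)
```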