Add files using upload-large-folder tool
.gitattributes
CHANGED
@@ -34,3 +34,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE.gguf filter=lfs diff=lfs merge=lfs -text
+Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00001-of-00003.gguf filter=lfs diff=lfs merge=lfs -text
+Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00002-of-00003.gguf filter=lfs diff=lfs merge=lfs -text
+Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00003-of-00003.gguf filter=lfs diff=lfs merge=lfs -text
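The three new `.gitattributes` patterns track the split GGUF shards, which follow the `-%05d-of-%05d` naming convention used by llama.cpp's gguf-split tool. As a rough sketch (the helper name is hypothetical), a set of shard filenames can be checked for completeness before loading:

```python
import re

# gguf-split naming convention: <base>-00001-of-0000N.gguf
SHARD_RE = re.compile(r"^(?P<base>.+)-(?P<idx>\d{5})-of-(?P<total>\d{5})\.gguf$")

def missing_shards(filenames):
    """Return the shard filenames absent from `filenames`."""
    present = set(filenames)
    expected = set()
    for name in filenames:
        m = SHARD_RE.match(name)
        if not m:
            continue
        base, total = m.group("base"), int(m.group("total"))
        # Every index from 1 to N must be present for the model to load.
        for i in range(1, total + 1):
            expected.add(f"{base}-{i:05d}-of-{total:05d}.gguf")
    return sorted(expected - present)

shards = [
    "Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00001-of-00003.gguf",
    "Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00003-of-00003.gguf",
]
print(missing_shards(shards))  # reports the 00002 shard as missing
```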
Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00001-of-00003.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:98c6429c9cf562c4f3086e111de20b6cee82767e5aac4474a882c81b5c41d5a9
+size 14738984864
Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00002-of-00003.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eca385a3882809860f50f5e27a55d2a06d23ca08891c333c0a36069d8bb56248
+size 14930823232
Qwen3-Next-80B-A3B-Thinking-MXFP4_MOE-00003-of-00003.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4e340867bc43202e0bc0c38718e86b6e95a0cf097824849bf6fd4b7f3fde2012
+size 14071817696
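Each file added above is a Git LFS pointer: three lines giving the spec version, the sha256 oid, and the byte size of the actual blob. After downloading a shard, it can be checked against its pointer with nothing but the standard library; this is a minimal sketch (helper names are hypothetical), demonstrated on a tiny stand-in blob rather than the real 14 GB shard:

```python
import hashlib
import os

def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a {version, oid, size} dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    return {
        "version": fields["version"],
        "oid": fields["oid"].removeprefix("sha256:"),
        "size": int(fields["size"]),
    }

def verify_blob(path, pointer):
    """Check that the file at `path` matches the pointer's size and sha256."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so multi-GB shards don't need to fit in RAM.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["oid"]

# Demo on a small stand-in blob with a matching pointer:
blob = b"hello gguf"
with open("demo.bin", "wb") as f:
    f.write(blob)
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(blob).hexdigest()}\n"
    f"size {len(blob)}\n"
)
print(verify_blob("demo.bin", pointer))  # True when the download is intact
```

Substituting the oid and size from one of the pointers above verifies the corresponding real shard the same way.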
README.md
CHANGED
@@ -1,35 +1,29 @@
----
-
-
-
----
-This is a MXFP4 quant of Qwen3-Next-80B-A3B-Thinking
-
-Welcome to the bleeding edge.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-If you
-
-
-
-
-
-If you don't trust strangers giving out binaries, you can try compiling it for yourself, in order to be sure.
-
-https://www.virustotal.com/gui/file/35a134a8977488ff6b82ce3f2b5df20da742ec212859a5e0c30813c55519f4f0
-
-When the support for Qwen3-Next officially releases on mainline llama.cpp, I will see if these files will need a new updated quantization, and update if needed.
+---
+pipeline_tag: text-generation
+base_model:
+- Qwen/Qwen3-Next-80B-A3B-Thinking
+---
+This is an MXFP4 quant of Qwen3-Next-80B-A3B-Thinking.
+
+Welcome to the bleeding edge.
+I must point out that this is an *experimental* release.
+Say it after me: *EXPERIMENTAL*.
+
+This has been made possible by the excellent work of [pwilkin](https://github.com/pwilkin/llama.cpp) and others.
+
+He maintains a development branch of llama.cpp with Qwen3-Next support.
+It has not yet been merged officially, and things are moving quite fast.
+
+For the time being, as of 2025-10-24, I took the source code from his fork and compiled it in order to generate the GGUFs, from here:
+https://github.com/pwilkin/llama.cpp/tree/qwen3_next
+
+This GGUF will run only with that build.
+
+If you cannot compile it yourself, I have made a Windows build with Vulkan support, which you can find here:
+[llama-qwen3-next-5edfe78-bin-win-vulkan-x64.zip](https://gofile.io/d/qhHL6n)
+I should state that this may trigger false positives from your AV; it contains NO virus. I compiled it on my Windows 11 PC, which I scan regularly.
+If you don't trust strangers giving out binaries, you can compile it yourself to be sure.
+
+https://www.virustotal.com/gui/file/35a134a8977488ff6b82ce3f2b5df20da742ec212859a5e0c30813c55519f4f0
+
+When support for Qwen3-Next officially lands in mainline llama.cpp, I will check whether these files need an updated quantization, and update them if needed.