morriszms commited on
Commit
794ea9b
·
verified ·
1 Parent(s): 9eeff66

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ qa-retailpro-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ qa-retailpro-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ qa-retailpro-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ qa-retailpro-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ qa-retailpro-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ qa-retailpro-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ qa-retailpro-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ qa-retailpro-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ qa-retailpro-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ qa-retailpro-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ qa-retailpro-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ qa-retailpro-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,156 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ license_link: https://huggingface.co/MIAOAI/qa-retailpro/blob/main/LICENSE
4
+ language:
5
+ - en
6
+ pipeline_tag: text-generation
7
+ tags:
8
+ - chat
9
+ - ecommerce
10
+ - qna
11
+ - TensorBlock
12
+ - GGUF
13
+ library_name: transformers
14
+ base_model: MIAOAI/qa-retailpro
15
+ ---
16
+
17
+ <div style="width: auto; margin-left: auto; margin-right: auto">
18
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
19
+ </div>
20
+
21
+ [![Website](https://img.shields.io/badge/Website-tensorblock.co-blue?logo=google-chrome&logoColor=white)](https://tensorblock.co)
22
+ [![Twitter](https://img.shields.io/twitter/follow/tensorblock_aoi?style=social)](https://twitter.com/tensorblock_aoi)
23
+ [![Discord](https://img.shields.io/badge/Discord-Join%20Us-5865F2?logo=discord&logoColor=white)](https://discord.gg/Ej5NmeHFf2)
24
+ [![GitHub](https://img.shields.io/badge/GitHub-TensorBlock-black?logo=github&logoColor=white)](https://github.com/TensorBlock)
25
+ [![Telegram](https://img.shields.io/badge/Telegram-Group-blue?logo=telegram)](https://t.me/TensorBlock)
26
+
27
+
28
+ ## MIAOAI/qa-retailpro - GGUF
29
+
30
+ <div style="text-align: left; margin: 20px 0;">
31
+ <a href="https://discord.com/invite/Ej5NmeHFf2" style="display: inline-block; padding: 10px 20px; background-color: #5865F2; color: white; text-decoration: none; border-radius: 5px; font-weight: bold;">
32
+ Join our Discord to learn more about what we're building ↗
33
+ </a>
34
+ </div>
35
+
36
+ This repo contains GGUF format model files for [MIAOAI/qa-retailpro](https://huggingface.co/MIAOAI/qa-retailpro).
37
+
38
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b5753](https://github.com/ggml-org/llama.cpp/commit/73e53dc834c0a2336cd104473af6897197b96277).
39
+
40
+ ## Our projects
41
+ <table border="1" cellspacing="0" cellpadding="10">
42
+ <tr>
43
+ <th colspan="2" style="font-size: 25px;">Forge</th>
44
+ </tr>
45
+ <tr>
46
+ <th colspan="2">
47
+ <img src="https://imgur.com/faI5UKh.jpeg" alt="Forge Project" width="900"/>
48
+ </th>
49
+ </tr>
50
+ <tr>
51
+ <th colspan="2">An OpenAI-compatible multi-provider routing layer.</th>
52
+ </tr>
53
+ <tr>
54
+ <th colspan="2">
55
+ <a href="https://github.com/TensorBlock/forge" target="_blank" style="
56
+ display: inline-block;
57
+ padding: 8px 16px;
58
+ background-color: #FF7F50;
59
+ color: white;
60
+ text-decoration: none;
61
+ border-radius: 6px;
62
+ font-weight: bold;
63
+ font-family: sans-serif;
64
+ ">🚀 Try it now! 🚀</a>
65
+ </th>
66
+ </tr>
67
+
68
+ <tr>
69
+ <th style="font-size: 25px;">Awesome MCP Servers</th>
70
+ <th style="font-size: 25px;">TensorBlock Studio</th>
71
+ </tr>
72
+ <tr>
73
+ <th><img src="https://imgur.com/2Xov7B7.jpeg" alt="MCP Servers" width="450"/></th>
74
+ <th><img src="https://imgur.com/pJcmF5u.jpeg" alt="Studio" width="450"/></th>
75
+ </tr>
76
+ <tr>
77
+ <th>A comprehensive collection of Model Context Protocol (MCP) servers.</th>
78
+ <th>A lightweight, open, and extensible multi-LLM interaction studio.</th>
79
+ </tr>
80
+ <tr>
81
+ <th>
82
+ <a href="https://github.com/TensorBlock/awesome-mcp-servers" target="_blank" style="
83
+ display: inline-block;
84
+ padding: 8px 16px;
85
+ background-color: #FF7F50;
86
+ color: white;
87
+ text-decoration: none;
88
+ border-radius: 6px;
89
+ font-weight: bold;
90
+ font-family: sans-serif;
91
+ ">👀 See what we built 👀</a>
92
+ </th>
93
+ <th>
94
+ <a href="https://github.com/TensorBlock/TensorBlock-Studio" target="_blank" style="
95
+ display: inline-block;
96
+ padding: 8px 16px;
97
+ background-color: #FF7F50;
98
+ color: white;
99
+ text-decoration: none;
100
+ border-radius: 6px;
101
+ font-weight: bold;
102
+ font-family: sans-serif;
103
+ ">👀 See what we built 👀</a>
104
+ </th>
105
+ </tr>
106
+ </table>
107
+
108
+ ## Prompt template
109
+
110
+ ```
111
+ <|im_start|>system
112
+ {system_prompt}<|im_end|>
113
+ <|im_start|>user
114
+ {prompt}<|im_end|>
115
+ <|im_start|>assistant
116
+ ```
117
+
118
+ ## Model file specification
119
+
120
+ | Filename | Quant type | File Size | Description |
121
+ | -------- | ---------- | --------- | ----------- |
122
+ | [qa-retailpro-Q2_K.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q2_K.gguf) | Q2_K | 3.016 GB | smallest, significant quality loss - not recommended for most purposes |
123
+ | [qa-retailpro-Q3_K_S.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q3_K_S.gguf) | Q3_K_S | 3.492 GB | very small, high quality loss |
124
+ | [qa-retailpro-Q3_K_M.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q3_K_M.gguf) | Q3_K_M | 3.808 GB | very small, high quality loss |
125
+ | [qa-retailpro-Q3_K_L.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q3_K_L.gguf) | Q3_K_L | 4.088 GB | small, substantial quality loss |
126
+ | [qa-retailpro-Q4_0.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q4_0.gguf) | Q4_0 | 4.431 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
127
+ | [qa-retailpro-Q4_K_S.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q4_K_S.gguf) | Q4_K_S | 4.458 GB | small, greater quality loss |
128
+ | [qa-retailpro-Q4_K_M.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q4_K_M.gguf) | Q4_K_M | 4.683 GB | medium, balanced quality - recommended |
129
+ | [qa-retailpro-Q5_0.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q5_0.gguf) | Q5_0 | 5.315 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
130
+ | [qa-retailpro-Q5_K_S.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q5_K_S.gguf) | Q5_K_S | 5.315 GB | large, low quality loss - recommended |
131
+ | [qa-retailpro-Q5_K_M.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q5_K_M.gguf) | Q5_K_M | 5.445 GB | large, very low quality loss - recommended |
132
+ | [qa-retailpro-Q6_K.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q6_K.gguf) | Q6_K | 6.254 GB | very large, extremely low quality loss |
133
+ | [qa-retailpro-Q8_0.gguf](https://huggingface.co/tensorblock/MIAOAI_qa-retailpro-GGUF/blob/main/qa-retailpro-Q8_0.gguf) | Q8_0 | 8.099 GB | very large, extremely low quality loss - not recommended |
134
+
135
+
136
+ ## Downloading instruction
137
+
138
+ ### Command line
139
+
140
+ Firstly, install Huggingface Client
141
+
142
+ ```shell
143
+ pip install -U "huggingface_hub[cli]"
144
+ ```
145
+
146
+ Then, downoad the individual model file the a local directory
147
+
148
+ ```shell
149
+ huggingface-cli download tensorblock/MIAOAI_qa-retailpro-GGUF --include "qa-retailpro-Q2_K.gguf" --local-dir MY_LOCAL_DIR
150
+ ```
151
+
152
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
153
+
154
+ ```shell
155
+ huggingface-cli download tensorblock/MIAOAI_qa-retailpro-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
156
+ ```
qa-retailpro-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6d69d19ada473d63e9bbc919c8f9d73b46d8a9a7a0ffc31f540f284dc50f9a19
3
+ size 3015940192
qa-retailpro-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6d116cc5ee87463d78a8fc6833286354df6f750f0f7b32270a7deba244df1f81
3
+ size 4088459360
qa-retailpro-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1019f6a158073ce3a5cf7cac29c19cdac43c45ad61bb9534fef9b5ae2f045b76
3
+ size 3808391264
qa-retailpro-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dc01c9c4813d97eee34fcbb24112faf7f4afd303ee305f35ad97657520ffe252
3
+ size 3492368480
qa-retailpro-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:11ed643bf63a7952a9b88b4bce23b4ce0a4dfdc8d4517d423d021d679e41fdeb
3
+ size 4431390816
qa-retailpro-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87c7d5e09c39d31ccaabdef4ad4da55cd2fd59a9296a4ff02a3ba1a7c2efb196
3
+ size 4683073632
qa-retailpro-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:119fe1c3fae73c07d50e1f907b77d3e7ffa1193b917c93e717537d50d9488b1e
3
+ size 4457769056
qa-retailpro-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e85ccaab1061472f8fae13d240f66e478a3ac629708baed224ad99f2728c7a8c
3
+ size 5315176544
qa-retailpro-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06617dee6d0c4554735c72e6a79851b2c8bb2b5c0e83dc97ebb9921101f4768f
3
+ size 5444831328
qa-retailpro-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:074eb9a9c963a54ca3f7e7f4f5c496ccfe670d994737cb8dcd5bda8de1ed1c32
3
+ size 5315176544
qa-retailpro-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef29b2d00a00cd4e058995142430999575d54964a6be1c6f5e6e54dc6885a1ed
3
+ size 6254198880
qa-retailpro-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef0b5bce8ae8f5d4dc6b2077f21d0c9de6ee8d1bff59aa04d8a6459ea708b8d9
3
+ size 8098525280