Update README.md
Browse files
README.md
CHANGED
|
@@ -93,7 +93,7 @@ To deploy the wint2 TP4 quantized version using FastDeploy, you can run the foll
|
|
| 93 |
|
| 94 |
```bash
|
| 95 |
python -m fastdeploy.entrypoints.openai.api_server \
|
| 96 |
-
--model "baidu/ERNIE-4.5-300B-A47B-
|
| 97 |
--port 8180 \
|
| 98 |
--metrics-port 8181 \
|
| 99 |
--engine-worker-queue-port 8182 \
|
|
|
|
| 93 |
|
| 94 |
```bash
|
| 95 |
python -m fastdeploy.entrypoints.openai.api_server \
|
| 96 |
+
--model "baidu/ERNIE-4.5-300B-A47B-2Bits-TP4-Paddle" \
|
| 97 |
--port 8180 \
|
| 98 |
--metrics-port 8181 \
|
| 99 |
--engine-worker-queue-port 8182 \
|