Update README.md
README.md
CHANGED
@@ -73,7 +73,7 @@ Without it, the model will not perform at its best capacity.

 ## Benchmark Results

-The model has been evaluated against other strong LLMs on the **
+The model has been evaluated against other strong LLMs on the **English → Moroccan Darija task** as a proxy benchmark.
 For Moroccan Darija evaluation, a **dataset of 300 sentences manually translated by a Moroccan translator** was used.

 ### 🧾 Evaluation Dataset Coverage

@@ -90,11 +90,11 @@ This diversity makes the benchmark a strong representation of **real-world trans

 | Model | BLEU | METEOR | chrF | Task |
 |-------------------------------------|--------|--------|--------|---------------|
-| Claude-Sonnet-4 | 0.312 | 0.566 | 62.09 |
-| GPT-5-mini | 0.381 | 0.637 | 66.58 |
-| GPT-5 | 0.284 | 0.551 | 61.73 |
-| GPT-4.1 | 0.306 | 0.575 | 61.87 |
-| **oddadmix/English-Moroccan-Darija-v1** | **0.423** | **0.644** | **67.31** |
+| Claude-Sonnet-4 | 0.312 | 0.566 | 62.09 | English → Moroccan Darija |
+| GPT-5-mini | 0.381 | 0.637 | 66.58 | English → Moroccan Darija |
+| GPT-5 | 0.284 | 0.551 | 61.73 | English → Moroccan Darija |
+| GPT-4.1 | 0.306 | 0.575 | 61.87 | English → Moroccan Darija |
+| **oddadmix/English-Moroccan-Darija-v1** | **0.423** | **0.644** | **67.31** | **English → Moroccan Darija** |

 ➡️ Our model achieves **state-of-the-art performance** while delivering **specialized Moroccan Darija output** across a wide variety of contexts.

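
Review note: the table's claim that the bolded model leads on every metric can be checked directly from the numbers in the diff. The sketch below is only an illustration, not part of the README. It copies the five score rows as they appear above, and the helper names `parse_rows` and `leader_and_margin` are hypothetical. It assumes higher is better for all three metrics and that each metric is compared only against itself, since BLEU and METEOR appear to be reported on a 0 to 1 scale while chrF appears to be on a 0 to 100 scale.

```python
# Score rows copied from the table in this diff (columns: Model, BLEU, METEOR, chrF).
TABLE = """
| Claude-Sonnet-4 | 0.312 | 0.566 | 62.09 |
| GPT-5-mini | 0.381 | 0.637 | 66.58 |
| GPT-5 | 0.284 | 0.551 | 61.73 |
| GPT-4.1 | 0.306 | 0.575 | 61.87 |
| **oddadmix/English-Moroccan-Darija-v1** | **0.423** | **0.644** | **67.31** |
"""

METRICS = ("BLEU", "METEOR", "chrF")


def parse_rows(table):
    """Turn markdown table rows into {model: {metric: score}} (hypothetical helper)."""
    rows = {}
    for line in table.strip().splitlines():
        # Drop the outer pipes, split into cells, and strip bold markers.
        cells = [c.strip().strip("*") for c in line.strip().strip("|").split("|")]
        rows[cells[0]] = dict(zip(METRICS, map(float, cells[1:4])))
    return rows


def leader_and_margin(rows, metric):
    """Return (best model, runner-up, score gap) for one metric, higher is better."""
    ranked = sorted(rows, key=lambda m: rows[m][metric], reverse=True)
    best, runner_up = ranked[0], ranked[1]
    return best, runner_up, round(rows[best][metric] - rows[runner_up][metric], 3)


scores = parse_rows(TABLE)
for metric in METRICS:
    best, runner_up, margin = leader_and_margin(scores, metric)
    print(f"{metric}: {best} leads {runner_up} by {margin}")
```

On these numbers the bolded model ranks first on all three metrics with GPT-5-mini second each time, and the METEOR gap is much narrower than the BLEU gap. With a 300-sentence test set, small gaps like that may not be statistically meaningful without a significance test.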