mali90 committed 18 days ago

Commit 5853b04 (verified) · 1 Parent(s): 42c2c6f

Update index.html

Files changed (1)

index.html +6 -52

@@ -36,52 +36,6 @@
     </p>
   </div>
 </section>
-  <section class="section">
-  <div class="container">
-    <h2 class="title is-3">📊 Results</h2>
-    <div class="highlight-box">
-      <p><strong>✔️ Accuracy</strong></p>
-      <ul>
-        <li>Spearman’s ρ > 0.87 with human ground truth</li>
-      </ul>
-    </div>
-    <div class="highlight-box">
-      <p><strong>📈 Downstream LLM Training Impact</strong></p>
-      <ul>
-        <li>+7.2% benchmark performance improvement</li>
-        <li>+4.8% token retention compared to FineWeb2 heuristic filter</li>
-        <li>Reliable thresholding with 0.6 and 0.7 quantiles</li>
-      </ul>
-    </div>
-    <div class="highlight-box">
-      <p><strong>⚡ Annotation Speed</strong></p>
-      <ul>
-        <li>~11,000 docs/min (on A100 GPU, avg. 690 tokens per doc)</li>
-      </ul>
-    </div>
-  </div>
-</section>
-<section class="section">
-  <div class="container">
-    <h2 class="title is-3">📁 Available Artifacts</h2>
-    <div class="highlight-box">
-      <ul>
-        <li>📄 Ground truth annotations in <strong>35 languages</strong></li>
-        <li>🧠 Synthetic LLM-annotated dataset (<strong>14M+ documents</strong>)</li>
-        <li>🪶 Lightweight annotation models:
-          <ul>
-            <li>JQL-Gemma</li>
-            <li>JQL-Mistral</li>
-            <li>JQL-Llama</li>
-          </ul>
-        </li>
-        <li>🛠️ Training & inference scripts <em>(coming soon)</em></li>
-      </ul>
-    </div>
-  </div>
-</section>
 <section class="section">
   <div class="container content">
@@ -104,15 +58,15 @@
   <div class="container content">
     <h2 class="title is-3">📊 Results</h2>
     <ul>
-      <li><strong>Accuracy:</strong> Spearman’s ρ > 0.87 with human ground truth</li>
-      <li><strong>Downstream LLM Training:</strong>
         <ul>
           <li>+7.2% benchmark performance improvement</li>
           <li>+4.8% token retention vs. FineWeb2 heuristic filter</li>
           <li>Effective threshold strategies: 0.6 and 0.7 quantile</li>
         </ul>
       </li>
-      <li><strong>Annotation Speed:</strong> ~11,000 docs/min (A100 GPU, avg. 690 tokens)</li>
     </ul>
   </div>
 </section>
@@ -121,9 +75,9 @@
   <div class="container content">
     <h2 class="title is-3">📁 Available Artifacts</h2>
     <ul>
-      <li>✅ Ground truth annotations in 35 languages</li>
-      <li>✅ Synthetic LLM-annotated dataset (14M+ documents)</li>
-      <li>✅ Lightweight annotation models:
         <ul>
           <li>JQL-Gemma</li>
           <li>JQL-Mistral</li>

     </p>
   </div>
 </section>
 <section class="section">
   <div class="container content">
   <div class="container content">
     <h2 class="title is-3">📊 Results</h2>
     <ul>
+      <li><strong>✔️ Accuracy:</strong> Spearman’s ρ > 0.87 with human ground truth</li>
+      <li><strong>📈 Downstream LLM Training:</strong>
         <ul>
           <li>+7.2% benchmark performance improvement</li>
           <li>+4.8% token retention vs. FineWeb2 heuristic filter</li>
           <li>Effective threshold strategies: 0.6 and 0.7 quantile</li>
         </ul>
       </li>
+      <li><strong>⚡ Annotation Speed:</strong> ~11,000 docs/min (A100 GPU, avg. 690 tokens)</li>
     </ul>
   </div>
 </section>
   <div class="container content">
     <h2 class="title is-3">📁 Available Artifacts</h2>
     <ul>
+      <li>📄 Ground truth annotations in 35 languages</li>
+      <li>🧠 Synthetic LLM-annotated dataset (14M+ documents)</li>
+      <li>🪶 Lightweight annotation models:
         <ul>
           <li>JQL-Gemma</li>
           <li>JQL-Mistral</li>