OptimalScale

university

https://github.com/OptimalScale

OptimalScale

optimalscale

Activity Feed Request to join this org

AI & ML interests

Large foundation models, large language models.

hendrydong

authored 2 papers 3 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 22

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

lmflow-optimalscale

updated a collection 3 months ago

CLIMB Datasets

Collection

NVIDIA's ClimbLab and ClimbMix datasets • 2 items • Updated May 9

hendrydong

authored 2 papers 3 months ago

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 26

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5 • 25

lmflow-optimalscale

updated 2 datasets 3 months ago

OptimalScale/ClimbMix

Viewer • Updated May 4 • 395M • 2.8k • 9

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 2.47k • 10

lmflow-optimalscale

in OptimalScale/ClimbLab 4 months ago

Really nice contribution 👏🏻👏🏻

#2 opened 4 months ago by

Tonic

lmflow-optimalscale

in OptimalScale/ClimbMix 4 months ago

Erroneous Token Count Column

#2 opened 4 months ago by

casey-martin

shizhediao

updated a dataset 4 months ago

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 2.47k • 10

shizhediao

authored a paper 4 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 92

lmflow-optimalscale

published 2 datasets 4 months ago

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 2.47k • 10

OptimalScale/ClimbMix

Viewer • Updated May 4 • 395M • 2.8k • 9

ksshumab

authored a paper 5 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

ksshumab

authored 3 papers 6 months ago

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data

Paper • 2302.12822 • Published Feb 24, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Paper • 2304.06767 • Published Apr 13, 2023 • 2

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

Paper • 2408.12168 • Published Aug 22, 2024

hendrydong

authored a paper 6 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 25

AI & ML interests

Team members 6

OptimalScale's activity

Really nice contribution 👏🏻👏🏻

Erroneous Token Count Column