ShangZhu-Together committed
Commit b76cd90 · verified · 1 parent: 38b6210

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -8,7 +8,7 @@ tags: []
 
 ## Model Description
 
-This is the SFT model model in our Mixture of Agents Alignment (MoAA) pipeline. This model is fine-tuned from Gemma-2-9b-it. MoAA is an approach that leverages collective intelligence from open-source LLMs to advance alignment.
+This is the SFT model in our Mixture of Agents Alignment (MoAA) pipeline. This model is fine-tuned from Gemma-2-9b-it. MoAA is an approach that leverages collective intelligence from open-source LLMs to advance alignment.
 
 Two main stages are involved in our MoAA method. In the first stage, we employ MoA to produce high-quality synthetic data for supervised fine-tuning. In the second stage, we combine multiple LLMs as a reward model to provide preference annotations.
 
@@ -64,7 +64,7 @@ Refer to [Paper](https://arxiv.org/abs/2505.03059) for metrics.
 
 
 
-## Citation [optional]
+## Citation
 ```
 @article{wang2025improving,
   title = {Improving Model Alignment Through Collective Intelligence of Open-Source LLMS},
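The two-stage pipeline the model card describes (MoA aggregation to produce SFT data, then an ensemble of LLMs acting as a reward model for preference labels) can be sketched with toy stand-in models. Everything below is a hypothetical illustration: the function names and the stub "models" are not the authors' code or API, and a real implementation would call actual open-source LLMs.

```python
# Minimal sketch of the two-stage MoAA idea, using stub functions as "LLMs".

def make_stub_model(style):
    """Return a toy 'LLM' that tags its answer with a style marker."""
    def model(prompt):
        return f"[{style}] answer to: {prompt}"
    return model

def moa_aggregate(prompt, proposers, aggregator):
    """Stage 1 (MoA): several proposer models answer the prompt, then an
    aggregator model synthesizes the proposals into one SFT target."""
    proposals = [m(prompt) for m in proposers]
    context = "\n".join(proposals)
    return aggregator(f"{prompt}\n\nCandidate answers:\n{context}")

def preference_annotate(prompt, response_a, response_b, judges):
    """Stage 2: an ensemble of judge models votes on which response is
    preferred; the majority vote becomes the preference label."""
    votes = [judge(prompt, response_a, response_b) for judge in judges]
    return "a" if votes.count("a") >= votes.count("b") else "b"

def length_judge(prompt, a, b):
    """Toy judge standing in for an LLM preference model."""
    return "a" if len(a) <= len(b) else "b"

proposers = [make_stub_model(s) for s in ("concise", "detailed", "formal")]
aggregator = make_stub_model("aggregated")
sft_target = moa_aggregate("What is MoAA?", proposers, aggregator)
label = preference_annotate("q", "short reply", "a much longer reply", [length_judge] * 3)
```

Here `sft_target` plays the role of a synthetic training example for supervised fine-tuning, and `label` plays the role of a preference annotation for the alignment stage.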