flytech
/

togetherchat-dev-7b

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

flytech commited on Sep 9, 2023

Commit

725edc8

·

1 Parent(s): 60e2f81

Update README.md

Files changed (1) hide show

README.md +22 -5

README.md CHANGED Viewed

@@ -8,16 +8,33 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # togetherchat-dev-7b
-This model is a fine-tuned version of [togethercomputer/LLaMA-2-7B-32K](https://huggingface.co/togethercomputer/LLaMA-2-7B-32K) on an unknown dataset.
 ## Model description
-More information needed
 ## Intended uses & limitations

   results: []
 ---
 # togetherchat-dev-7b
+This model is a fine-tuned version of [togethercomputer/LLaMA-2-7B-32K](https://huggingface.co/togethercomputer/LLaMA-2-7B-32K) using 5000 examples and 3 datasets:
+platypus_dataset = load_dataset("garage-bAInd/Open-Platypus")
+codealpaca_dataset = load_dataset("sahil2801/CodeAlpaca-20k")
+evol_codealpaca_dataset = load_dataset("theblackcat102/evol-codealpaca-v1")
 ## Model description
+Step	Training Loss
+-------------
+-60	    1.293000-
+-120	0.673600-
+-180	0.633200-
+-240	0.611600-
+-300	0.633000-
+-360	0.589500-
+-480	0.587600-
+-540	0.569000-
+-600	0.548700-
+-660	0.553100-
+-720	0.531500-
+-780	0.506400-
+-840	0.512500-
+-------------
 ## Intended uses & limitations