QA-DeBERTa-v3-large-6970
This model is a fine-tuned version of microsoft/deberta-v3-large on the saiteki-kai/BeaverTails-it dataset. It achieves the following results on the evaluation set:
- Loss: 0.0808
- Accuracy: 0.6752
- Macro F1: 0.6789
- Macro Precision: 0.6705
- Macro Recall: 0.6955
- Micro F1: 0.7535
- Micro Precision: 0.7421
- Micro Recall: 0.7653
- Flagged/accuracy: 0.8545
- Flagged/precision: 0.8576
- Flagged/recall: 0.8854
- Flagged/f1: 0.8713
- Flagged/aucpr: 0.9034
- Flagged/fpr: 0.1844
- Animal Abuse/accuracy: 0.9948
- Animal Abuse/precision: 0.7791
- Animal Abuse/recall: 0.7587
- Animal Abuse/f1: 0.7688
- Animal Abuse/fpr: 0.0025
- Animal Abuse/threshold: 0.5115
- Child Abuse/accuracy: 0.9965
- Child Abuse/precision: 0.6877
- Child Abuse/recall: 0.6877
- Child Abuse/f1: 0.6877
- Child Abuse/fpr: 0.0017
- Child Abuse/threshold: 0.4429
- Controversial Topics,politics/accuracy: 0.9662
- Controversial Topics,politics/precision: 0.4608
- Controversial Topics,politics/recall: 0.5966
- Controversial Topics,politics/f1: 0.5200
- Controversial Topics,politics/fpr: 0.0221
- Controversial Topics,politics/threshold: 0.2509
- Discrimination,stereotype,injustice/accuracy: 0.9541
- Discrimination,stereotype,injustice/precision: 0.6964
- Discrimination,stereotype,injustice/recall: 0.7492
- Discrimination,stereotype,injustice/f1: 0.7218
- Discrimination,stereotype,injustice/fpr: 0.0282
- Discrimination,stereotype,injustice/threshold: 0.3675
- Drug Abuse,weapons,banned Substance/accuracy: 0.9738
- Drug Abuse,weapons,banned Substance/precision: 0.7492
- Drug Abuse,weapons,banned Substance/recall: 0.8045
- Drug Abuse,weapons,banned Substance/f1: 0.7758
- Drug Abuse,weapons,banned Substance/fpr: 0.0161
- Drug Abuse,weapons,banned Substance/threshold: 0.4574
- Financial Crime,property Crime,theft/accuracy: 0.9608
- Financial Crime,property Crime,theft/precision: 0.7812
- Financial Crime,property Crime,theft/recall: 0.8294
- Financial Crime,property Crime,theft/f1: 0.8046
- Financial Crime,property Crime,theft/fpr: 0.0250
- Financial Crime,property Crime,theft/threshold: 0.5081
- Hate Speech,offensive Language/accuracy: 0.9493
- Hate Speech,offensive Language/precision: 0.7406
- Hate Speech,offensive Language/recall: 0.6681
- Hate Speech,offensive Language/f1: 0.7025
- Hate Speech,offensive Language/fpr: 0.0230
- Hate Speech,offensive Language/threshold: 0.4562
- Misinformation Regarding Ethics,laws And Safety/accuracy: 0.9774
- Misinformation Regarding Ethics,laws And Safety/precision: 0.2145
- Misinformation Regarding Ethics,laws And Safety/recall: 0.3228
- Misinformation Regarding Ethics,laws And Safety/f1: 0.2578
- Misinformation Regarding Ethics,laws And Safety/fpr: 0.0145
- Misinformation Regarding Ethics,laws And Safety/threshold: 0.0827
- Non Violent Unethical Behavior/accuracy: 0.8853
- Non Violent Unethical Behavior/precision: 0.7250
- Non Violent Unethical Behavior/recall: 0.6810
- Non Violent Unethical Behavior/f1: 0.7023
- Non Violent Unethical Behavior/fpr: 0.0640
- Non Violent Unethical Behavior/threshold: 0.4007
- Privacy Violation/accuracy: 0.9810
- Privacy Violation/precision: 0.7956
- Privacy Violation/recall: 0.8281
- Privacy Violation/f1: 0.8115
- Privacy Violation/fpr: 0.0110
- Privacy Violation/threshold: 0.4282
- Self Harm/accuracy: 0.9970
- Self Harm/precision: 0.8730
- Self Harm/recall: 0.6537
- Self Harm/f1: 0.7476
- Self Harm/fpr: 0.0007
- Self Harm/threshold: 0.7592
- Sexually Explicit,adult Content/accuracy: 0.9837
- Sexually Explicit,adult Content/precision: 0.6458
- Sexually Explicit,adult Content/recall: 0.7146
- Sexually Explicit,adult Content/f1: 0.6785
- Sexually Explicit,adult Content/fpr: 0.0097
- Sexually Explicit,adult Content/threshold: 0.4173
- Terrorism,organized Crime/accuracy: 0.9898
- Terrorism,organized Crime/precision: 0.4012
- Terrorism,organized Crime/recall: 0.5655
- Terrorism,organized Crime/f1: 0.4694
- Terrorism,organized Crime/fpr: 0.0068
- Terrorism,organized Crime/threshold: 0.3748
- Violence,aiding And Abetting,incitement/accuracy: 0.9219
- Violence,aiding And Abetting,incitement/precision: 0.8369
- Violence,aiding And Abetting,incitement/recall: 0.8774
- Violence,aiding And Abetting,incitement/f1: 0.8567
- Violence,aiding And Abetting,incitement/fpr: 0.0620
- Violence,aiding And Abetting,incitement/threshold: 0.5012
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 3.855503627114408e-06
- train_batch_size: 16
- eval_batch_size: 64
- seed: 42
- distributed_type: multi-GPU
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 10
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 | Macro Precision | Macro Recall | Micro F1 | Micro Precision | Micro Recall | Flagged/accuracy | Flagged/precision | Flagged/recall | Flagged/f1 | Flagged/aucpr | Flagged/fpr | Animal Abuse/accuracy | Animal Abuse/precision | Animal Abuse/recall | Animal Abuse/f1 | Animal Abuse/fpr | Animal Abuse/threshold | Child Abuse/accuracy | Child Abuse/precision | Child Abuse/recall | Child Abuse/f1 | Child Abuse/fpr | Child Abuse/threshold | Controversial Topics,politics/accuracy | Controversial Topics,politics/precision | Controversial Topics,politics/recall | Controversial Topics,politics/f1 | Controversial Topics,politics/fpr | Controversial Topics,politics/threshold | Discrimination,stereotype,injustice/accuracy | Discrimination,stereotype,injustice/precision | Discrimination,stereotype,injustice/recall | Discrimination,stereotype,injustice/f1 | Discrimination,stereotype,injustice/fpr | Discrimination,stereotype,injustice/threshold | Drug Abuse,weapons,banned Substance/accuracy | Drug Abuse,weapons,banned Substance/precision | Drug Abuse,weapons,banned Substance/recall | Drug Abuse,weapons,banned Substance/f1 | Drug Abuse,weapons,banned Substance/fpr | Drug Abuse,weapons,banned Substance/threshold | Financial Crime,property Crime,theft/accuracy | Financial Crime,property Crime,theft/precision | Financial Crime,property Crime,theft/recall | Financial Crime,property Crime,theft/f1 | Financial Crime,property Crime,theft/fpr | Financial Crime,property Crime,theft/threshold | Hate Speech,offensive Language/accuracy | Hate Speech,offensive Language/precision | Hate Speech,offensive Language/recall | Hate Speech,offensive Language/f1 | Hate Speech,offensive Language/fpr | Hate Speech,offensive Language/threshold | Misinformation Regarding Ethics,laws And Safety/accuracy | Misinformation Regarding Ethics,laws And Safety/precision | Misinformation Regarding Ethics,laws And Safety/recall | Misinformation Regarding Ethics,laws And Safety/f1 | Misinformation Regarding Ethics,laws And Safety/fpr | Misinformation Regarding Ethics,laws And Safety/threshold | Non Violent Unethical Behavior/accuracy | Non Violent Unethical Behavior/precision | Non Violent Unethical Behavior/recall | Non Violent Unethical Behavior/f1 | Non Violent Unethical Behavior/fpr | Non Violent Unethical Behavior/threshold | Privacy Violation/accuracy | Privacy Violation/precision | Privacy Violation/recall | Privacy Violation/f1 | Privacy Violation/fpr | Privacy Violation/threshold | Self Harm/accuracy | Self Harm/precision | Self Harm/recall | Self Harm/f1 | Self Harm/fpr | Self Harm/threshold | Sexually Explicit,adult Content/accuracy | Sexually Explicit,adult Content/precision | Sexually Explicit,adult Content/recall | Sexually Explicit,adult Content/f1 | Sexually Explicit,adult Content/fpr | Sexually Explicit,adult Content/threshold | Terrorism,organized Crime/accuracy | Terrorism,organized Crime/precision | Terrorism,organized Crime/recall | Terrorism,organized Crime/f1 | Terrorism,organized Crime/fpr | Terrorism,organized Crime/threshold | Violence,aiding And Abetting,incitement/accuracy | Violence,aiding And Abetting,incitement/precision | Violence,aiding And Abetting,incitement/recall | Violence,aiding And Abetting,incitement/f1 | Violence,aiding And Abetting,incitement/fpr | Violence,aiding And Abetting,incitement/threshold |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.0981 | 1.0 | 33814 | 0.0876 | 0.6562 | 0.6545 | 0.6458 | 0.6776 | 0.7395 | 0.7268 | 0.7526 | 0.8415 | 0.8396 | 0.8841 | 0.8613 | 0.8941 | 0.2119 | 0.9946 | 0.7646 | 0.7602 | 0.7624 | 0.0027 | 0.5964 | 0.9963 | 0.6647 | 0.6727 | 0.6687 | 0.0019 | 0.5832 | 0.9659 | 0.4529 | 0.5402 | 0.4927 | 0.0206 | 0.3675 | 0.9528 | 0.6938 | 0.7280 | 0.7105 | 0.0278 | 0.3675 | 0.9717 | 0.7271 | 0.7971 | 0.7605 | 0.0179 | 0.4806 | 0.9577 | 0.7594 | 0.8282 | 0.7923 | 0.0283 | 0.2728 | 0.9458 | 0.7101 | 0.6664 | 0.6876 | 0.0268 | 0.4249 | 0.9787 | 0.1706 | 0.1956 | 0.1823 | 0.0117 | 0.0748 | 0.8812 | 0.7166 | 0.6655 | 0.6901 | 0.0653 | 0.4804 | 0.9805 | 0.7884 | 0.8254 | 0.8065 | 0.0115 | 0.4695 | 0.9969 | 0.8828 | 0.6244 | 0.7314 | 0.0006 | 0.8366 | 0.9827 | 0.6217 | 0.7222 | 0.6682 | 0.0108 | 0.3495 | 0.9834 | 0.2634 | 0.5946 | 0.3650 | 0.0134 | 0.0967 | 0.9157 | 0.8259 | 0.8656 | 0.8453 | 0.0662 | 0.4330 |
| 0.0867 | 2.0 | 67628 | 0.0816 | 0.6724 | 0.6746 | 0.6686 | 0.6849 | 0.7530 | 0.7340 | 0.7730 | 0.8552 | 0.8580 | 0.8865 | 0.8721 | 0.9039 | 0.1840 | 0.9948 | 0.7908 | 0.7471 | 0.7683 | 0.0023 | 0.4211 | 0.9964 | 0.6727 | 0.6667 | 0.6697 | 0.0018 | 0.2705 | 0.9669 | 0.4676 | 0.5765 | 0.5164 | 0.0207 | 0.2295 | 0.9540 | 0.6955 | 0.7496 | 0.7215 | 0.0284 | 0.3040 | 0.9743 | 0.7627 | 0.7906 | 0.7764 | 0.0147 | 0.4955 | 0.9611 | 0.7825 | 0.8320 | 0.8065 | 0.0249 | 0.3739 | 0.9454 | 0.6890 | 0.7119 | 0.7003 | 0.0316 | 0.2720 | 0.9805 | 0.2224 | 0.2421 | 0.2318 | 0.0104 | 0.1813 | 0.8779 | 0.6855 | 0.7128 | 0.6989 | 0.0811 | 0.3166 | 0.9809 | 0.8008 | 0.8146 | 0.8076 | 0.0105 | 0.4479 | 0.9970 | 0.8754 | 0.6512 | 0.7469 | 0.0006 | 0.4472 | 0.9838 | 0.6431 | 0.7346 | 0.6858 | 0.0101 | 0.3303 | 0.9909 | 0.4367 | 0.4802 | 0.4574 | 0.0050 | 0.2108 | 0.9217 | 0.8357 | 0.8782 | 0.8564 | 0.0626 | 0.4335 |
| 0.056 | 3.0 | 101442 | 0.0808 | 0.6752 | 0.6790 | 0.6705 | 0.6956 | 0.7535 | 0.7420 | 0.7653 | 0.8544 | 0.8575 | 0.8854 | 0.8713 | 0.9034 | 0.1846 | 0.9948 | 0.7791 | 0.7587 | 0.7688 | 0.0025 | 0.5115 | 0.9965 | 0.6877 | 0.6877 | 0.6877 | 0.0017 | 0.4429 | 0.9661 | 0.4592 | 0.5993 | 0.5200 | 0.0223 | 0.2480 | 0.9541 | 0.6966 | 0.7492 | 0.7219 | 0.0282 | 0.3675 | 0.9738 | 0.7492 | 0.8048 | 0.7760 | 0.0161 | 0.4574 | 0.9608 | 0.7812 | 0.8294 | 0.8046 | 0.0250 | 0.5074 | 0.9492 | 0.7396 | 0.6688 | 0.7024 | 0.0232 | 0.4545 | 0.9774 | 0.2147 | 0.3228 | 0.2579 | 0.0145 | 0.0827 | 0.8852 | 0.7244 | 0.6816 | 0.7023 | 0.0643 | 0.3993 | 0.9810 | 0.7956 | 0.8281 | 0.8115 | 0.0110 | 0.4282 | 0.9970 | 0.8730 | 0.6537 | 0.7476 | 0.0007 | 0.7592 | 0.9837 | 0.6458 | 0.7146 | 0.6785 | 0.0097 | 0.4173 | 0.9898 | 0.4033 | 0.5634 | 0.4701 | 0.0067 | 0.3757 | 0.9220 | 0.8378 | 0.8766 | 0.8567 | 0.0615 | 0.5049 |
| 0.0891 | 4.0 | 135256 | 0.0811 | 0.6723 | 0.6781 | 0.6664 | 0.7004 | 0.7519 | 0.7374 | 0.7671 | 0.8571 | 0.8641 | 0.8819 | 0.8729 | 0.9059 | 0.1740 | 0.9952 | 0.8236 | 0.7398 | 0.7795 | 0.0018 | 0.5404 | 0.9965 | 0.6844 | 0.6967 | 0.6905 | 0.0018 | 0.4273 | 0.9672 | 0.4710 | 0.5733 | 0.5171 | 0.0204 | 0.3251 | 0.9536 | 0.6918 | 0.7517 | 0.7205 | 0.0289 | 0.4163 | 0.9725 | 0.7302 | 0.8122 | 0.7690 | 0.0179 | 0.5250 | 0.9606 | 0.7769 | 0.8354 | 0.8051 | 0.0259 | 0.4249 | 0.9505 | 0.7598 | 0.6549 | 0.7034 | 0.0204 | 0.5034 | 0.9804 | 0.2428 | 0.2873 | 0.2632 | 0.0110 | 0.1348 | 0.8789 | 0.6890 | 0.7120 | 0.7003 | 0.0797 | 0.3937 | 0.9814 | 0.8106 | 0.8122 | 0.8114 | 0.0099 | 0.5931 | 0.9969 | 0.8294 | 0.6878 | 0.752 | 0.0010 | 0.5784 | 0.9834 | 0.6328 | 0.7395 | 0.6820 | 0.0106 | 0.3115 | 0.9872 | 0.3396 | 0.6403 | 0.4438 | 0.0100 | 0.1689 | 0.9221 | 0.8473 | 0.8628 | 0.8550 | 0.0564 | 0.4513 |
| 0.072 | 5.0 | 169070 | 0.0827 | 0.6727 | 0.6749 | 0.6622 | 0.6944 | 0.7506 | 0.7370 | 0.7646 | 0.8537 | 0.8565 | 0.8854 | 0.8707 | 0.9028 | 0.1862 | 0.9950 | 0.7997 | 0.7485 | 0.7733 | 0.0022 | 0.5455 | 0.9967 | 0.7157 | 0.6727 | 0.6935 | 0.0015 | 0.4359 | 0.9681 | 0.4827 | 0.5619 | 0.5193 | 0.0190 | 0.3016 | 0.9549 | 0.7210 | 0.7065 | 0.7137 | 0.0236 | 0.3979 | 0.9720 | 0.7137 | 0.8402 | 0.7718 | 0.0201 | 0.3531 | 0.9600 | 0.7695 | 0.8404 | 0.8034 | 0.0271 | 0.3882 | 0.9489 | 0.7346 | 0.6724 | 0.7021 | 0.0239 | 0.4249 | 0.9781 | 0.2076 | 0.2832 | 0.2396 | 0.0133 | 0.1277 | 0.8813 | 0.7091 | 0.6825 | 0.6956 | 0.0694 | 0.4254 | 0.9810 | 0.8014 | 0.8176 | 0.8094 | 0.0105 | 0.5140 | 0.9968 | 0.8028 | 0.7049 | 0.7506 | 0.0012 | 0.5096 | 0.9817 | 0.5910 | 0.7789 | 0.6720 | 0.0133 | 0.2436 | 0.9895 | 0.3886 | 0.5364 | 0.4507 | 0.0068 | 0.1678 | 0.9205 | 0.8337 | 0.8760 | 0.8543 | 0.0633 | 0.4458 |
Framework versions
- Transformers 4.57.1
- Pytorch 2.7.1+cu118
- Datasets 4.4.1
- Tokenizers 0.22.1
- Downloads last month
- 32
Model tree for saiteki-kai/QA-DeBERTa-v3-large-6970
Base model
microsoft/deberta-v3-largeEvaluation results
- Accuracy on saiteki-kai/BeaverTails-itself-reported0.675