Model Card for Model ID

Aligned version of meta-llama/Llama-3.1-8B-Instruct for Logical Reasoning

Model Description

This is an aligned model using DPO in order to improve the base model's performance in formal reasoning in first-order logic.

  • Developed by: [Grupo de Ingeniería Lingüística]
  • Language(s) (NLP): [English]
  • License: [Whichever one Llama 3 uses]
  • Finetuned from model [meta-llama/Llama-3.1-8B-Instruct]:

Model Sources [optional]

  • Repository: [Github]
  • Paper: Into The Limits of Logic: Alignment Methods for Formal Logic Reasoning

Evaluation

Citation [optional]

BibTeX:

[More Information Needed]

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Kurosawama/Llama-3.1-8B-Instruct-Full-align

Finetuned
(1899)
this model

Dataset used to train Kurosawama/Llama-3.1-8B-Instruct-Full-align

Collection including Kurosawama/Llama-3.1-8B-Instruct-Full-align