Kurosawama
/

Llama-3.1-8B-Instruct-Full-align

first-order-logic

Model card Files Files and versions

Model Card for Model ID

Aligned version of meta-llama/Llama-3.1-8B-Instruct for Logical Reasoning

Model Description

This is an aligned model using DPO in order to improve the base model's performance in formal reasoning in first-order logic.

Developed by: [Grupo de Ingeniería Lingüística]
Language(s) (NLP): [English]
License: [Whichever one Llama 3 uses]
Finetuned from model [meta-llama/Llama-3.1-8B-Instruct]:

Model Sources [optional]

Repository: [Github]
Paper: Into The Limits of Logic: Alignment Methods for Formal Logic Reasoning

Evaluation

Citation [optional]

BibTeX:

[More Information Needed]

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Kurosawama/Llama-3.1-8B-Instruct-Full-align

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Finetuned

(1899)

this model

Dataset used to train Kurosawama/Llama-3.1-8B-Instruct-Full-align

Collection including Kurosawama/Llama-3.1-8B-Instruct-Full-align

Into The Limits of Logic

Models and datasets used in the experiments reported in the paper Into The Limits of Logic: Alignment Methods for Logical Reasoning • 14 items • Updated Oct 2