Icelandic Irony Detector (RoBERTa, Icelandic)
Task: binary sequence classification → ["not_ironic", "ironic"]
Base model: mideind/IceBERT-igc (Icelandic RoBERTa)
Intended domain: Icelandic social-media style text (short, informal; emojis, punctuation variants).
TL;DR
A compact Icelandic RoBERTa fine-tuned to recognize ironic vs non-ironic sentences.
Useful as a filter to reduce misunderstandings in online communication.
How to use
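import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Load the fine-tuned tokenizer and classifier from the Hub
model_id = "ambj24/icelandic-irony"
tok = AutoTokenizer.from_pretrained(model_id)
mod = AutoModelForSequenceClassification.from_pretrained(model_id)

text = "Já, frábær þjónusta… 😑"
# Truncate to the ~128-token length used in training
inputs = tok(text, return_tensors="pt", truncation=True, max_length=128)
with torch.no_grad():  # inference only, no gradients needed
    logits = mod(**inputs).logits
probs = logits.softmax(-1).tolist()[0]
labels = ["not_ironic", "ironic"]  # index 0, index 1 (order from the task definition)
pred = labels[probs.index(max(probs))]
print(pred, dict(zip(labels, probs)))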
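The snippet above returns the most likely label. When the model is used as a filter, as the TL;DR suggests, it may be more practical to threshold the probability of the "ironic" class instead. The following is a minimal sketch in plain Python, not part of the released model: the helper names `softmax` and `is_ironic` and the `threshold` parameter are hypothetical, and the label order is assumed to be the one stated at the top of this card.

```python
import math

# Label order as stated in the card: index 0 = not_ironic, index 1 = ironic
LABELS = ["not_ironic", "ironic"]


def softmax(logits):
    """Numerically stable softmax over a plain list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def is_ironic(logits, threshold=0.5):
    """Return (flag, p_ironic) for one example.

    `threshold` is a hypothetical tuning knob (not from the card):
    raise it to flag fewer posts, lower it to flag more.
    """
    p_ironic = softmax(logits)[LABELS.index("ironic")]
    return p_ironic >= threshold, p_ironic


# Made-up logits for illustration only
flag, p_ironic = is_ironic([-1.0, 3.0], threshold=0.9)
print(flag, round(p_ironic, 3))
```

With the real model, the logits for a single text could be passed as `mod(**inputs).logits[0].tolist()`. A suitable threshold would have to be chosen on held-out data; the card does not specify one.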
Input length: works well for short posts; we trained with a max length around 128 tokens.
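Because training used a maximum length of about 128 tokens, longer inputs are probably best truncated to that length at inference time. The sketch below wraps the tokenizer call for a batch of posts. The helper name `encode_posts` is hypothetical, and the default `max_length=128` is an assumption based on the "around 128" figure above; `padding`, `truncation`, `max_length`, and `return_tensors` are standard tokenizer call arguments in `transformers`.

```python
def encode_posts(tokenizer, texts, max_length=128):
    """Tokenize a batch of posts, truncating each to `max_length` tokens.

    `tokenizer` is expected to be a Hugging Face tokenizer, e.g. the `tok`
    object loaded in "How to use". The default of 128 is an assumption
    taken from the approximate training length mentioned in this card.
    """
    return tokenizer(
        list(texts),
        padding=True,          # pad shorter posts to the longest in the batch
        truncation=True,       # cut posts that exceed max_length
        max_length=max_length,
        return_tensors="pt",   # PyTorch tensors, as in the usage snippet
    )
```

The result can be passed to the model as `mod(**encode_posts(tok, posts)).logits`, which yields one row of logits per post.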
Mix: Icelandic social-media style text.