Maria Marina's picture

2 18

Maria Marina

zlatamaria

·

marialysyuk

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

The Rogue Scalpel: Activation Steering Compromises LLM Safety

upvoted a paper 2 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

upvoted a paper 13 days ago

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

View all activity

Organizations

upvoted 2 papers 2 days ago

The Rogue Scalpel: Activation Steering Compromises LLM Safety

Paper • 2509.22067 • Published 23 days ago • 26

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published 13 days ago • 97

upvoted a paper 13 days ago

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

Paper • 2509.22033 • Published 23 days ago • 16

upvoted 3 papers 2 months ago

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published Aug 15 • 39

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds

Paper • 2508.12782 • Published Aug 18 • 25

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7 • 46

upvoted 5 papers 4 months ago

DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Paper • 2505.20975 • Published May 27 • 36

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 131

Image Reconstruction as a Tool for Feature Analysis

Paper • 2506.07803 • Published Jun 9 • 29

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 139

upvoted 3 papers 5 months ago

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Paper • 2506.04089 • Published Jun 4 • 47

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Paper • 2505.21189 • Published May 27 • 61

Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images

Paper • 2505.07704 • Published May 12 • 29

upvoted 2 papers 7 months ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 231

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28 • 132

upvoted 2 papers 8 months ago

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19 • 36

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 91