4 15 2

Mikhail Seleznyov

myyycroft

Dont-Care-Didnt-Ask

AI & ML interests

NLP, Time series forecasting, Multimodal models.

Recent Activity

upvoted a paper 2 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

upvoted a paper 13 days ago

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

upvoted a paper 16 days ago

The Rogue Scalpel: Activation Steering Compromises LLM Safety

View all activity

Organizations

upvoted a paper 2 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published 13 days ago • 97

upvoted a paper 13 days ago

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

Paper • 2509.22033 • Published 23 days ago • 16

upvoted a paper 16 days ago

The Rogue Scalpel: Activation Steering Compromises LLM Safety

Paper • 2509.22067 • Published 23 days ago • 26

updated 3 datasets 22 days ago

published a dataset 22 days ago

myyycroft/GSM-Plus_model_generations

Viewer • Updated 22 days ago • 9.21k • 70

published 2 datasets 23 days ago

myyycroft/GSM-Symbolic_model_generations

Viewer • Updated 22 days ago • 5k • 75

myyycroft/aime_2025_model_generations

Viewer • Updated 22 days ago • 30 • 49

updated a model about 2 months ago

myyycroft/Qwen2.5-7B-Instruct-hyperfitted-easy-500

Text Generation • 8B • Updated Aug 29 • 1

published a model about 2 months ago

myyycroft/Qwen2.5-7B-Instruct-hyperfitted-easy-500

Text Generation • 8B • Updated Aug 29 • 1

updated a model about 2 months ago

myyycroft/Qwen2.5-7B-Instruct-hyperfitted-easy

Text Generation • 8B • Updated Aug 29 • 3

published a model about 2 months ago

myyycroft/Qwen2.5-7B-Instruct-hyperfitted-easy

Text Generation • 8B • Updated Aug 29 • 3

updated a model about 2 months ago

myyycroft/Qwen2-7B-Instruct-hyperfitted-easy

Text Generation • 8B • Updated Aug 29 • 1

published a model about 2 months ago

myyycroft/Qwen2-7B-Instruct-hyperfitted-easy

Text Generation • 8B • Updated Aug 29 • 1

New activity in apple/GSM-Symbolic about 2 months ago

Duplicate examples

#3 opened about 2 months ago by

myyycroft

New activity in qintongli/GSM-Plus about 2 months ago

Duplicate examples

#3 opened about 2 months ago by

myyycroft

upvoted a paper 2 months ago

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds

Paper • 2508.12782 • Published Aug 18 • 25

authored 2 papers 2 months ago

xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics

Paper • 2406.14553 • Published Jun 20, 2024 • 2

ViSTa Dataset: Do vision-language models understand sequential tasks?

Paper • 2411.13211 • Published Nov 20, 2024

Mikhail Seleznyov

AI & ML interests

Recent Activity

Organizations

myyycroft's activity

Duplicate examples

Duplicate examples