12 36 4

Melisa Russak

melisa

melisa-writer

AI & ML interests

I love definitions

Recent Activity

upvoted a paper 15 days ago

A New Pair of GloVes

updated a model 16 days ago

Writer/colab

upvoted a paper 20 days ago

Einstein Fields: A Neural Perspective To Computational General Relativity

View all activity

Organizations

upvoted a paper 15 days ago

A New Pair of GloVes

Paper • 2507.18103 • Published 16 days ago • 7

upvoted a paper 20 days ago

Einstein Fields: A Neural Perspective To Computational General Relativity

Paper • 2507.11589 • Published 24 days ago • 7

upvoted a paper 30 days ago

A Systematic Analysis of Hybrid Linear Attention

Paper • 2507.06457 • Published about 1 month ago • 22

upvoted an article about 1 month ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other •

about 1 month ago

• 639

upvoted a paper 2 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 267

upvoted a paper 3 months ago

Bielik v3 Small: Technical Report

Paper • 2505.02550 • Published May 5 • 68

upvoted 2 papers 6 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 132

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published Feb 5 • 44

upvoted a paper 7 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 283

upvoted a paper 8 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 90

upvoted 2 papers 9 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

upvoted an article 9 months ago

Article

Fine-tuning LLMs with Singular Value Decomposition

•

Jun 2, 2024

• 12

upvoted 3 papers 9 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 50

Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse

Paper • 2410.21333 • Published Oct 27, 2024 • 12

Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation

Paper • 2410.18565 • Published Oct 24, 2024 • 47

upvoted 2 papers 10 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 99

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 55

upvoted 2 papers 11 months ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5, 2024 • 93

ContextCite: Attributing Model Generation to Context

Paper • 2409.00729 • Published Sep 1, 2024 • 14