Алексеев Алексей

VictoriaWilliam

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

nataliaaolmo/hubert-base-ls960-gender_voice-finetuned5

upvoted a paper 5 days ago

Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility

upvoted a paper 8 days ago

R^3-SQL: Ranking Reward and Resampling for Text-to-SQL

Organizations

None yet

liked a model about 15 hours ago

nataliaaolmo/hubert-base-ls960-gender_voice-finetuned5

Updated about 16 hours ago • 1

upvoted a paper 5 days ago

Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility

Paper • 2605.06105 • Published 12 days ago • 3

upvoted a paper 8 days ago

R^3-SQL: Ranking Reward and Resampling for Text-to-SQL

Paper • 2604.25325 • Published 21 days ago • 3

upvoted a paper 12 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 13 days ago • 97

liked a dataset 12 days ago

cadene/droid

Preview • Updated Feb 27, 2025 • 253k • 16

liked a dataset 18 days ago

allenai/c4

Viewer • Updated Jan 9, 2024 • 10.4B • 804k • 573

liked a model 25 days ago

aioaneid/nanochat_n_layer_12_seq_len_1024_n_embd_1024

Updated about 4 hours ago • 2

upvoted 2 papers 26 days ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published 29 days ago • 22

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 27 days ago • 240

upvoted 4 papers about 1 month ago

Self-Execution Simulation Improves Coding Models

Paper • 2604.03253 • Published Mar 11 • 35

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

liked 4 models about 1 month ago

arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-4M-400K-0.1-reverse-padzero-99-128D-2L-2H-512I

Text Generation • 662k • Updated Apr 4 • 57 • 1

liked 2 datasets about 2 months ago

Eimhin03/NM3-irish-augmented-iter5

Viewer • Updated Apr 1 • 10.8k • 221 • 1

HuggingFaceFW/finephrase

Viewer • Updated Mar 31 • 1.02B • 410k • 113

upvoted a paper about 2 months ago

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Paper • 2603.26728 • Published Mar 20 • 12

VictoriaWilliam's activity