Sebastian Torres

sebastiantorres

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a dataset 2 days ago

hf-doc-build/doc-build-dev

Updated 30 days ago • 729k • 24

upvoted a paper 4 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 7 days ago • 254

upvoted a paper 6 days ago

Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs

Paper • 2605.07153 • Published 12 days ago • 7

liked a model 9 days ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 5.97k • 4.96k

liked a model 13 days ago

DCSlucifer/lab21-qwen25-3b-lora-r16

Updated 13 days ago • 1

liked a model 19 days ago

colbert-ir/colbertv2.0

Updated Apr 5, 2024 • 17.5M • 344

liked a dataset 27 days ago

harii999/gt

Updated about 3 hours ago • 10.8k • 3

upvoted a paper 27 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

liked 2 models about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 1.74k • 906

thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit

Any-to-Any • 9B • Updated Apr 13 • 357 • 2

upvoted a paper about 1 month ago

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Paper • 2604.02648 • Published Apr 3 • 47

liked a model about 1 month ago

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17, 2025 • 6.67M • 847

liked a dataset about 1 month ago

DCAgent2/terminal_bench_2_b1_top32_seq_20260407_182112

Viewer • Updated Apr 8 • 258 • 24 • 1

liked a model about 2 months ago

manja316/modelscan-bypass-marshal

Updated Apr 11 • 1

upvoted a paper about 2 months ago

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

Paper • 2603.24157 • Published Mar 25 • 10

liked 2 datasets about 2 months ago

daaxila/twitter-jiwawa1314-2026.03.25-2036706214286155974-bR-32XcIA0k9_o8D-part1

Viewer • Updated Apr 1 • 1 • 17 • 1

OpenMOSS-Team/OmniAction

Updated Mar 27 • 135k • 280

upvoted 3 papers 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210
