2 35 9

nanatata

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

upvoted a paper 10 days ago

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

upvoted a paper 11 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

View all activity

Organizations

None yet

upvoted a paper 7 days ago

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published 8 days ago • 24

upvoted a paper 10 days ago

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Paper • 2601.06860 • Published 12 days ago • 16

upvoted a paper 11 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 14 days ago • 36

upvoted a paper 14 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 18 days ago • 100

upvoted 2 papers about 2 months ago

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Paper • 2512.03746 • Published Dec 3, 2025 • 17

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 151

liked a dataset about 2 months ago

We-Math/VTBench

Viewer • Updated Nov 26, 2025 • 500 • 59 • 7

upvoted 2 papers about 2 months ago

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published Nov 24, 2025 • 51

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 93

upvoted a paper 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 44

upvoted a paper 3 months ago

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6, 2025 • 97

liked 2 datasets 3 months ago

We-Math/V-Perception-40K

Viewer • Updated Nov 7, 2025 • 36.7k • 60 • 7

We-Math/V-Interaction-400K

Viewer • Updated Nov 7, 2025 • 253k • 448 • 14

liked a model 3 months ago

We-Math/V-Thinker

8B • Updated Nov 6, 2025 • 75 • 9

liked a dataset 3 months ago

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17, 2025 • 1.07k • 39 • 6

upvoted 5 papers 3 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29, 2025 • 66

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 78

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 100

Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation

Paper • 2510.17354 • Published Oct 20, 2025 • 35

nanatata

AI & ML interests

Recent Activity

Organizations

nanatata's activity