Stoney Kang

sikang99

AI & ML interests

Remote Control based on Vision

Recent Activity

upvoted a paper about 22 hours ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published 4 days ago • 111

upvoted 8 papers 2 days ago

HY3D-Bench: Generation of 3D Assets

Paper • 2602.03907 • Published 4 days ago • 22

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering

Paper • 2601.22859 • Published 8 days ago • 16

VLS: Steering Pretrained Robot Policies via Vision-Language Models

Paper • 2602.03973 • Published 3 days ago • 20

SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

Paper • 2602.02402 • Published 4 days ago • 31

EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models

Paper • 2602.04515 • Published 3 days ago • 33

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 2 days ago • 228

MARS: Modular Agent with Reflective Search for Automated AI Research

Paper • 2602.02660 • Published 4 days ago • 56

YOLOE-26: Integrating YOLO26 with YOLOE for Real-Time Open-Vocabulary Instance Segmentation

Paper • 2602.00168 • Published 8 days ago • 1

upvoted 2 papers 3 days ago

Generative Visual Code Mobile World Models

Paper • 2602.01576 • Published 5 days ago • 38

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published 6 days ago • 217

upvoted a collection 4 days ago

Open-AgentRL

RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated 4 days ago • 5

upvoted 3 papers 4 days ago

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published 4 days ago • 30

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published 8 days ago • 28

PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction

Paper • 2601.22046 • Published 8 days ago • 21

upvoted 4 papers 5 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 8 days ago • 81

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

Paper • 2601.22904 • Published 8 days ago • 13

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Paper • 2601.20218 • Published 10 days ago • 15

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published 7 days ago • 129

upvoted a collection 6 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 18 items • Updated 2 days ago • 30