Siyin Wang (SII)

sinwang

https://sinwang20.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

OpenMOSS-Team/MOSS-VL-Base-0408

published a dataset 19 days ago

sinwang/omniaction-libero-spatial

updated a dataset 19 days ago

sinwang/omniaction-libero-spatial

Organizations

None yet

upvoted 2 papers about 1 month ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 423

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56

upvoted 5 papers 3 months ago

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published Jan 21 • 75

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published Jan 20 • 37

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 59

upvoted 2 papers 4 months ago

Multi-hop Reasoning via Early Knowledge Alignment

Paper • 2512.20144 • Published Dec 23, 2025 • 7

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 60

upvoted 3 papers 5 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 98

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 25

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242

upvoted a paper 6 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 98

upvoted a collection 6 months ago

RoboOmni

Proactive Robot Manipulation in Omni-modal Context • 9 items • Updated 23 days ago • 13

upvoted 4 papers 6 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 62

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24, 2025 • 25

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 47

upvoted a paper 8 months ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published Aug 16, 2025 • 73

upvoted a paper 10 months ago

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Paper • 2506.11886 • Published Jun 13, 2025 • 20