AI & ML interests
Multimodal Large Language Models, Unified SVG Tasks
Recent Activity
We are the InternSVG team from the Shanghai AI Laboratory, dedicated to empowering the InternVL series models with unified capabilities for SVG vector graphic understanding, editing, and generation.
Current Work:
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
The InternSVG Family ā a comprehensive suite that unifies data, benchmarks, and models for SVG understanding, editing, and generation. It consists of:
š§© SAgoge ā the largest and most diverse multimodal SVG dataset, covering icons, illustrations, chemistry diagrams, and dynamic animations;
š SArena ā a companion benchmark offering unified task definitions and standardized evaluation protocols across SVG domains;
š¤ InternSVG Models ā multimodal large language models trained for SVG understanding, editing, and generation.
Project Links
š Project Page: https://hmwang2002.github.io/release/internsvg/
š ArXiv Paper: https://arxiv.org/abs/2510.11341
š» GitHub Repository: https://github.com/hmwang2002/InternSVG
š SArena Benchmark: https://huggingface.co/datasets/InternSVG/SArena
š¦ SAgoge Dataset and InternSVG Model Weights ā coming soon