Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
13
46
58
Tong Zhu
Spico
Follow
yzhangcs's profile picture
JusenX's profile picture
21world's profile picture
26 followers
·
74 following
https://Spico197.github.io
TongZhu197
Spico197
AI & ML interests
Information Extraction, Mixture-of-Experts, LLM
Recent Activity
commented
on
a paper
1 day ago
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
upvoted
an
article
about 1 month ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
published
an
article
about 1 month ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
View all activity
Organizations
Spico
's models
7
Sort:Â Recently updated
Spico/LLaMA-MoE-v1-2_8-UniformSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
•
3
Spico/LLaMA-MoE-v1-2_8-DynamicSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
•
3
Spico/sheared-llama-2.7b-deita-6k-sft
Text Generation
•
3B
•
Updated
Feb 25, 2024
•
2
•
1
Spico/internlm2-7b-hf-llama
Text Generation
•
Updated
Feb 23, 2024
•
3
Spico/mirror-chinese-mrcqa-alpha
Updated
Dec 4, 2023
Spico/Humback-Myx
Text Generation
•
Updated
Aug 19, 2023
•
6
•
3
Spico/Humback-M0
Text Generation
•
Updated
Aug 18, 2023
•
4
•
3