Song Qiang

namespace-sq

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

upvoted a paper 2 months ago

IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection

liked a dataset 2 months ago

yandex/yambda

View all activity

Organizations

None yet

upvoted 2 papers 2 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 68

IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection

Paper • 2506.00979 • Published Jun 1 • 13

liked a dataset 2 months ago

yandex/yambda

Viewer • Updated 19 days ago • 5.31B • 19.6k • 191

liked a model 2 months ago

hfl/chinese-roberta-wwm-ext

Fill-Mask • Updated Mar 1, 2022 • 81.4k • • 352

upvoted an article 6 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 876

liked a Space about 1 year ago

1.02k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model over 1 year ago

CohereLabs/c4ai-command-r-plus

Text Generation • 104B • Updated Apr 16 • 3.21k • • 1.75k

upvoted a paper over 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 41

liked 2 models over 1 year ago

BAAI/bge-m3

togethercomputer/m2-bert-80M-32k-retrieval

upvoted a paper over 1 year ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 64

upvoted 2 collections over 1 year ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 244

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 141

liked a model over 1 year ago

microsoft/phi-2

Text Generation • 3B • Updated Apr 29, 2024 • 882k • 3.38k

liked 2 Spaces over 1 year ago

963

Model Memory Utility

🚀

Calculate memory usage for training models

2.41k

Whisper

📉

Transcribe audio or YouTube videos into text

liked 3 models almost 2 years ago

liked a dataset almost 2 years ago

BAAI/COIG-PC

Viewer • Updated Jun 14, 2024 • 540M • 527 • 270