Sigrid Jin

sigridjineth

https://sigridjin.medium.com

AI & ML interests

Newbie

Recent Activity

liked a model about 3 hours ago

Omartificial-Intelligence-Space/Semantic-Ar-Qwen-Embed-0.6B

liked a model about 3 hours ago

tomaarsen/Qwen3-Embedding-0.6B-18-layers

liked a model about 8 hours ago

Qwen/Qwen3-Embedding-0.6B

sigridjineth's activity

upvoted an article 4 days ago

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

Article • 6 days ago • 23

upvoted a collection 5 days ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 17

upvoted a collection 7 days ago

VLM2Vec

The VLM2Vec embedding models. • 9 items • Updated about 1 month ago • 6

upvoted 3 collections about 1 month ago

VoRA

Everything for the paper "Vision as LoRA". • 10 items • Updated Apr 20 • 6

💜 Kotlin ML Pack

A collection of datasets, fine-tuned models and benchmarks to train your models for perfect Kotlin code generation. • 9 items • Updated Jun 11, 2024 • 23

Mellum

Series of code models by JetBrains • 5 items • Updated 17 days ago • 25

upvoted a paper about 1 month ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

upvoted 2 papers 3 months ago

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10 • 39

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published Mar 2 • 65

upvoted a collection 3 months ago

GemmaX2

GemmaX2 language models, including pretrained and instruction-tuned models in two sizes: 2B and 9B. • 7 items • Updated Feb 7 • 22

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 401

upvoted an article 3 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 148

upvoted 2 papers 6 months ago

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Paper • 2402.07440 • Published Feb 12, 2024 • 1

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2, 2024 • 12

upvoted a collection 6 months ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 1 day ago • 18

upvoted a paper 6 months ago

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published May 30, 2024 • 37

upvoted 4 collections 6 months ago

jina-clip

Multimodal text-image embeddings • 4 items • Updated Dec 14, 2024 • 11

MMTEB

Our contribution to the Massive Multilingual Text Embedding Benchmark (MMTEB). Retrieval and reranking benchmarks in 16 languages. • 4 items • Updated Jun 6, 2024 • 3

Arctic-embed

A collection of text embedding models optimized for retrieval accuracy and efficiency • 8 items • Updated Dec 5, 2024 • 23

ColPali Models

Pre-trained checkpoints for the ColPali model. • 8 items • Updated Jan 23 • 5