Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published 10 days ago • 27
Amharic Text Embedding Models Collection Text Embedding and ColBERT models based on Amharic RoBERTa and BERT for Amharic passage retrieval • 10 items • Updated Jun 11, 2025 • 6
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 223
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 21
Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 15
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 313
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ Jul 9, 2024 • 77
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities Paper • 2410.07722 • Published Oct 10, 2024 • 15