DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs Paper • 2503.15793 • Published Mar 20
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
SynthCypher: A Fully Synthetic Data Generation Framework for Text-to-Cypher Querying in Knowledge Graphs Paper • 2412.12612 • Published Dec 17, 2024 • 4
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 14
Azimuth: Systematic Error Analysis for Text Classification Paper • 2212.08216 • Published Dec 16, 2022
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Paper • 2406.11811 • Published Jun 17, 2024 • 16
Exploring Zero and Few-shot Techniques for Intent Classification Paper • 2305.07157 • Published May 11, 2023 • 1
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper • 2405.11157 • Published May 18, 2024 • 31
Reducing hallucination in structured outputs via Retrieval-Augmented Generation Paper • 2404.08189 • Published Apr 12, 2024 • 1
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference Paper • 2404.15420 • Published Apr 23, 2024 • 11
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders Paper • 2404.05961 • Published Apr 9, 2024 • 66
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts Paper • 2304.09548 • Published Apr 19, 2023