llama-moe (LLaMA-MoE)

huxy912

authored a paper 3 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18 • 53

tongjingqi

authored 2 papers 5 months ago

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Paper • 2505.13886 • Published May 20 • 6

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning

Paper • 2405.06680 • Published May 5, 2024 • 1

Xiaoye08

authored a paper 5 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 89

Xiaoye08

authored a paper 6 months ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20 • 62

Xiaoye08

authored a paper 7 months ago

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13 • 41

Xiaoye08

authored a paper 8 months ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 42

Xiaoye08

authored 2 papers 9 months ago

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Paper • 2503.12821 • Published Mar 17 • 9

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7 • 8

huxy912

authored 2 papers 11 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 61

LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Paper • 2411.15708 • Published Nov 24, 2024

Xiaoye08

authored a paper 11 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 61

huxy912

updated 2 models about 1 year ago

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

8B • Updated Dec 3, 2024 • 10 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

8B • Updated Dec 3, 2024 • 27 • 3

Spico

authored 5 papers about 1 year ago

NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

Paper • 2410.11805 • Published Oct 15, 2024 • 14

ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM

Paper • 2408.12076 • Published Aug 22, 2024 • 12

Timo: Towards Better Temporal Reasoning for Language Models

Paper • 2406.14192 • Published Jun 20, 2024 • 1

Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark

Paper • 2405.08355 • Published May 14, 2024

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 21

Xiaoye08

authored a paper about 1 year ago

Mirror: A Universal Framework for Various Information Extraction Tasks

Paper • 2311.05419 • Published Nov 9, 2023

AI & ML interests

Team members 6

llama-moe's activity