Tobias Völzing
wumingshi
AI & ML interests
None yet
Recent Activity
Updated a collection 24 days ago: Fundamental
Updated a collection 24 days ago: Agents
Updated a collection 24 days ago: Fine-Tuning
Organizations
None yet
LLM
- LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery
  Paper • 2310.18356 • Published • 24
- LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
  Paper • 2401.01325 • Published • 27
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
  Paper • 2401.02954 • Published • 48
Training
3D
Small
Fundamental
- In-Context Learning Creates Task Vectors
  Paper • 2310.15916 • Published • 43
- Point Transformer V3: Simpler, Faster, Stronger
  Paper • 2312.10035 • Published • 21
- Extending Context Window of Large Language Models via Semantic Compression
  Paper • 2312.09571 • Published • 16
- PanGu-π: Enhancing Language Model Architectures via Nonlinearity Compensation
  Paper • 2312.17276 • Published • 16
RAG
FLLM
Code Generation
- Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
  Paper • 2310.18628 • Published • 8
- CodeFusion: A Pre-trained Diffusion Model for Code Generation
  Paper • 2310.17680 • Published • 73
- Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
  Paper • 2401.00788 • Published • 23
- mistralai/Codestral-22B-v0.1
  22B • Updated • 15.5k • 1.31k
Fine-Tuning
- PockEngine: Sparse and Efficient Fine-tuning in a Pocket
  Paper • 2310.17752 • Published • 14
- Instruction-tuning Aligns LLMs to the Human Brain
  Paper • 2312.00575 • Published • 14
- LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
  Paper • 2401.01325 • Published • 27
- Secrets of RLHF in Large Language Models Part II: Reward Modeling
  Paper • 2401.06080 • Published • 28
REL
- Controlled Decoding from Language Models
  Paper • 2310.17022 • Published • 15
- Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
  Paper • 2310.20587 • Published • 18
- Inpainting-Guided Policy Optimization for Diffusion Large Language Models
  Paper • 2509.10396 • Published • 15
Reverse Engineering
Hallucination
Reasoning
Agents