Agents
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper
• 2310.08740
• Published • 15
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
• 2310.12823
• Published • 36
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
Paper
• 2308.10848
• Published • 1
CLEX: Continuous Length Extrapolation for Large Language Models
Paper
• 2310.16450
• Published • 10
An Early Evaluation of GPT-4V(ision)
Paper
• 2310.16534
• Published • 22
Personas as a Way to Model Truthfulness in Language Models
Paper
• 2310.18168
• Published • 5
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper
• 2311.00272
• Published • 11
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Paper
• 2311.02262
• Published • 14
Ultra-Long Sequence Distributed Transformer
Paper
• 2311.02382
• Published • 6
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
• 2311.02303
• Published • 12
Prompt Engineering a Prompt Engineer
Paper
• 2311.05661
• Published • 22
Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses
Paper
• 2312.00763
• Published • 23
Merlin: Empowering Multimodal LLMs with Foresight Minds
Paper
• 2312.00589
• Published • 27
Instruction-tuning Aligns LLMs to the Human Brain
Paper
• 2312.00575
• Published • 15
DeepCache: Accelerating Diffusion Models for Free
Paper
• 2312.00858
• Published • 23
PathFinder: Guided Search over Multi-Step Reasoning Paths
Paper
• 2312.05180
• Published • 10
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
• 2312.10003
• Published • 44
Faithful Persona-based Conversational Dataset Generation with Large Language Models
Paper
• 2312.10007
• Published • 11
Supervised Knowledge Makes Large Language Models Better In-context Learners
Paper
• 2312.15918
• Published • 9
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
Paper
• 2401.02669
• Published • 17
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
• 2401.05033
• Published • 18
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper
• 2401.02330
• Published • 18
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Paper
• 2401.16158
• Published • 20
Weaver: Foundation Models for Creative Writing
Paper
• 2401.17268
• Published • 45
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
• 2402.05140
• Published • 23
More Agents Is All You Need
Paper
• 2402.05120
• Published • 57
Premise Order Matters in Reasoning with Large Language Models
Paper
• 2402.08939
• Published • 28
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Paper
• 2402.09727
• Published • 38
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published • 109
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss
Paper
• 2402.10790
• Published • 42
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper
• 2402.17753
• Published • 19
GAIA: a benchmark for General AI Assistants
Paper
• 2311.12983
• Published • 245
Learning to Decode Collaboratively with Multiple Language Models
Paper
• 2403.03870
• Published • 21
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
Paper
• 2403.08715
• Published • 21
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Paper
• 2403.12881
• Published • 18
Evolutionary Optimization of Model Merging Recipes
Paper
• 2403.13187
• Published • 58
LLM Agent Operating System
Paper
• 2403.16971
• Published • 73
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Paper
• 2402.15538
• Published • 6
LLMs Simulate Big Five Personality Traits: Further Evidence
Paper
• 2402.01765
• Published
LLM Agents in Interaction: Measuring Personality Consistency and Linguistic Alignment in Interacting Populations of Large Language Models
Paper
• 2402.02896
• Published
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality
Paper
• 2402.14679
• Published
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper
• 2403.02246
• Published • 1
LLM Multi-Agent Systems: Challenges and Open Problems
Paper
• 2402.03578
• Published • 1
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
• 2402.14034
• Published • 13
Social Skill Training with Large Language Models
Paper
• 2404.04204
• Published • 15
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Paper
• 2404.18796
• Published • 71
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Paper
• 2405.01535
• Published • 124
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
• 2406.04692
• Published • 59
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
• 2406.18082
• Published • 48
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Paper
• 2406.19741
• Published • 60
LiteSearch: Efficacious Tree Search for LLM
Paper
• 2407.00320
• Published • 40
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
• 2407.01489
• Published • 65
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Paper
• 2407.04363
• Published • 34
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Paper
• 2407.03958
• Published • 21
LAMBDA: A Large Model Based Data Agent
Paper
• 2407.17535
• Published • 37
PERSONA: A Reproducible Testbed for Pluralistic Alignment
Paper
• 2407.17387
• Published • 20
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Paper
• 2408.01584
• Published • 10
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
Paper
• 2408.03615
• Published • 31
Generating novel experimental hypotheses from language models: A case study on cross-dative generalization
Paper
• 2408.05086
• Published • 5
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper
• 2408.06292
• Published • 128
Benchmarking Agentic Workflow Generation
Paper
• 2410.07869
• Published • 29
marcelbinz/Llama-3.1-Centaur-70B
Text Generation
• 71B • Updated • 641
• 82
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
• 2501.04227
• Published • 95