Byron Gibson (bgibson)

AI & ML interests: None yet
Organizations: None yet
papers

- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
  Paper • 2401.09417 • Published • 62
- MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
  Paper • 2401.04081 • Published • 73
- SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
  Paper • 2312.07987 • Published • 41
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
  Paper • 2401.06066 • Published • 56
llm-models

llm-local

- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 260
- Transformers to Core ML (Space, paused)
  ⚡ Display a loading screen with a spinner
- enterprise-explorers/Llama-2-7b-chat-coreml
  Text Generation • Updated • 5.54k • 138
- tiiuae/falcon-7b-instruct
  Text Generation • 7B • Updated • 94.1k • 1.02k
llm-datasets
llm-analysis