Together

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

2,678,734 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Recent Activity

KaiserWhoLearns authored a paper 21 days ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

KaiserWhoLearns authored a paper about 1 month ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

KaiserWhoLearns submitted a paper about 1 month ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

View all activity

Papers

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

View all Papers

Articles

Fine-tune Any LLM from the Hugging Face Hub with Together AI

KaiserWhoLearns

authored a paper 21 days ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

Paper • 2506.06485 • Published Jun 6, 2025 • 5

KaiserWhoLearns

authored a paper about 1 month ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

KaiserWhoLearns

submitted a paper to Daily Papers about 1 month ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

submitted a paper to Daily Papers about 1 month ago

Introspective Diffusion Language Models

Paper • 2604.11035 • Published Apr 13 • 24

KaiserWhoLearns

authored a paper 2 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

KaiserWhoLearns

submitted a paper to Daily Papers 2 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

submitted a paper to Daily Papers 3 months ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Paper • 2602.21196 • Published Feb 24 • 7

KaiserWhoLearns

authored a paper 3 months ago

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Paper • 2602.02905 • Published Feb 2 • 5

posted an update 10 months ago

Post

373

🚀 Full-Quality Wan2.2 Video Generation on a single 24GB GPU — Powered by DFloat11

We just released the DFloat11 compressed Wan2.2 models. Now you can run full-quality Wan2.2 video generation on a single 24GB GPU, thanks to DFloat11 compression and CPU offloading.

🔗 Image-to-Video: DFloat11/Wan2.2-I2V-A14B-DF11
🔗 Text-to-Video: DFloat11/Wan2.2-T2V-A14B-DF11

authored a paper about 1 year ago

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12, 2025 • 19

authored a paper about 1 year ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15, 2025 • 31

authored 2 papers over 1 year ago

Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences

Paper • 2502.01126 • Published Feb 3, 2025 • 4

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 126

authored a paper over 1 year ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 58

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 58

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 58

authored 3 papers over 1 year ago

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

Paper • 1911.02557 • Published Nov 6, 2019

A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

Paper • 2204.10815 • Published Apr 22, 2022

Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI

Paper • 2205.00029 • Published Apr 29, 2022