Submitted by HankYe
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
Duke Center for Computational Evolutionary Intelligence (CEI)
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
Duke Center for Computational Evolutionary Intelligence (CEI)