May 12, 2026
We present VibeServe, a multi-agent system that synthesizes a complete LLM serving runtime end-to-end, specialized to a user-specified model, hardware, and workload.January 31, 2026
January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.October 03, 2025
We present LLMc, an open-source tool to compress natural language using LLMs as the world's most reference-packed dictionary.September 29, 2025
We present VoxServe, a high-throughput, low-latency serving system designed specifically for Speech Language Models.