January 31, 2026
January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.October 03, 2025
We present LLMc, an open-source tool to compress natural language using LLMs as the world's most reference-packed dictionary.September 29, 2025
We present VoxServe, a high-throughput, low-latency serving system designed specifically for Speech Language Models.