SyFI Lab Systems for Future Intelligence

Blog

Let AI Agents Write Your Serving Stack with VibeServe

May 12, 2026

We present VibeServe, a multi-agent system that synthesizes a complete LLM serving runtime end-to-end, specialized to a user-specified model, hardware, and workload.
SyFI in January 2026: A Big Month for Systems-Driven AI Research

January 31, 2026

January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.
Meet LLMc: Beating All Compression with LLMs

October 03, 2025

We present LLMc, an open-source tool to compress natural language using LLMs as the world's most reference-packed dictionary.
Efficient Serving of SpeechLMs with VoxServe

September 29, 2025

We present VoxServe, a high-throughput, low-latency serving system designed specifically for Speech Language Models.