SyFI Lab Systems for Future Intelligence

Blog

SyFI Team Wins CUDA Kernel Agent Contest at MLSys 2026

May 28, 2026

Team UW SyFI won three awards across two tracks at the FlashInfer AI Kernel Generation Contest at MLSys 2026 — every line of kernel code written by coding agents, not humans.
Let AI Agents Write Your Serving Stack with VibeServe

May 12, 2026

We present VibeServe, a multi-agent system that synthesizes a complete LLM serving runtime end-to-end, specialized to a user-specified model, hardware, and workload.
SyFI in January 2026: A Big Month for Systems-Driven AI Research

January 31, 2026

January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.
Meet LLMc: Beating All Compression with LLMs

October 03, 2025

We present LLMc, an open-source tool to compress natural language using LLMs as the world's most reference-packed dictionary.
Efficient Serving of SpeechLMs with VoxServe

September 29, 2025

We present VoxServe, a high-throughput, low-latency serving system designed specifically for Speech Language Models.