SyFI Lab Systems for Future Intelligence

The SyFI Lab at the University of Washington builds efficient and resilient infrastructure for the future of AI. As applications grow more complex, we bridge the gap between next-gen models and heterogeneous hardware through cross-stack innovation, delivering scalable, open-source systems validated by industrial partners.

Our research targets three key areas:

  • Efficient AI: Optimizing algorithms and systems to maximize performance for training and inference.
  • Flexible AI: Architecting systems that seamlessly adapt to diverse tasks, strategies, and model structures.
  • Resilient AI: Ensuring AI system reliability at scale while leveraging AI to improve infrastructure robustness.

Publications

VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?

Keisuke Kamahori, Shihang Li, Simon Peter, Baris Kasikci — (2026)

VoxServe: Streaming-Centric Serving System for Speech Language Models

Keisuke Kamahori, Wei-Tzu Lee, Atindra Jha, Rohan Kadekodi, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci — (2026)

Reducing the GPU Memory Bottleneck with Lossless Compression for ML

Aditya K Kamath, Arvind Krishnamurthy, Marco Canini, Simon Peter — European Conference on Computer Systems (EuroSys) (2026)

PDF
Read more »

Blog Posts

Let AI Agents Write Your Serving Stack with VibeServe

May 12, 2026

We present VibeServe, a multi-agent system that synthesizes a complete LLM serving runtime end-to-end, specialized to a user-specified model, hardware, and workload.
SyFI in January 2026: A Big Month for Systems-Driven AI Research

January 31, 2026

January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.
Meet LLMc: Beating All Compression with LLMs

October 03, 2025

We present LLMc, an open-source tool to compress natural language using LLMs as the world's most reference-packed dictionary.
Read more »

Talks

The Wrong Contract at Every Layer: Redesigning OS and Hardware for AI Datacenters

May 15, 2026 Dimitrios Skarlatos — Carnegie Mellon University

Scaling Distributed Systems Designs

May 8, 2026 Jialin Li — National University of Singapore

Read more »