Merged
Binary file modified _cite/.cache/cache.db
Binary file not shown.
29 changes: 25 additions & 4 deletions _data/citations.yaml
@@ -1,5 +1,26 @@
# DO NOT EDIT, GENERATED AUTOMATICALLY

+- id: arxiv:2601.00397
+title: 'Revati: Transparent GPU-Free Time-Warp Emulation for LLM Serving'
+authors:
+- Amey Agrawal
+- Mayank Yadav
+- Sukrit Kumar
+- Anirudha Agrawal
+- Garv Ghai
+- Souradeep Bera
+- Elton Pinto
+- Sirish Gambhira
+- Mohammad Adain
+- Kasra Sohrab
+- Chus Antonanzas
+- Alexey Tumanov
+publisher: arXiv
+date: '2026-01-05'
+link: https://arxiv.org/abs/2601.00397
+image: images/publication_thumbnails/revati.png
+plugin: sources.py
+file: sources.yaml
- id: arxiv:2507.09019
title: On Evaluating Performance of LLM Inference Serving Systems
authors:
@@ -27,15 +48,15 @@
- Po-An Tsai
- Zhiding Yu
- Alexey Tumanov
-publisher: arXiv
+publisher: 21st European Conference on Computer Systems, 2026, Edinburgh
date: '2025-08-14'
link: https://arxiv.org/abs/2502.14051
image: images/publication_thumbnails/rocketkv.png
plugin: sources.py
file: sources.yaml
- id: arxiv:2409.17264
-title: 'Medha: Efficiently Serving Multi-Million Context Length LLM Inference Requests
-Without Approximations'
+title: 'No Request Left Behind: Tackling Heterogeneity in Long-Context LLM Inference
+with Medha'
authors:
- Amey Agrawal
- Haoran Qiu
@@ -47,7 +68,7 @@
- Alexey Tumanov
- Esha Choukse
publisher: arXiv
-date: '2025-06-23'
+date: '2025-11-27'
link: https://arxiv.org/abs/2409.17264
image: images/publication_thumbnails/medha.png
plugin: sources.py
3 changes: 3 additions & 0 deletions _data/sources.yaml
@@ -1,7 +1,10 @@
+- id: arxiv:2601.00397 # revati
+image: images/publication_thumbnails/revati.png
- id: arxiv:2507.09019 # on evaluating llm inf..
image: images/publication_thumbnails/eval-checklist.png
- id: arxiv:2502.14051 # maya
image: images/publication_thumbnails/maya.png
+publisher: "21st European Conference on Computer Systems, 2026, Edinburgh"
- id: arxiv:2502.14051 # rocketkv
image: images/publication_thumbnails/rocketkv.png
- id: arxiv:2409.17264 # medha
7 changes: 0 additions & 7 deletions _members/angelina-zhou

This file was deleted.

8 changes: 0 additions & 8 deletions _members/chathurvedhi-talapaneni.md

This file was deleted.

9 changes: 0 additions & 9 deletions _members/faisal-baig.md

This file was deleted.

10 changes: 0 additions & 10 deletions _members/nandan-parikh.md

This file was deleted.

9 changes: 0 additions & 9 deletions _members/uday-goyat.md

This file was deleted.

Binary file added images/publication_thumbnails/revati.png
1 change: 1 addition & 0 deletions index.md
@@ -5,6 +5,7 @@ The System for AI Lab (SAIL) at Georgia Tech, led by Prof. Alexey Tumanov, speci

# Recent News

+- Our paper on LLM training performance modeling via GPU emulation, [Maya](https://arxiv.org/abs/2503.20191), has been accepted at EuroSys'26.
- 🎉 Congratulations to Payman Behnam, Amey Agrawal, Alind Khare, and Dhruv Garg! Three papers accepted at ACM SIGOPS Operating Systems Review, July 2025.
- We are looking for contributors for our new inference engine [Vajra](https://project-vajra.github.io/). ⚡️
- Our paper on common anti-patterns in LLM inference system evaluations is now on [arXiv](https://arxiv.org/pdf/2507.09019).