About
I am an Assistant Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. My research interests are in distributed systems, storage, and operating systems. I co-direct the Distributed And Storage Systems Laboratory (DASSL) at UIUC.
Research signals
Five dimensions sourced from public faculty / publication signals. Sign in to compare against your own profile and see your match score.
Research topics
- Computer Science
- Artificial Intelligence
- Programming language
- Algorithm
- Parallel computing
- Mathematics
- Theoretical computer science
Selected publications
AgileLog: A Forkable Shared Log for Agents on Data Streams
arXiv (Cornell University) · 2026-04-16
preprintOpen accessSenior authorIn modern data-streaming systems, alongside traditional programs, a new type of entity has emerged that can interact with streaming data: AI agents. Unlike traditional programs, AI agents use LLM reasoning to accomplish high-level tasks specified in natural language over streaming data. Unfortunately, current streaming systems cannot fully support agents: they lack the fundamental mechanisms to avoid the performance interference caused by agentic tasks and to safely handle agentic writes. We argue that the shared log, the core abstraction underlying streaming data, must support creating forks of itself, and that such a forkable shared log serves as a great substrate for agents acting on streaming data. We propose AgileLog, a new shared log abstraction that provides novel forking primitives for agentic use cases. We design Bolt, an implementation of the AgileLog abstraction, that uses novel techniques to make forks cheap, and provide logical and performance isolation.
A Logically Disaggregated Cache for Replicated Storage Systems - Artifact
Zenodo (CERN European Organization for Nuclear Research) · 2026-03-04
otherOpen accessThis repository contains the artifact for the paper "A Logically Disaggregated Cache for Replicated Storage Systems". The artifact includes the implementation of LDC in TWIG-KV, an eventually-consistent key-value store with primary-backup replication, along with scripts to run the system.
A Logically Disaggregated Cache for Replicated Storage Systems - Artifact
Zenodo (CERN European Organization for Nuclear Research) · 2026-03-04
otherOpen accessThis repository contains the artifact for the paper "A Logically Disaggregated Cache for Replicated Storage Systems". The artifact includes the implementation of LDC in TWIG-KV, an eventually-consistent key-value store with primary-backup replication, along with scripts to run the system.
AgileLog: A Forkable Shared Log for Agents on Data Streams
arXiv (Cornell University) · 2026-04-16
articleOpen accessSenior authorIn modern data-streaming systems, alongside traditional programs, a new type of entity has emerged that can interact with streaming data: AI agents. Unlike traditional programs, AI agents use LLM reasoning to accomplish high-level tasks specified in natural language over streaming data. Unfortunately, current streaming systems cannot fully support agents: they lack the fundamental mechanisms to avoid the performance interference caused by agentic tasks and to safely handle agentic writes. We argue that the shared log, the core abstraction underlying streaming data, must support creating forks of itself, and that such a forkable shared log serves as a great substrate for agents acting on streaming data. We propose AgileLog, a new shared log abstraction that provides novel forking primitives for agentic use cases. We design Bolt, an implementation of the AgileLog abstraction, that uses novel techniques to make forks cheap, and provide logical and performance isolation.
A Logically Disaggregated Cache for Replicated Storage Systems
2026-04-24
articleOpen accessSenior authorWe study if replicated storage systems effectively utilize the caches embedded within each replica. Our study reveals that existing systems manage the embedded caches in each replica in silos, leading to significant cache redundancy across replicas and consequently low performance. To address this problem, we introduce logically disaggregated cache (Ldc), a new approach to managing caches in replicated storage systems. Ldc disaggregates the embedded caches from the replicas to form a single, logical cache. Ldc then allows any replica to access any part of the logical cache, which reduces redundancy caused by reads. Because writes pollute all caches, Ldc quickly demotes written objects to limit redundancy caused by writes. Ldc, however, realizes that reducing redundancy may hurt performance in some cases and thus employs an online analyzer to strike a balance between cache redundancy and coverage. We implement Ldc in three systems: an eventually-consistent KV store, a strongly-consistent KV store, and a production database. Using microbenchmarks, macrobenchmarks, and real-world traces, we show that the Ldc versions perform significantly better than the original systems (e.g., 2.6× to 5.4× higher throughput in the eventually-consistent KV store under YCSB).
LazyLog: A New Shared Log Abstraction and Design for Modern Low-Latency Applications
ACM Transactions on Computer Systems · 2025-08-19
articleOpen accessSenior authorShared logs offer linearizable total order across storage shards. However, they enforce this order eagerly upon ingestion, leading to high latencies. We observe that in many modern shared-log applications, while linearizable ordering is necessary, it is not required eagerly when ingesting data but only later when data is consumed. Further, readers are naturally decoupled in time from writers in these applications. Based on this insight, we propose LazyLog, a novel shared log abstraction. LazyLog lazily binds records (across shards) to linearizable global positions and enforces this before a log position can be read. Such lazy ordering enables low ingestion latencies. Given the time decoupling, LazyLog can establish the order well before reads arrive, minimizing overhead upon reads. We build two LazyLog systems that provide linearizable total order across shards. Our experiments show that LazyLog systems deliver significantly lower latencies than conventional, eager-ordering shared logs.
LazyLog: A New Shared Log Abstraction for Low-Latency Applications
2024-11-04 · 6 citations
articleOpen accessSenior authorShared logs offer linearizable total order across storage shards. However, they enforce this order eagerly upon ingestion, leading to high latencies. We observe that in many modern shared-log applications, while linearizable ordering is necessary, it is not required eagerly when ingesting data but only later when data is consumed. Further, readers are naturally decoupled in time from writers in these applications. Based on this insight, we propose LazyLog, a novel shared log abstraction. LazyLog lazily binds records (across shards) to linearizable global positions and enforces this before a log position can be read. Such lazy ordering enables low ingestion latencies. Given the time decoupling, LazyLog can establish the order well before reads arrive, minimizing overhead upon reads. We build two LazyLog systems that provide linearizable total order across shards. Our experiments show that LazyLog systems deliver significantly lower latencies than conventional, eager-ordering shared logs.
SplitFT: Fault Tolerance for Disaggregated Datacenters via Remote Memory Logging
2024-04-18 · 2 citations
articleSenior authorWe introduce SplitFt, a new fault-tolerance approach for storage-centric applications in disaggregated data centers. SplitFt uses a novel split architecture, where large writes are directly performed on the underlying disaggregated storage system, while small writes are made fault-tolerant within the compute layer. The split architecture enables applications to achieve strong durability guarantees without compromising performance. SplitFt makes small writes fault-tolerant using a new abstraction called near-compute logs or Ncl, which leverages underutilized memory on remote nodes to log small writes in a fast, cheap, and transparent manner. We port three POSIX applications (RocksDB, Redis, and SQLite) to SplitFt and show that they offer strong guarantees compared to weak versions of the applications that can lose data; SplitFt applications do so while approximating weak versions' performance (only 0.1%-10% overhead under YCSB). Compared to strong versions, SplitFt improves performance significantly (2.5× to 27× under write-heavy workloads).
Exploiting Nil-external Interfaces for Fast Replicated Storage
ACM Transactions on Storage · 2022-06-06 · 3 citations
article1st authorCorrespondingDo some storage interfaces enable higher performance than others? Can one identify and exploit such interfaces to realize high performance in storage systems? This article answers these questions in the affirmative by identifying nil-externality , a property of storage interfaces. A nil-externalizing (nilext) interface may modify state within a storage system but does not externalize its effects or system state immediately to the outside world. As a result, a storage system can apply nilext operations lazily, improving performance. In this article, we take advantage of nilext interfaces to build high-performance replicated storage. We implement Skyros , a nilext-aware replication protocol that offers high performance by deferring ordering and executing operations until their effects are externalized. We show that exploiting nil-externality offers significant benefit: For many workloads, Skyros provides higher performance than standard consensus-based replication. For example, Skyros offers 3× lower latency while providing the same high throughput offered by throughput-optimized Paxos.
Zenodo (CERN European Organization for Nuclear Research) · 2021-09-21
datasetOpen access1st authorCorrespondingTraces for reproducing the results in our SOSP '21 paper: Exploiting Nil-Externality for Fast Replicated Storage.
Frequent coauthors
- 20 shared
Ramnatthan Alagappan
University of Illinois Urbana-Champaign
- 17 shared
Andrea C. Arpaci-Dusseau
University of Wisconsin–Madison
- 17 shared
Remzi H. Arpaci-Dusseau
- 4 shared
Sahana Srinivasan
Singapore National Eye Center
- 4 shared
T Mamatha
- 4 shared
S Abhinav
R.V. College of Engineering
- 4 shared
M R Anala
- 3 shared
Lili Qiu
Labs
UIUC Distributed and Storage Systems LabPI
Focuses on distributed and storage systems research
Education
Ph.D., Computer Sciences
University of Wisconsin – Madison
M.S.
Indian Institute of Technology – Bombay
B.S.
Indian Institute of Technology – Bombay
Awards & honors
- NSF CAREER Award 2024
- SOSP' 24 Best Paper Award
- FAST '20 Best Paper Award
- Facebook Ph.D., Fellowship, 2019
- FAST '18 Best Paper Award
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Aishwarya Ganesan
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup