Paper ³¹

2025

[Paper Note] Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm With SSD 11-06

[Paper Note] Supporting Our AI Overlords Redesigning Data Systems to Be Agent-First 10-30

[Paper Note] Hyperledger Fabric a Distributed Operating System for Permissioned Blockchains 10-25

[Paper Note] Fast State Restoration in LLM Serving With HCache 10-20

[Paper Note] Strata Hierarchical Context Caching for Long Context Language Model Serving 09-13

[Paper Note] Sparse Indexing Large Scale, Inline Deduplication Using Sampling and Locality 09-11

[Paper Note] H2O Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models 09-11

[Paper Note] IMPRESS an Importance-Informed Multi-Tier Prefix KV Storage System for Large Language Model Inference 09-06

[Paper Note] Orca a Distributed Serving System for Transformer-Based Generative Models 07-02

[Paper Note] Attentionstore Cost-Effective Attention Reuse Across Multi-Turn Conversations in Large Language Model Serving 07-02

[Paper Note] CacheGen KV Cache Compression and Streaming for Fast Large Language Model Serving 06-09

[Paper Note] SGLang Efficient Execution of Structured Language Model Programs 06-07

2024

[Paper Note] ALPS an Adaptive Learning, Priority OS Scheduler for Serverless Functions 09-03

[Paper Note] Demystifying and Checking Silent Semantic Violations in Large Distributed Systems 08-19

[Paper Note] Understanding the Performance Implications of the Design Principles in Storage-Disaggregated Databases 08-15

[Paper Note] Understanding, Detecting and Localizing Partial Failures in Large System Software 08-05

[Paper Note] Efficient Exposure of Partial Failure Bugs in Distributed Systems With Inferred Abstract States 07-31

[Paper Note] Finding a Needle in Haystack Facebook's Photo Storage 07-10

[Paper Note] Acto Automatic End-to-End Testing for Operation Correctness of Cloud System Management 07-03

[Paper Note] Wisckey Separating Keys From Values in Ssd-Conscious Storage 03-12

1
2