Category Archives: Papers

Papers published by the MaceSystems group.

Composable Reliability for Asynchronous Systems: Treating Failures as Slow Processes (USENIX ATC 2012)

Conventional distributed systems wisdom is to treat slow nodes the same as failed nodes, through the use of leases and timeouts, handling merely slow nodes by effectively rebooting or restarting them. While this wisdom has led to simpler systems that can … Continue reading

Posted in Papers | Leave a comment

Structured Comparative Analysis of Systems Logs to Diagnose Performance Problems (NSDI 2012)

This paper describes our work on Distalyzer: a tool for automatically diagnosing performance problems in distributed systems. It was accepted for publication at NSDI 2012, and is work done by Karthik Nagaraj, Charles Killian and Jennifer Neville. Diagnosis and correction … Continue reading

Posted in Papers | Leave a comment

Gatling: Automatic Attack Discovery in Large-Scale Distributed Systems (NDSS 2012)

Most distributed systems are designed to meet application-prescribed metrics that ensure availability and high performance for practical usage. However, compromised participants can manipulate protocol semantics through attacks that target the messages exchanged with honest nodes and degrade performance significantly. To date, … Continue reading

Posted in Papers | Leave a comment

Hierarchy-Aware Distributed Overlays in Data Centers using DC2 (COMSNETS 2012)

Today’s data center architectures are often built in the form of multi-rooted tree topologies with less overall bandwidth at higher levels of the tree than at the bottom. In data center parlance, this is referred to as the over-subscription factor, … Continue reading

Posted in Papers | Leave a comment

InContext: Simple Parallelism for Distributed Applications (HPDC 2011)

The event-driven model is a popular framework for building complex distributed applications while keeping the simplicity of an event-driven system, in which lower-level details can be hidden. However, introducing parallelism in the event-driven system has not … Continue reading

Posted in Papers | Leave a comment

Finding Latent Performance Bugs in Systems Implementations (FSE 2010)

Performance is one of the later but important concerns in building efficient distributed systems. Precise performance guarantees are essential for high-throughput and time-sensitive distributed applications, and hence developers have to chase performance. Most of the time, performance problems … Continue reading

Posted in Papers | 2 Comments