replication

  • Reading Group. Log-structured Protocols in Delos

    ·

    Placeholder Icon

    For the 87th DistSys paper, we looked at “Log-structured Protocols in Delos” by Mahesh Balakrishnan, Chen Shen, Ahmed Jafri, Suyog Mapara, David Geraghty, Jason Flinn Vidhya Venkat, Ivailo Nedelchev, Santosh Ghosh, Mihir Dharamshi, Jingming Liu, Filip Gruszczynski, Jun Li Rounak Tibrewal, Ali Zaveri, Rajeev Nagar, Ahmed Yossef, Francois Richard, Yee Jiun Song. The paper appeared…

    Read More

  • Reading Group. NrOS: Effective Replication and Sharing in an Operating System

    ·

    Placeholder Icon

    The 77thth paper discussion in our reading group was “NrOS: Effective Replication and Sharing in an Operating System” from OSDI’21. While not a distributed systems paper, it borrows high-level distributed systems ideas (namely, state machine replication) to create a new NUMA-optimized sequential kernel. See, all modern machines have many CPU cores. OS kernels must be…

    Read More

  • Reading Group. Avocado: A Secure In-Memory Distributed Storage System

    ·

    Placeholder Icon

    Our 76th reading group meeting covered “Avocado: A Secure In-Memory Distributed Storage System” ATC’21 paper. Unfortunately, the original presenter of the paper could not make it to the discussion, and I had to improvise the presentation on the fly: So, the Avocado paper builds a distributed in-memory key-value database with a traditional complement of operations:…

    Read More

  • Reading Group. Viewstamped Replication Revisited

    ·

    Placeholder Icon

    Our 74th paper was a foundational one — we looked at Viestamped Replication protocol through the lens of the “Viewstamped Replication Revisited” paper. Joran Dirk Greef presented the protocol along with bits of his engineering experience using the protocol in practice. Viestamped Replication (VR) solves the problem of state machine replication in a crash fault…

    Read More

  • Reading Group. Meerkat: Multicore-Scalable Replicated Transactions Following the Zero-Coordination Principle

    ·

    Placeholder Icon

    Our 72nd paper was on avoiding coordination as much as possible. We looked at the “Meerkat: Multicore-Scalable Replicated Transactions Following the Zero-Coordination Principle” EuroSys’20 paper by Adriana Szekeres, Michael Whittaker, Jialin Li, Naveen Kr. Sharma, Arvind Krishnamurthy, Dan R. K. Ports, Irene Zhang. As the name suggests, this paper discusses coordination-free distributed transaction execution. In…

    Read More

  • Reading Group. Fault-Tolerant Replication with Pull-Based Consensus in MongoDB

    ·

    Placeholder Icon

    In the last reading group meeting, we discussed MongoDB‘s replication protocol, as described in the “Fault-Tolerant Replication with Pull-Based Consensus in MongoDB” NSDI’21 paper. Our reading group has a few regular members from MongoDB, and this time around, Siyuan Zhou, one of the paper authors, attended the discussion, so we had a perfect opportunity to…

    Read More

  • Scalable but Wasteful or Why Fast Replication Protocols are Actually Slow

    ·

    In the last decade or so, quite a few new state machine replication protocols emerged in the literature and the internet. I am “guilty” of this myself, with the PigPaxos appearing in this year’s SIGMOD and the PQR paper at HotStorage’19. There are better-known examples as well — EPaxos inspired a lot of development in…

    Read More

  • Reading Group. Strong and Efficient Consistency with Consistency-Aware Durability

    ·

    Placeholder Icon

    In the 62nd reading group session, we covered the “Strong and Efficient Consistency with Consistency-Aware Durability” paper from FAST’20. Jesse did an excellent presentation for the group that explains the core of the paper rather well: This paper describes a problem with many leader-based replication protocols. It specifically focuses on ZooKeper and Zab, but similar…

    Read More

  • Reading Group. Near-Optimal Latency Versus Cost Tradeoffs in Geo-Distributed Storage

    ·

    Placeholder Icon

    Short Summary Yesterday we discussed Pando, a geo-replication system achieving near-optimal latency-cost tradeoff in storage systems. Pando uses large Flexible Paxos deployments and erasure coding to do its magic. Pando relies on having many storage sites to locate sites closer to users. It then uses Flexible Paxos to optimize read and write quorums to have…

    Read More

  • Reading Group. Autoscaling Tiered Cloud Storage in Anna.

    ·

    Placeholder Icon

    This week we looked at “Autoscaling Tiered Cloud Storage in Anna.” This is the second Anna paper. The first one introduces Anna Key-Value store, and the second paper talks about various “cloud-native” improvements. The presentation by Michael Whittaker is available here: Short Summary Anna is an eventual-consistent key-value data store, where each value is a…

    Read More