Since its inception in the 1980s, distributed consensus has been the subject of extensive academic research. Whilst definitions vary, distributed consensus (or equivalently, atomic broadcast) most often refers to the problem of how a set of distributed nodes can decide an ordered sequence of values. This can be used to implement an append-only replicated log, which can be utilized, directly or indirectly, to provide services such as primary-backup replication or state machine replication. These abstractions can, in turn, form the building blocks of new abstractions, such as a distributed key-value store. Some consensus algorithms instead decide only a single value or a partially ordered sequence of values.
What unifies distributed consensus algorithms is that they are always safe, regardless of delays and crashes (though they are not necessarily Byzantine fault tolerant), and are guaranteed to make progress provided sufficient liveness conditions hold.
This is a long list of papers relating to distributed consensus. Many of the papers listed below fit into more than one section. However, for simplicity, each paper is listed only in the most relevant section. Where possible, open access links for each paper have been provided. Contributions via pull requests are welcome.
⭐️ Influential papers - If you are looking for a starting point, a subset of the most influential papers on distributed consensus are highlighted using a yellow star. ⭐️
💎 Hidden gems - Papers which I personally love but are not as highly cited as the influential papers 💎
SoK: A Generalized Multi-Leader State Machine Replication Tutorial, JSys 2021 [pdf]
Algorithms for consensus
This section lists papers describing algorithms for distributed consensus. These papers tend to be theory papers (venues such as PODC, DISC, OPODIS) whereas the Implementations of consensus section focuses on systems papers.
Another Advantage of Free Choice: Completely Asynchronous Agreement Protocols, PODC 1983 [acmdl]
Also known as the Ben-Or algorithm
Reliable communication in the presence of failures, TOCS 1987 [acmdl,pdf]
⭐️ Consensus in the Presence of Partial Synchrony, JACM 1988 [acmdl,pdf]
Viewstamped Replication: A New Primary Copy Method to Support Highly-Available Distributed Systems, PODC 1988 [acmdl,pdf]
This paper describes how to replace acceptors in Paxos with disks
Each disk is divided into blocks, one for each proposer. Each proposer may write only to its own block and read from the other blocks, which it does in each of the two usual Paxos phases
Each block contains the rough equivalent of the last promised ballot number and last accepted proposal for the assigned proposer
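The per-proposer block and the write-then-read phase step described above might be sketched as follows (a minimal sketch with hypothetical names, not the paper's actual data layout):

```python
from dataclasses import dataclass
from typing import Any, Optional

@dataclass
class DiskBlock:
    """One proposer's block on one disk (hypothetical field names)."""
    promised_ballot: int = 0               # ~ last promised ballot number
    accepted_ballot: int = 0               # ballot of the last accepted proposal
    accepted_value: Optional[Any] = None   # value of the last accepted proposal

def make_disk(proposer_ids):
    """A disk holds one block per proposer."""
    return {p: DiskBlock() for p in proposer_ids}

def phase_step(disk, proposer_id, block):
    """In each of the two Paxos phases, a proposer writes its own block
    and then reads every block on the disk."""
    disk[proposer_id] = block
    return dict(disk)  # snapshot of all blocks read back
```
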
Specifying and Using a Partitionable Group Communication Service, TOCS 2001 [acmdl,pdf]
Active Disk Paxos with infinitely many processes, PODC 2002 [acmdl,pdf]
This paper makes Disk Paxos more “Paxos like” by assuming the disks support richer operations, e.g. conditional writes
ADP claims that Disk Paxos requires a fixed set of proposers, a limitation which ADP removes.
Variant of Paxos where proposers can bypass the leader, allowing multiple values to be proposed in the same ballot. This requires stronger quorum intersection, e.g. Fast Paxos needs 3/4 of acceptors (instead of a simple majority) to provide the same liveness guarantees as classic Paxos.
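The quorum-size trade-off above can be checked with a quick calculation (a sketch using the common ⌈3n/4⌉ bound for fast quorums; a classic quorum must intersect the intersection of any two fast quorums, i.e. q + 2·fq ≥ 2n + 1):

```python
import math

def classic_quorum(n: int) -> int:
    """Classic Paxos: any two quorums must intersect."""
    return n // 2 + 1

def fast_quorum(n: int) -> int:
    """Fast Paxos: any classic quorum must intersect the
    intersection of any two fast quorums."""
    return math.ceil(3 * n / 4)

# The intersection condition q + 2*fq >= 2n + 1 holds for all n:
for n in range(1, 100):
    assert classic_quorum(n) + 2 * fast_quorum(n) >= 2 * n + 1
```

For five acceptors this gives a classic quorum of 3 but a fast quorum of 4, which is the source of the reduced fault tolerance on the fast path.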
Consensus on Transaction Commit, TODS 2006 [acmdl,pdf]
Variant of Paxos which replaces the single leader with a group of leaders. Clients send operations to all leaders, and each leader proposes values to the acceptors. Acceptors only accept a value once they have received proposals for it from a quorum of leaders. This is similar to the non-equivocation phase in BFT. Liveness no longer depends on a single leader.
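The accept rule described above might be sketched as follows (a hypothetical helper, not the paper's pseudocode; it assumes a simple-majority quorum of leaders):

```python
from collections import defaultdict

def make_acceptor(leader_ids):
    """Sketch: an acceptor that accepts a value for a slot only once a
    quorum of leaders has proposed that same value."""
    needed = len(leader_ids) // 2 + 1
    proposals = defaultdict(set)  # value -> set of leaders that proposed it

    def on_propose(leader_id, value):
        proposals[value].add(leader_id)
        # Accept only after a leader quorum has proposed this value.
        return value if len(proposals[value]) >= needed else None

    return on_propose
```
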
On Collision-fast Atomic Broadcast, AINA 2014 [pdf]
Paxos Quorum Leases: Fast Reads Without Sacrificing Writes, SOCC 2014 [acmdl,pdf]
Extends the idea of master read leases to allow the master to promise to use a specified subset of acceptors in every majority quorum. Acceptors in this quorum can then serve reads locally.
Like master read leases, this relies on clock synchrony.
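The local-read check at a lease-holding acceptor might look like the following (a minimal sketch with hypothetical names; real quorum leases also handle granting, renewal, and revocation):

```python
import time

class QuorumLease:
    """Sketch of a quorum lease held by an acceptor. While the lease is
    valid, the master promises to include this acceptor in every majority
    (write) quorum, so its local state is current and reads are safe."""

    def __init__(self, duration_s: float, clock=time.monotonic):
        self.clock = clock
        self.expires_at = clock() + duration_s

    def can_serve_local_read(self) -> bool:
        # Safety relies on bounded clock drift between master and holder.
        return self.clock() < self.expires_at
```
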
HovercRaft: Achieving Scalability and Fault-tolerance for microsecond-scale Datacenter Services, Eurosys 2020 [acmdl]
FLAIR: Accelerating Reads with Consistency-Aware Network Routing, NSDI 2020 [acmdl,pdf]
Microsecond Consensus for Microsecond Applications, OSDI 2020 [arxiv]
High availability in cheap distributed key value storage, SoCC 2020 [acmdl]
Odyssey: The Impact of Modern Hardware on Strongly-Consistent Replication Protocols, Eurosys 2021 [acmdl, pdf,techreport,thesis]
Consensus for geo-distributed systems
This section covers papers describing consensus algorithms for WANs and/or geo-replicated systems. Many of these algorithms (such as EPaxos) are leaderless and decide a partial-ordering over values instead of the more traditional total-ordering approach.
Mencius: Building Efficient Replicated State Machines for WANs, OSDI 2008 [acmdl,pdf]
Scalable Consistency in Scatter, SOSP 2011 [acmdl,pdf]
MDCC: Multi-Data Center Consistency, Eurosys 2013 [acmdl,pdf]
There Is More Consensus in Egalitarian Parliaments, SOSP 2013 [acmdl,pdf]
This paper describes EPaxos, which realizes Generalized Paxos and makes some further improvements (e.g. reducing the size of fast quorums by 1).
Geo-replicated storage with scalable deferred update replication, DSN 2013 [acmdl,pdf]
Low-Latency Multi-Datacenter Databases using Replicated Commit, VLDB 2013 [acmdl,pdf]
Be General and Don’t Give Up Consistency in Geo-Replicated Transactional Systems, OPODIS 2014 [pdf]
CalvinFS: Consistent WAN Replication and Scalable Metadata Management for Distributed File Systems, FAST 2015 [acmdl,pdf]
GlobalFS: A Strongly Consistent Multi-Site File System, SRDS 2016 [pdf]
Canopus: A Scalable and Massively Parallel Consensus Protocol, CoNEXT 2017 [acmdl,pdf]
Multileader WAN Paxos: Ruling the Archipelago with Fast Consensus, Tech report 2017 [pdf]
WPaxos: Wide Area Network Flexible Consensus, Unpublished 2017 [pdf]
Windows Azure Storage: a highly available cloud storage service with strong consistency, SOSP 2011 [acmdl,pdf]
Megastore: Providing Scalable, Highly Available Storage for Interactive Services, CIDR 2011 [pdf]
Megastore uses SMR with witnesses (replicas that participate in log replication but do not run a state machine) and read-only replicas (which only run a state machine). This paper seems to use an unusual definition of Multi-Paxos where each instance is distinct but the 1a/1b messages for slot i are piggybacked onto the 2a/2b messages for slot i-1.
Zab: High-performance broadcast for primary-backup systems, DSN 2011 [acmdl,pdf]
Widely utilized, Apache-licensed open source project written in Java: project website
Apache Kafka used ZooKeeper, as well as its own replication protocol, as described here. This is no longer true; Kafka has since moved to its own Raft-based protocol (KRaft).
Architecture is similar to Google's Chubby but, unlike Chubby, it is described in detail and is open source
Writes are linearizable, reads may be stale
Note: calling sync before a read doesn't make the read linearizable
Clients may have multiple outstanding requests; they will be handled FIFO
Uses primary-backup replication instead of state machine replication
Leader or Majority: Why have one when you can have both? Improving Read Scalability in Raft-like consensus protocols, HotCloud 2017 [pdf,acmdl,slides]
Bolt-On Global Consistency for the Cloud, SoCC 2018 [acmdl,pdf]
Stable and Consistent Membership at Scale with Rapid, ATC 2018 [pdf]
Uses Fast Paxos to decide on membership changes. Conflicts are rare because the proposed value is the output of a membership algorithm, so proposers usually propose the same value.
Blockchains and Distributed Databases: a Twin Study [arxiv]
Performance analysis of five consensus systems: three non-Byzantine algorithms (including etcd) and two Byzantine consensus algorithms
Scalable but Wasteful: Current State of Replication in the Cloud, HotStorage 2021 [acmdl]
Study of the efficiency (CPU utilization) of Multi-Paxos vs EPaxos, finding that EPaxos provides better throughput than Multi-Paxos at the cost of much worse efficiency.
State machine replication
This section lists papers about the application of consensus to State Machine Replication (SMR/RSMs) and Linearizability.
⭐️ Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial, CSUR 1990 [acmdl,pdf]
⭐️ Linearizability: A Correctness Condition for Concurrent Objects, TOPLAS 1990 [acmdl,pdf]
Implementing Linearizability at Large Scale and Low Latency, SOSP 2015 [acmdl,pdf]
Cheap and Available State Machine Replication, ATC 2016 [acmdl,pdf]
Fine-Grained Replicated State Machines for a Cluster Storage System, NSDI 2020 [pdf]
Reconfiguration
This section lists papers on reconfiguration & leader election.
The SMART Way to Migrate Replicated Stateful Services, EuroSys 2006 [acmdl,pdf]
💎 Vertical Paxos and Primary-Backup Replication, PODC 2009 [acmdl,pdf]
Reconfiguring a State Machine, SIGACT News 2010 [acmdl,pdf]
Dynamic Reconfiguration of Primary/Backup Clusters, ATC 2012 [acmdl,pdf]
An Analysis of Network-Partitioning Failures in Cloud Systems, OSDI 2018 [acmdl,pdf]
CrashTuner: Detecting Crash-Recovery Bugs in Cloud Systems via Meta-Info Analysis, SOSP 2019 [acmdl]
The Inflection Point Hypothesis: A Principled Debugging Approach for Locating the Root Cause of a Failure, SOSP 2019 [acmdl]
Toward a Generic Fault Tolerance Technique for Partial Network Partitioning, OSDI 2020 [pdf]
Tolerating Slowdowns in Replicated State Machines using Copilots, OSDI 2020 [pdf]
Metastable Failures in Distributed Systems, HotOS 2021 [acmdl]
Immunizing Systems from Distant Failures by Limiting Lamport Exposure, HotNets 2021 [acmdl]
Cores That Don’t Count, HotOS 2021 [acmdl,pdf,talk]
Clocks
The liveness of distributed consensus depends on some degree of clock synchronization. The following section lists papers on the topic of clock synchronization.
IEEE Standard for a Precision Clock Synchronization Protocol for Networked Measurement and Control Systems, Standard 1588-2008 [ieee]
Globally Synchronized Time via Datacenter Networks, SIGCOMM 2016 [acmdl,pdf]
Exploiting a Natural Network Effect for Scalable, Fine-grained Clock Synchronization, NSDI 2018 [acmdl,pdf]
Sundial: Fault-tolerant Clock Synchronization for Datacenters, OSDI 2020 [pdf]