distributed system
A Brief Introduction to Distributed Systems defines a distributed system as:
A distributed system is a collection of autonomous computing elements that appears to its users as a single coherent system.
Turing Award winner Leslie Lamport described a distributed system as:
one in which the failure of a computer you did not even know existed can render your own computer unusable.
- consensus
- Impossibility of Distributed Consensus with One Faulty Process
- Paxos Made Simple
- Paxos Made Live
- Viewstamped Replication
- Zab: High-performance broadcast for primary-backup systems
- Vive La Différence: Paxos vs. Viewstamped Replication vs. Zab
- In Search of an Understandable Consensus Algorithm
- Paxos vs Raft: have we reached consensus on distributed consensus?
- A principle for resilient sharing of distributed resources
- Chain Replication for Supporting High Throughput and Availability
- Paxos Replicated State Machines as the Basis of a High-Performance Data Store
- Detecting Causal Relationships in Distributed Computations: In Search of the Holy Grail
- Distributed Snapshots: Determining Global States of Distributed Systems
- Lightweight Asynchronous Snapshots for Distributed Dataflows
- ZooKeeper: wait-free coordination for internet-scale systems
- scheduler