distributed system

在 A Brief Introduction to Distributed Systems 的介绍中，分布式系统被定义为:

A distributed system is a collection of autonomous computing elements that appears to its users as a single coherent system.

图灵奖获得者 Leslie Lamport 将分布式系统描述为:

one in which the failure of a computer you did not even know existed can render your own computer unusable.

consensus
A principle for resilient sharing of distributed resources
Chain Replication for Supporting High Throughput and Availability
Paxos Replicated State Machines as the Basis of a High-Performance Data Store
Detecting Causal Relationships in Distributed Computations: In Search of the Holy Grail
Distributed Snapshots: Determining Global States of Distributed Systems
Lightweight Asynchronous Snapshots for Distributed Dataflows
ZooKeeper: wait-free coordination for internet-scale systems
scheduler
- Large-scale cluster management at Google with Borg