1. Paper Notes
  2. 1. benchmarks
    1. 1.1. ssb
  3. 2. bigdata
    1. 2.1. mapreduce
    2. 2.2. nephele
    3. 2.3. dataflow model
    4. 2.4. flink
    5. 2.5. flink state management
  4. 3. databases
    1. 3.1. columnstores vs rowstores
    2. 3.2. kv
      1. 3.2.1. rocksdb cidr17
    3. 3.3. mmdb
      1. 3.3.1. mmdb overview
    4. 3.4. oltp
      1. 3.4.1. through the looking glass
      2. 3.4.2. staring into the abyss
    5. 3.5. olap
      1. 3.5.1. lakehouse
      2. 3.5.2. delta lake
      3. 3.5.3. vertica
    6. 3.6. htap
      1. 3.6.1. greenplum
    7. 3.7. concurrency control
      1. 3.7.1. evaluation of in-memory mvcc
    8. 3.8. cdc
      1. 3.8.1. dblog
    9. 3.9. rum conjecture
  5. 4. datalayout
    1. 4.1. cstore
    2. 4.2. cstore compression
    3. 4.3. dremel
    4. 4.4. rcfile
    5. 4.5. orc
    6. 4.6. table placement methods
  6. 5. data structures
    1. 5.1. btree family
      1. 5.1.1. bw-tree
    2. 5.2. hash table
      1. 5.2.1. linear hashing
    3. 5.3. trie family
      1. 5.3.1. art
      2. 5.3.2. hot
    4. 5.4. skip list
    5. 5.5. bloom filter
  7. 6. distributed system
    1. 6.1. consensus
      1. 6.1.1. flp
      2. 6.1.2. paxos made simple
      3. 6.1.3. paxos made live
      4. 6.1.4. viewstamped replication
      5. 6.1.5. zab
      6. 6.1.6. paxos vs. vr vs. zab
      7. 6.1.7. raft
      8. 6.1.8. paxos vs raft
    2. 6.2. scheduler
      1. 6.2.1. borg
    3. 6.3. primary backup
    4. 6.4. chain replication
    5. 6.5. bolosky
    6. 6.6. holy grail
    7. 6.7. chandy lamport
    8. 6.8. asynchronous barrier snapshotting
    9. 6.9. zookeeper
  8. 7. filesystem
    1. 7.1. gfs
    2. 7.2. polarfs
  9. 8. storage
    1. 8.1. kv store
      1. 8.1.1. dynamo
    2. 8.2. kudu
    3. 8.3. bluestore

论文阅读笔记

databases

  • Column-stores vs. row-stores: how different are they really?
  • Main Memory Database Systems
    • Main Memory Database Systems: An Overview
  • KV databases
    • Optimizing Space Amplification in RocksDB
  • OLTP
    • OLTP Through the Looking Glass, and What We Found There
    • Staring into the Abyss: An Evaluation of Concurrency Control with One Thousand Cores
  • OLAP
    • Lakehouse: A New Generation of Open Platforms that Unify DataWarehousing and Advanced Analytics
    • Delta Lake: HighPerformance ACID Table Storage over Cloud Object Stores
    • The Vertica Analytic Database: C-Store 7 Years Later
  • HTAP
    • Greenplum: A Hybrid Database for Transactional and Analytical Workloads
  • Concurrency Control
    • An Empirical Evaluation of In-Memory Multi-Version Concurrency Control
  • Change Data Capture
    • DBLog: AWatermark Based Change-Data-Capture Framework
  • Designing Access Methods: The RUM Conjecture

Optional readings

  • M. Stonebraker, et al., What Goes Around Comes Around
  • A. Pavlo, et al., What’s Really New with NewSQL?
  • Yihe Huang, et al., Opportunities for Optimism in Contended MainMemory Multicore Transactions