Distributed Systems
- 📖 Designing Data-Intensive Applications (Martin Kleppmann, 2017)
- 📄 Photon: A Fast Query Engine for Lakehouse Systems (berkeley.edu)
- 📄 MapReduce: Simplified Data Processing on Large Clusters
- 📄 Kafka: a Distributed Messaging System for Log Processing
- 📄 Dremel: Interactive Analysis of Web-Scale Datasets
- 📄 Dremel: A Decade of Interactive SQL Analysis at Web Scale
- 🔗 Building and operating a pretty big storage system called S3