Part I. Foundations of Data Systems

The first four chapters go through the fundamental ideas that apply to all data systems, whether running on a single machine or distributed across a cluster of machines:

  1. Chapter 1 introduces the terminology and approach used in this book, e.g., reliability, scalability, and maintainability, and how we can try to achieve these goals.
  2. Chapter 2 compares several different data models and query languages—the most visible distinguishing factor between databases from a developer's point of view.
  3. Chapter 3 turns to the internals of storage engines and looks at how databases lay out data on disk. Different storage engines are optimized for different workloads, and choosing the right one can have a huge effect on performance.
  4. Chapter 4 compares various formats for data encoding (serialization) and especially examines how they fare in an environment where application requirements change and schemas need to adapt over time.