Data
An annotated collection of 13 books, papers & essays on data, spanning 1970 to 2019. Featuring works by Edgar F. Codd, Edward Tufte, Jakob Nielsen and 8 more — each with editorial commentary oriented to digital product practice.
A Relational Model of Data for Large Shared Data Banks
Codd's 1970 paper proposed that data storage should be separated from its physical representation on disk and instead organized into relations — tables with rows and columns governed by mathematical set theory. At the ti…
The Visual Display of Quantitative Information
Tufte's first and most influential book established the principles of data visualization as a serious discipline, arguing that graphical excellence consists of complex ideas communicated with clarity, precision, and effi…
Envisioning Information
The second volume of Tufte's trilogy on information design, focused on the problem of escaping flatland — how to represent complex, multidimensional data on the two-dimensional surfaces of paper and screen. Where The Vis…
Usability Engineering
Where Card, Moran, and Newell gave HCI its theoretical foundation, Nielsen gave it a pragmatic engineering methodology. This book codified usability heuristics, discount usability testing, severity ratings for defects, a…
Visual Explanations
The third volume of Tufte's trilogy, concerned with pictures of verbs — the visual representation of mechanisms, processes, cause and effect. The Challenger disaster chapter alone justifies the entire book: Tufte reconst…
Towards Robust Distributed Systems
In this keynote at the 2000 ACM Symposium on Principles of Distributed Computing, Brewer conjectured that a distributed system cannot simultaneously guarantee consistency, availability, and partition tolerance — you must…
The Ghost Map: The Story of London's Most Terrifying Epidemic — and How It Changed Science, Cities, and the Modern World
Johnson reconstructs the 1854 Broad Street cholera outbreak in London and the investigation by physician John Snow and clergyman Henry Whitehead that proved the disease was waterborne, not airborne. The book is fundament…
Life Beyond Distributed Transactions: An Apostate's Opinion
Helland, a veteran of Tandem, Microsoft, and Amazon, argues that as systems scale beyond a single machine, the classical guarantee of distributed transactions — where all parties agree atomically — becomes impractical or…
Eventually Consistent
Vogels, Amazon's CTO, wrote this essay to explain why eventual consistency is not a compromise or a bug but a deliberate architectural choice driven by the realities of operating at planetary scale. He walks through the…
Doing Data Science: Straight Talk from the Frontline
Based on Schutt's Columbia University course and O'Neil's experience as a data scientist at various startups, this book captures the discipline of data science at the moment it was coalescing from statistics, machine lea…
Weapons of Math Destruction
O'Neil, a mathematician who moved from academia to Wall Street to data science, identifies a class of predictive models she calls Weapons of Math Destruction: opaque, unregulated, and operating at scale in domains where…
Designing Data-Intensive Applications
Kleppmann's book is the contemporary reference for understanding how data systems actually work — from the internals of B-trees and LSM-trees to the semantics of distributed consensus protocols. What distinguishes it fro…
Do we have a Data Culture?
Kremser and Brunauer sharpen what "data culture" actually means by treating it as a subtype of organisational culture — not a technology stack or a dashboard habit but a set of shared assumptions about how decisions get…