Here are some classics:
- MapReduce paper
- BigTable paper
- Amazon DynamoDB paper
- Spark RDD paper

Papers from the comments:
- Cassandra
- Kafka

Any others?
Start with the basics (I don’t know the exact titles of the papers):
1. CAP theorem by Eric Brewer
2. Logical clocks by Leslie Lamport
Gilbert and Lynch: Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services
Is this the original paper title?
There's a lot more to the field than just what's used in industry...
Lamport: Time, Clocks, and the Ordering of Events in a Distributed System; The Byzantine Generals Problem; Specifying Systems; Paxos Made Simple
Lampson: Hints for Computer System Design
Gray: Transaction Processing; Data Cube; The Notions of Consistency
Just to be clear, you have 4 papers listed for Lamport in your post. It might be useful to split them onto separate lines.
Yeah. If you ever get the chance to take a seminar with Lamport (he does some at Amazon from time to time), jump on it. I've been lucky enough to take two week-long seminars with him, and it's great. He's truly a special mind. He thinks about problems completely differently than computer scientists do, which is probably the reason he's made so many important contributions.
Looks like we are getting good suggestions here. Looking forward to more replies.
Paxos
GFS, Raft
Spanner paper, Azure Storage
COPS and COPS+
How can Roy Fielding’s REST paper not be in this list yet?!
Cassandra, Kafka.
HBase?