【陈怀临:这个年轻人很优秀。优秀的不仅仅是能把这些经典论文看完,而是还花心思翻译出来。他的博客是:银河里的星星。其联系方式是phylips@bmy】 分布式领域论文译序 sql&nosql年代记 海量数据的存储计算和查询 一.google论文系列 1. google系列论文译序 2. The anatomy of a large-scale hypertextual Web search engine 3. 面向星球的网络搜索:google集群架构 4. GFS:google文件系统 5. mapreduce: Simplied Data Processing on Large Clusters 6. bigtable: A Distributed Storage System for Structured Data 7. Chubby: The Chubby lock service for loosely-coupled distributed systems 8. Sawzall:Interpreting the Data–Parallel Analysis with Sawzall 9. Pregel: A System for Large-Scale Graph Processing 10. Dremel: Interactive Analysis of WebScale Datasets 11. Percolator: Large-scale Incremental Processing Using Distributed Transactions and Notifications 12. MegaStore: Providing Scalable, Highly Available Storage for Interactive Services 13. Case Study GFS: Evolution on Fast-forward 14. Google File System II: Dawn of the Multiplying Master Nodes google系列论文翻译集(合集) 二.分布式理论系列 00. Appraising Two Decades of Distributed Computing Theory Research 0. How to Build a Highly Available System Using Consensus 1. 分布式理论系列译序 2. A brief history of Consensus_ 2PC and Transaction Commit 3. 拜占庭将军问题 –Leslie Lamport 4. Impossibility of distributed consensus with one faulty process 5. Leases:租约机制 6. paxos made simple 7. 2pc-2阶段提交:Notes on Data Base Operating Systems –Jim Gray 8. 3pc-3阶段提交:NONBLOCKING COMMIT PROTOCOLS 9. Life beyond Distributed Transactions:an Apostate’s Opinion 10. A Comparison of the Byzantine Agreement Problem and the Transaction Commit Problem –Jim Gray 11. Uniform consensus is harder than consensus 12. Consensus on Transaction Commit –Jim Gray & Leslie Lamport 13. Paxos Made Live – An Engineering Perspective 14. Fast Paxos –Leslie Lamport 15. The Part Time Parliament –Leslie Lamport 16. Time Clocks and the Ordering of Events in a Distributed System –Leslie Lamport 17. Distributed Snapshots: Determining Global States of a Distributed System –Leslie Lamport 18. Virtual Time and Global States of Distributed Systems 19. Timestamps in Message-Passing Systems That Preserve the Partial Ordering 20. Fundamentals of Distributed Computing:A Practical Tour of Vector Clock Systems 三.Sql&NoSql理论系列 0. Towards Robust Distributed Systems:Brewer’s 2000 PODC key notes 1. CAP理论 2. Harvest, Yield, and Scalable Tolerant Systems 3. Brewer’s conjecture and the feasibility of consistent, available, partition- tolerant web services 4. BASE模型:BASE an Acid Alternative 5. 最终一致性 6. 可扩展性设计模式 7. 可伸缩性原则 8. MapReduce:一个巨大的倒退 9. The Five-Minute Rule20 Years Later(and How Flash Memory Changes the Rules) 10. The Transaction Concept:Virtues and Limitations –Jim Gray 11. NoSql生态系统 12. scalability-availability-stability-patterns 四.基本算法和数据结构 1. 大数据量,海量数据处理方法总结 2. 大数据量,海量数据处理方法总结(续) 3. Consistent Hashing And Random Trees 4. Merkle Trees 5. Scalable Bloom Filters 6. Introduction to Distributed Hash Tables 7. B-Trees and Relational Database Systems 8. The log-structured merge-tree 9. lock free data structure 10. Data Structures for Spatial Database 11. Gossip 12. lock free algorithm 13. The Graph Traversal Pattern 五.基本系统和实践经验 1. Berkeley DB white paper 2. Dynamo: Amazon’s Highly Available Key-value Store 3. Cassandra – A Decentralized Structured Storage System 4. PNUTS: Yahoo!’s Hosted Data Serving Platform 5. Hive – A Petabyte Scale Data Warehouse Using Hadoop 5. Benchmarking Cloud Serving Systems with YCSB 6. Designs, Lessons and Advice from Building Large Distributed Systems –Jeff Dean 7. Challenges in Building Large-Scale Information Retrieval Systems –Jeff Dean 8. Apache Hadoop Goes Realtime at Facebook 9. Hadoop平台优化综述 六.其他辅助系统 1. The ganglia distributed monitoring system:design, implementation, and experience 2. Chukwa: A large-scale monitoring system 七.其他 Reflections on Trusting Trust –Ken Thompson Who Needs an Architect? Go To statements considered harmfull –Edsger W.Dijkstra No Silver Bullet Essence and Accidents of Software Engineering –Frederick P. Brooks |