Thursday, October, 2014CodingTMD ' s Reading list following Reading list is selected from the papers I had read in the PA St 3 years. It would help you to gain a basic knowledge of the what happened in the current industry and bring you a little sense about how to Design a distributed system with certain principles.
Feel free to post the good paper your had read in the comments for sharing.:)
Concurrency
- In Search of an understandable Consensus algorithm. Diego Ongaro, John ousterhout, 2013
- A Simple totally Ordered broadcast Protocol. Benjamin Reed, Flavio P. junqueira,2008
- Paxos made Live-an Engineering perspective. Tushar Deepak Chandra, Robert Griesemer, Joshua Redstone, 2007
- The Chubby Lock Service for loosely-coupled distributed Systems. Mike Burrows, 2006
- Paxos Made Simple. Leslie Lamport, 2001
- Impossibility of distributed Consensus with one faulty Process. Michael Fischer, Nancy Lynch, Michael Patterson, 1985
- The Byzantine generals problem. Leslie Lamport, 1982
- An algorithm to Concurrency Control and Recovery in replicated distributed Databases. PA Bernstein, N Goodman, 1984
- Wait-free synchronization. M Herlihy ..., 1991
- Zookeeper:wait-free Coordination for Internet-scale systems. P Hunt, M Konar, FP Junqueira, 2010
<!--[If!supportlists]-->
Consistency
- Highly Available transactions:virtues and limitations. Peter Bailis, Aaron Davidson, Alan Fekete, Ali ghodsi, Joseph M. Hellerstein, Ion Stoica, 2013
- Consistency tradeoffs in modern distributed Database System Design. Daniel J. Abadi, 2012
- CAP Twelve years later:how the "Rules" has Changed. Eric Brewer, 2012
- Optimistic Replication. Yasushi Saito and Marc Shapiro, 2005
- Brewer ' s conjecture and the feasibility of consistent, Available, Partition-tolerant Web Services. Seth Gilbert, Nancy Lynch, 2002
- Harvest, Yield, and scalable tolerant Systems. Armando Fox, Eric A. Brewer, 1999
- Linearizability:a correctness Condition for Concurrent Objects. Maurice P. Herlihy, Jeannette M. Wing, 1990
- Time, clocks, and the ordering of Events in a distributed System. Leslie Lamport, 1978
<!--[If!supportlists]-->
CONFLICT-FREE Data Structures
- A comprehensive Study of convergent and commutative replicated Data Types. Mark Shapiro, Nuno Preguiça, Carlos Baquero, Marek Zawirski, 2011
- A commutative replicated Data Type for cooperative Editing. Nuno Preguica, Joan Manuel Marques, Marc Shapiro, Mihai Letia, 2009
- Crdts:consistency without Concurrency Control. Mihai Letia, Nuno Preguiça, Marc Shapiro, 2009
- Conflict-free replicated data types. Marc Shapiro, Nuno Preguiça, Carlos Baquero, Marek Zawirski, 2011
- Designing a commutative replicated data type. Marc Shapiro, Nuno Preguiça, 2007
Distributed Programming
- Logic and lattices for distributed programming. Neil Conway, William Marczak, Peter Alvaro, Joseph M. Hellerstein, David Maier, 2012
- Dedalus:datalog in time and Space. Peter Alvaro, William R. Marczak, Neil Conway, Joseph M. Hellerstein, David Maier, Russell Sears, 2011
- Mapreduce:simplified Data processing on Large Clusters. Jeffrey Dean, Sanjay Ghemawat, 2004
- A Note on distributed Computing. Samuel C. Kendall, Jim Waldo, Ann Wollrath, Geoff Wyant, 1994
- An overview of the Scala programming Language. M Odersky, P altherr, V Cremet, B Emir, S man, 2004
- Erlang. Joe Ar Mstrong, 2010
<!--[If!supportlists]-->
SystemsImplemented and theoretical distributed systems.
- A History of the Virtual synchrony Replication Model. Ken Birman, 2010
- CASSANDRA-A decentralized structured Storage System. Avinash Lakshman, Prashant Malik, 2009
- Dynamo:amazon ' s highly Available key-value Store. Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Siva Subramanian, Peter Vosshall and Werner Vogels, 2007
- Stasis:flexible transactional Storage. Russell Sears, Eric Brewer, 2006
- BIGTABLE:A distributed Storage System for structured Data. Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson c. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber, 2006
- The Google File System. Sanjay Ghemawat, Howard Gobioff, and Shun-tak Leung, 2003
- Lessons from Giant-scale Services. Eric A. Brewer, 2001
- Towards robust distributed Systems. Eric A. Brewer, 2000
- Cluster-based Scalable Network Services. Armando Fox, Steven D. Gribble, Yatin chawathe, Eric A. Brewer, Paul Gauthier, 1997
- The Process Group approach to Reliable distributed Computing. Ken Birman, 1993
- Bitcoin:a peer-to-peer Electronic Cash System.
- The Hadoop distributed File System. Konstantin Shvachko, Hairong Kuang, Sanjay Radia, Robert Chansler, 2010
- Hive–a petabyte scale Data Warehouse Using Hadoop. A Thusoo, JS Sarma, N Jain, Z Shao, 2010
- Scalable Web Architecture and distributed Systems. Kate Matsudaira,
- KAFKA:A distributed Messaging System for LOG processing. J Kreps, N Narkhede, 2011
- Storm:distributed and Fault-tolerant real-time computation. Nathan Marz, 2012
- Spark:cluster Computing withworking sets. M Zaharia, M Chowdhury, MJ Franklin ..., 2010
- Flat Datacenter Storage. EB Nightingale, J Elson, J Fan, OS Hofmann, J Howell ..., 2012
- Ananta:cloud Scale Load balancing. P Patel, D Bansal, L Yuan, A Murthy ..., 2013
- F1-the Fault-tolerant distributed RDBMS supporting Google ' s Ad business. Jeff Shute, Stephan Ellner ..., 2012
- BigTable, Dynamo & Cassandra–a Review. A Kala Karun, S Surendran, 2012
- Windows Azure storage:a highly Available Cloud Storage Service with strong consistency. B Calder, J Wang, A ogus, N nilakantan ..., 2011
<!--[If!supportlists]-->
Reliability
- The dangers of Replication and a solution. J Gray, P Helland, p O ' Neil, D shasha-acm sigmod Record, 1996
<!--[If!supportlists]-->
Industry Implementation
- Hadoop Architecture and its Usage at Facebook. Dhruba Borthakur, 2009
- WEB SEARCH for A planet:the GOOGLE CLUSTER ARCHITECTURE. LA Barroso, J Dean, U Holzle-micro, Ieee, 2003
- HDFS scalability:the limits to growth. Konstantin v. Shvachko, 2010
- Autopilot:automatic Data Center Management. Michael Isard, 2007
- Storage Infrastructure behind Facebook messages:using HBase at scale. As Aiyer, M Bautin, GJ Chen, P Damania, 2012
- Scaling Memcache at Facebook. R Nishtala, H fugal, S Grimm, M Kwiatkowski, 2013
- Finding a needle in Haystack:facebook ' s photo storage. D Beaver, S Kumar, HC Li, J Sobel, P Vajge, 2010
- Apache Hadoop Goes Realtime at Facebook. D Borthakur, J Gray, JS Sarma ..., 2011
- Data Warehousing and Analytics Infrastructure at Facebook. A Thusoo, Z Shao, S Anthony, D Borthakur ..., 2010
- Large scale Computing @ Linkedin. Bhupesh Bansal, 2009
- An analysis of Facebook Photo Caching. Q Huang, K Birman, R van Renesse, W Lloyd ..., 2013
- The "Big Data" Ecosystem at LinkedIn. R sumbaly, J kreps, S Shah, 2013
- Data Infrastructure at LinkedIn. A Auradkar, C Botev, S Das ..., 2012
<!--[If!supportlists]-->
Language
- Deep C (and C + +). Olve Maudal and Jon Jagger, 2011
<!--[If!supportlists]-->
Others
- Columnstores vs. Rowstores:how Different is they really? DJ Abadi, SR Madden, N Hachem, 2008
- Hadoop and its evolving ecosystem. J. Yates Monteith, John D. McGregor, and John E. Ingram
- Orleans:cloud Computing for Everyone. S Bykov, A Geller, G Kliot, JR Larus, R Pandya, 2011
- Twitter Data Analytics. Shamanth Kumar, Fred morstatter, Huan Liu, 2013
- MapReduce is good enough? If all has a Hammer, Throw away everything that's not a nail! Jimmy Lin, 2012
<!--[If!supportlists]-->
Data Mining
- Data Mining with Big data. x Wu, x Zhu, GQ Wu, W Ding, 2014
- Samoa:a Platform for Mining Big Data Streams. G De Francisci Morales, 2013
- Mining Big data:current Status, and Forecast to the future. W Fan, A Bifet, 2013
- Scaling Big Data Mining infrastructure:the Twitter Experience. J Lin, D Ryaboy, 2013
<!--[If!supportlists]-->
Books
- Cloud Design Pattern.
- Data Access for highly scalable solutions.
- Computer Architecture-a Quantitative approach.
- Distributed Systems-concepts and Design. Fifth Edition. George Coulouris
- Beautiful Architecture. Diomidis Spinellis, Georgios Gousios etc
- Mining social media:tracking Content and predicting Behavior. Manos Tsagkias
- Seven Databases in Seven Weeks. Eric Redmond and Jim R. Wilson
<!--[If!supportlists]-->
CODINGTMD ' s Reading List