References

  1. Michael Stonebraker and Uğur Çetintemel: “'One Size Fits All': An Idea Whose Time Has Come and Gone,” at 21st International Conference on Data Engineering (ICDE), April 2005.

  2. Walter L. Heimerdinger and Charles B. Weinstock: “A Conceptual Framework for System Fault Tolerance,” Technical Report CMU/SEI-92-TR-033, Software Engineering Institute, Carnegie Mellon University, October 1992.

  3. Ding Yuan, Yu Luo, Xin Zhuang, et al.: “Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems,” at 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI), October 2014.

  4. Yury Izrailevsky and Ariel Tseitlin: “The Netflix Simian Army,” netflixtechblog.com, July 19, 2011.

  5. Daniel Ford, François Labelle, Florentina I. Popovici, et al.: “Availability in Globally Distributed Storage Systems,” at 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI), October 2010.

  6. Brian Beach: “Hard Drive Reliability Update – Sep 2014,” backblaze.com, September 23, 2014.

  7. Laurie Voss: “AWS: The Good, the Bad and the Ugly,” blog.awe.sm, December 18, 2012.

  8. Haryadi S. Gunawi, Mingzhe Hao, Tanakorn Leesatapornwongsa, et al.: “What Bugs Live in the Cloud?,” at 5th ACM Symposium on Cloud Computing (SoCC), November 2014. doi:10.1145/2670979.2670986

  9. Nelson Minar: “Leap Second Crashes Half the Internet,” somebits.com, July 3, 2012.

  10. Amazon Web Services: “Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region,” aws.amazon.com, April 29, 2011.

  11. Richard I. Cook: “How Complex Systems Fail,” Cognitive Technologies Laboratory, April 2000.

  12. Jay Kreps: “Getting Real About Distributed System Reliability,” blog.empathybox.com, March 19, 2012.

  13. David Oppenheimer, Archana Ganapathi, and David A. Patterson: “Why Do Internet Services Fail, and What Can Be Done About It?,” at 4th USENIX Symposium on Internet Technologies and Systems (USITS), March 2003.

  14. Nathan Marz: “Principles of Software Engineering, Part 1,” nathanmarz.com, April 2, 2013.

  15. Michael Jurewitz: “The Human Impact of Bugs,” jury.me, March 15, 2013.

  16. Raffi Krikorian: “Timelines at Scale,” at QCon San Francisco, November 2012.

  17. Martin Fowler: Patterns of Enterprise Application Architecture. Addison-Wesley, 2002. ISBN: 978-0-321-12742-6

  18. Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, et al.: “Dynamo: Amazon's Highly Available Key-Value Store,” at 21st ACM Symposium on Operating Systems Principles (SOSP), October 2007.

  19. Greg Linden: “Make Data Useful,” slides from the presentation at Stanford University Data Mining class (CS345), December 2006.

  20. Tammy Everts: “The Real Cost of Slow Time vs Downtime,” slideshare.net, November 5, 2014.

  21. Jake Brutlag: “Speed Matters,” ai.googleblog.com, June 23, 2009.

  22. Tyler Treat: “Everything You Know About Latency Is Wrong,” bravenewgeek.com, December 12, 2015.

  23. Jeffrey Dean and Luiz André Barroso: “The Tail at Scale,” Communications of the ACM, volume 56, number 2, pages 74–80, February 2013. doi:10.1145/2408776.2408794

  24. Graham Cormode, Vladislav Shkapenyuk, Divesh Srivastava, and Bojian Xu: “Forward Decay: A Practical Time Decay Model for Streaming Systems,” at 25th IEEE International Conference on Data Engineering (ICDE), March 2009.

  25. Ted Dunning and Otmar Ertl: “Computing Extremely Accurate Quantiles Using t-Digests,” github.com, March 2014.

  26. Gil Tene: “HdrHistogram,” hdrhistogram.org.

  27. Baron Schwartz: “Why Percentiles Don’t Work the Way You Think,” solarwinds.com, November 18, 2016.

  28. James Hamilton: “On Designing and Deploying Internet-Scale Services,” at 21st Large Installation System Administration Conference (LISA), November 2007.

  29. Brian Foote and Joseph Yoder: “Big Ball of Mud,” at 4th Conference on Pattern Languages of Programs (PLoP), September 1997.

  30. Frederick P. Brooks: “No Silver Bullet – Essence and Accident in Software Engineering,” in The Mythical Man-Month, Anniversary Edition, Addison-Wesley, 1995. ISBN: 978-0-201-83595-3

  31. Ben Moseley and Peter Marks: “Out of the Tar Pit,” at BCS Software Practice Advancement (SPA), 2006.

  32. Rich Hickey: “Simple Made Easy,” at Strange Loop, September 2011.

  33. Hongyu Pei Breivold, Ivica Crnkovic, and Peter J. Eriksson: “Analyzing Software Evolvability,” at 32nd Annual IEEE International Computer Software and Applications Conference (COMPSAC), July 2008. doi:10.1109/COMPSAC.2008.50
