Availability and Recovery: Discussion
Today was a rare "pure discussion". We discussed strategies for maintaining availability in the face of failure. We discussed the role of specialized hardware (load-balancing switches), directory services (DNS), replication, consistency models (allowing some staleness, when possible), coordination (elected and appointed coordinators, truly distributed decisions), understanding natural partitionings of data and services, and humans in maintaining system availability.
If you missed it, check out the video.