[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Cloudflare is down

On (2013-03-04 13:23 -0500), Jeff Wheeler wrote:
> We have lots of stupid people in our industry because so few
> understand "The Way Things Work."

We have tendency to view mistakes we do as unavoidable human errors and
mistakes other people do as avoidable stupidity.

We should actively plan for mistakes/errors, if you actively plan for no
'stupid mistakes', you're gonna have bad time

>From my point of view, outages are caused by:
1) operator
2) software defect
3) hardware defect

Most people design only against 3), often with design which actually
increases likelihood of 2) and 1), reducing overall MTBF on design which
strictly theoretically increases it.