North American Network Operators Group

RE: more on SF outage

  • From: Peter Kranz
  • Date: Wed Jul 25 03:17:25 2007

Once the final analysis of this event is provided, the cause will likely
turn out to be a failure of one of the redundant systems to handle the event
as designed, due to a software or other low-level fault. It's a very complex
system designed to exceed anything in the region as far as redundancy goes,
but as a result it's got a lot of moving parts and, like the space shuttle,
can fail unexpectedly. You can bet engineering is scratching their heads and
calling in the vendors to figure out what went wrong. Last time this
occurred, it took weeks to pinpoint the root cause.