09 Aug 2008 0200-0300 (-0500) Confluence outage
starting in 1 day
news
about 1 year ago
Disk failure on codehaus03

We lost a disk on codehaus03 (this is the machine that does everything but Confluence and JIRA)

Of all the machines that can lose a disk – this isn’t the one. It’s running under massive load 24×7 and has no spare capacity.

Contegix discovered it, replaced the drive and rebuilt the array while Bob was off drinking beer at JavaOne. Praise be to Contegix.

The side effect was that the machine went into even higher load level, slowed down further and we ended up with 6000 mail messages in the queue. At that point, some spammers thought sending us a few 1000 emails would be helpful; it wasn’t.

This is why your emails were getting delayed

This is now 90% cleared, and things are returning to normal.

Any problems; contact support at codehaus.org.

Thank you for your patience. We’ve lived through how this could have gone (1 year ago…) – this is a much better scenario.

Powered by OpenXource Xircles™ (Version: 0.1-6447)