Advertisement

We need your help now

Support from readers like you keeps The Journal open.

You are visiting us because we have something you value. Independent, unbiased news that tells the truth. Advertising revenue goes some way to support our mission, but this year it has not been enough.

If you've seen value in our reporting, please contribute what you can, so we can continue to produce accurate and meaningful journalism. For everyone who needs it.

Facebook, Instagram and WhatsApp outage caused by error during maintenance work, company says

The social networks and messaging service went offline for more than five hours on Monday.

THE FACEBOOK OUTAGE which took the social network, as well as Instagram and WhatsApp, offline for more than five hours was caused by an error during a routine maintenance job, the company has said.

Billions of the platforms’ users had been left unable to get online on Monday by the fault, which the company said was “an outage caused not by malicious activity, but an error of our own making”.

Santosh Janardhan, Facebook’s vice president of infrastructure, said that during what was “routine maintenance work” on the firm’s backbone network “a command was issued with the intention to assess the availability of global backbone capacity, which unintentionally took down all the connections in our backbone network, effectively disconnecting Facebook data centres globally”.

Writing in a blog post he said: “Our systems are designed to audit commands like these to prevent mistakes like this, but a bug in that audit tool prevented it from properly stopping the command.

“This change caused a complete disconnection of our server connections between our data centres and the internet. And that total loss of connection caused a second issue that made things worse.”

Janardhan said it also took time to fix because of the way Facebook’s servers are designed, in order to offer better physical security.

“They’re hard to get into, and once you’re inside, the hardware and routers are designed to be difficult to modify even when you have physical access to them,” he said.

He confirmed that Facebook then had to bring the servers back online slowly, to avoid any further issues.

“We knew that flipping our services back on all at once could potentially cause a new round of crashes due to a surge in traffic,” he said.

“Every failure like this is an opportunity to learn and get better, and there’s plenty for us to learn from this one.

“After every issue, small and large, we do an extensive review process to understand how we can make our systems more resilient. That process is already under way.”

As well as sparking debate about the public use of social media, the outage also saw EU competition commissioner Margrethe Vestager repeat calls for greater competition in the tech sector – saying the incident highlighted the negative impact of big tech firms controlling large swathes of the online world.

“We need alternatives and choices in the tech market, and must not rely on a few big players, whoever they are,” she wrote on Twitter.

Readers like you are keeping these stories free for everyone...
A mix of advertising and supporting contributions helps keep paywalls away from valuable information like this article. Over 5,000 readers like you have already stepped up and support us with a monthly payment or a once-off donation.

View 14 comments
Close
14 Comments
This is YOUR comments community. Stay civil, stay constructive, stay on topic. Please familiarise yourself with our comments policy here before taking part.
Leave a Comment
    Submit a report
    Please help us understand how this comment violates our community guidelines.
    Thank you for the feedback
    Your feedback has been sent to our team for review.

    Leave a commentcancel

     
    JournalTv
    News in 60 seconds