Free Republic
Browse · Search
News/Activism
Topics · Post Article

Skip to comments.

Facebook outage caused by a single mistake; has huge implications
9to5Mac ^ | 5 October 2021 | Ben Lovejoy

Posted on 10/05/2021 5:52:37 AM PDT by ShadowAce

Yesterday’s Facebook outage – which took down Facebook Messenger, Instagram, and WhatsApp as well as the main service – resulted from a mistake by the company’s own network engineers.

The mistake led to all of Facebook’s services being inaccessible, with one analogy likening it to a failure in the “air traffic control” services for network traffic …

We reported yesterday on the massive failure.

It’s not just you: Facebook, Instagram, and WhatsApp are all currently down for users around the world. We’re seeing error messages on all three services across iOS applications as well as on the web. Users are being greeted with error messages such as: “Sorry, something went wrong,” “5xx Server Error,” and more.

The outage is affecting every Facebook-owned platform, according to data on Downdetector and Twitter. This includes Instagram, Facebook, WhatsApp, and Facebook Messenger […] While some Facebook, Instagram, and WhatsApp outages only affect certain geographic regions, the services are down worldwide today. 

It gradually appeared that the problem might relate to DNS – the domain name servers that tell devices which IP addresses to use to access services – but it was unclear what exactly had happened, and whether this was an external hack, malicious action by an insider, or a catastrophic mistake.

Facebook has now admitted in a blog post that it was a mistake.

Our engineering teams have learned that configuration changes on the backbone routers that coordinate network traffic between our data centers caused issues that interrupted this communication. This disruption to network traffic had a cascading effect on the way our data centers communicate, bringing our services to a halt.

It took a long time to resolve the problem because the inaccessible systems included the servers and tools engineers would normally use to solve the problem remotely. Reports suggest that lower-level employees had to gain physical access to the data centers, and then rely on step-by-step instructions from more senior engineers in order to undo the mistake. Complicating this, the networks being unavailable meant that Facebook’s door access systems were also offline, physically preventing access.

How to understand the Facebook outage

We’ll doubtless get the full story in time, but the consensus view emerging is that the problem was some mix of domain name server (DNS) and border gateway protocol (BGP) configuration.

The best analogy I’ve seen is to think of network traffic as being like planes. Your device wants to fly to facebook.com. Your plane first needs to know the GPS coordinates of the destination airport, that is, the IP address it should connect to. It gets that information by asking a DNS, which tells it that facebook.com is located at (for example) 66.220.144.0.

But getting to the final destination – the actual server that can perform the task you want to do – relies on a kind of air traffic control system for network traffic, and that’s the BGP. The BGP tells your device which route to fly through the various servers en route to your final destination.

It appears that Facebook completely lost its BGP systems – so there was no way for Facebook to tell devices how to reach their destination. And that included Facebook’s own engineers reaching the systems they needed to undo the mistake.

From trusted source: Person on FB recovery effort said the outage was from a routine BGP update gone wrong. But the update blocked remote users from reverting changes, and people with physical access didn't have network/logical access. So blocked at both ends from reversing it.

— briankrebs (@briankrebs) October 4, 2021

The outage has huge implications

If this were just people being unable to post cat videos for a few hours, that would be one thing (though, come on, what is life without cat videos?). But WhatsApp is effectively a critical piece of communications infrastructure in many countries, routinely used for communication between patients and doctors, for example, and used by many for payments.

The extended outage has drawn attention to how vulnerable the entire world is to failures of this nature.

For example, millions of people rely on Google DNS servers to reach every server on the planet. Imagine those servers going down for an extended period. That wouldn’t just affect consumers, it would disrupt commerce and critical infrastructure. Factory production, fleet transport, retail… the works.

The whole world is critically dependent on a relatively small number of servers, all of which could be taken offline by a mistake of the kind that happened here. A lot of thought needs to be put into how we prevent a far more significant internet outage in the future.


TOPICS: Business/Economy; News/Current Events
KEYWORDS: facebook; facebookoutage; internet; socialmedia
Navigation: use the links below to view more comments.
first 1-2021-4041-6061-74 next last

1 posted on 10/05/2021 5:52:37 AM PDT by ShadowAce
[ Post Reply | Private Reply | View Replies]

To: rdb3; JosephW; martin_fierro; Still Thinking; zeugma; Vinnie; ironman; Egon; raybbr; AFreeBird; ...

2 posted on 10/05/2021 5:52:50 AM PDT by ShadowAce (Linux - The Ultimate Windows Service Pack )
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

So it WASN’T putin’s henchmen hackers. Dang!


3 posted on 10/05/2021 5:54:26 AM PDT by rktman (Destroy America from within? Check! WTH? Enlisted USN 1967 to end up with this? 😕)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

The world was a better place for 6 hours.


4 posted on 10/05/2021 5:54:29 AM PDT by Lurkinanloomin ( (Natural born citizens are born here of citizen parents)(Know Islam, No Peace-No Islam, Know Peace)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce
Imagine those servers going down for an extended period

Imagine them being destroyed and not replaced, ever, in order to save human-ness in a world gone mad.

What if it turns out that the prophet for our times wasn't George Orwell, but Frank Herbert?

5 posted on 10/05/2021 5:58:19 AM PDT by Jim Noble (The nation cannot be saved until the GOP is destroyed)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

‘Mistake.’

Yep. Like Biden in the WH is just a, ‘mistake.’

I hope it was Friendly Fire. I HOPE it costs Zuckernerd BILLIONS. I hope it happens again. :)


6 posted on 10/05/2021 5:58:38 AM PDT by Diana in Wisconsin (I don't have 'Hobbies.' I'm developing a robust post-Apocalyptic skill set. )
[ Post Reply | Private Reply | To 1 | View Replies]

To: rktman

“So it WASN’T putin’s henchmen hackers. Dang!”

Of course it was...you’ll see. LOL.


7 posted on 10/05/2021 5:58:41 AM PDT by BobL (I shop at Walmart and eat at McDonald's, I just don't tell anyone, like most here.)
[ Post Reply | Private Reply | To 3 | View Replies]

To: ShadowAce

Too bad they did not have their Two-Factor Authentication turned on 🤣🤣🤣🤣


8 posted on 10/05/2021 6:00:04 AM PDT by Lockbox (politicians, they all seemed like game show hosts to me.... Sting)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

I am hoping facebook becomes a thing of the past. Most people will grab hold of something new but then grow tired of it. They did it with Myspace and AOL and those seem to have faded away. Hopefully people are becoming bored with facebook.


9 posted on 10/05/2021 6:00:08 AM PDT by boycott
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

YAWN, who cares. I find it amusing that the whole damn world came to a standstill because no one could access FB. My God these people need to get a hobby or better yet, a job


10 posted on 10/05/2021 6:00:28 AM PDT by rockabyebaby (THE BEST IS YET TO COME - (PRESIDENT DONALD J. TRUMP)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

Were it not for reading about this here on FR I would never have known it even happened.


11 posted on 10/05/2021 6:01:41 AM PDT by billyboy15
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce
For example, millions of people rely on Google DNS servers to reach every server on the planet. Imagine those servers going down for an extended period.

If Xi Jinping's cyber warriors were not already aware of these vulnerabilities, they are now. How many of his agents are already employed in the IT departments of strategically critical industries? And I hear they work cheaper than Americans...

12 posted on 10/05/2021 6:01:54 AM PDT by Mr. Jeeves ([CTRL]-[GALT]-[DELETE])
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce
Every add on to a house is a potential roof leak.

FaceBook has been (we believe) complicit in collusion with the commie government to spy and phish us all.

IF this explanation is true ... they DID have to go back to the beginning, unwinding a huge ball of yarn, to ... not find the glitch, but patch up all the glitch caused.

Too many additions to the house, imo.

13 posted on 10/05/2021 6:03:35 AM PDT by knarf (As for my people, children are their oppressors, and women rule over them. )
[ Post Reply | Private Reply | To 1 | View Replies]

To: rockabyebaby

A life


14 posted on 10/05/2021 6:05:08 AM PDT by 556x45
[ Post Reply | Private Reply | To 10 | View Replies]

To: ShadowAce

The fix causes the same disappointment as when a government shutdown ends.


15 posted on 10/05/2021 6:08:54 AM PDT by ArcadeQuarters (Remember the 2020 backstabbers. No more RINOs ever!)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

“Facebook’s door access systems were also offline, physically preventing access.”

And we are going to let “advanced tech” to shove self driving cars, trucks, airliners, and digital currency on us?

There is a NO GO line to be drawn with these concepts.


16 posted on 10/05/2021 6:09:27 AM PDT by Openurmind (The ultimate test of a moral society is the kind of world it leaves to its children. ~ D. Bonhoeffer)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

Awwww


17 posted on 10/05/2021 6:12:19 AM PDT by SkyDancer (If at first you don't succeed, so much for skydiving)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

For 6 hours I wasn’t able to see what my High School classmates had for breakfast. It was crushing.


18 posted on 10/05/2021 6:13:33 AM PDT by BBQToadRibs2
[ Post Reply | Private Reply | To 1 | View Replies]

To: Openurmind

“self driving cars, trucks”

Miss a payment and you are driving a brick.


19 posted on 10/05/2021 6:14:24 AM PDT by dynachrome ("I will not be reconstructed, and I do not give a damn.")
[ Post Reply | Private Reply | To 16 | View Replies]

To: ShadowAce

Got accustomed to quickly pressing ENTER at “Are you sure?”

I’m sure protection will be baked into this sort of transaction. In the linux world, see “visudo.”


20 posted on 10/05/2021 6:15:35 AM PDT by Cboldt
[ Post Reply | Private Reply | To 1 | View Replies]


Navigation: use the links below to view more comments.
first 1-2021-4041-6061-74 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
News/Activism
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson