Cloudflare 全网出现重大故障
Cloudflare 全网出现重大故障,所有服务均为Degraded Performance经过快速搜索,发现原来是号称拥有premium transit 的US 服务商level 3 出现了故障, 凡是使用了level 3 transit 的服务商,全部出现了重大的服务故障.
从downdetector上可以看到, centurylink,也就是level3, 排在第一位。。。
估计这次故障,level 3 得赔不少钱:lol
消息灵通啊 昨晚这个点CJ登录不了后台,不知道和这个有没有关系:lol Summary of Incident:
———————————————
Yesterday, August 30, 2020 at 6:04am EST, the Hivelocity NOC received alerts regarding a network connectivity issue tied to our Century Link/Level3 (ASN 3356) transit.At this time, our Network Engineering Team noticed BGP sessions with ASN 3356 were flapping and immediately shut them down across our network.However, after we disabled these BGP sessions, effectively de-peering CenturyLink/Level3, they continued announcing our IP space which caused our traffic to be blacked-holed within their backbone network.In laymen’s terms, if your provider was using this backbone, it could not reach our network.We quickly found that this issue was a major outage affecting the entire AS3356 backbone. This included several common Internet Service Providers who lean heavily on this backbone, such as Spectrum and other local ISPs.Due to the nature and scope of this issue, including the fact CenturyLink/Level3 was improperly announcing our routes, the typical solutions we would implement were ineffective.While working with our other transit providers to work around the problem, at roughly 11:00am the Hivelocity network as well as networks across the globe began to normalize as CenturyLink/Level3 began resolving the issue internally
Below is an excerpt from the official CenturyLink RFO we received this morning:
Cause
An offending flowspec announcement prevented Border Gateway Protocol (BGP) from establishing correctly, impacting client services.
Resolution
The IP NOC deployed a configuration change to block the offending flowspec announcement, thus restoring services to a stable state.
Summary
On August 30, 2020 10:04 GMT, CenturyLink identified an issue to be affecting users across multiple markets. The IP Network Operations Center (NOC) was engaged, and initial research identified that an offending flowspec announcement prevented Border Gateway Protocol (BGP) from establishing across multiple elements throughout the CenturyLink Network. The IP NOC deployed a global configuration change to block the offending flowspec announcement, which allowed BGP to begin to correctly establish. As the change propagated through the network, the IP NOC observed all associated service affecting alarms clearing and services returning to a stable state.
Service Impact Times:
———————————————
August 30th, 6:04am EST - 11:30am EST
Additional Information:
———————————————
You can find further information regarding the global impact of yesterday's event at the following links.
https://www.theverge.com/2020/8/30/21407429/cloudflare-down-websites-hulu-feedly-discord
https://9to5mac.com/2020/08/30/centurylink-outage/
https://www.bleepingcomputer.com/news/technology/centurylink-routing-issue-led-to-outages-on-hulu-steam-discord-more/
:lol
页:
[1]