NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Tell HN: Azure outage
mystcb 2 hours ago [-]
Update 16:57 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. In addition. customers may experience issues accessing the Azure Portal. Customers can attempt to use programmatic methods (PowerShell, CLI, etc.) to access/utilize resources if they are unable to access the portal directly. We have failed the portal away from Azure Front Door (AFD) to attempt to mitigate the portal access issues and are continuing to assess the situation.

We are actively assessing failover options of internal services from our AFD infrastructure. Our investigation into the contributing factors and additional recovery workstreams continues. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:57 UTC on 29 October 2025

---

Update: 16:35 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025

---

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

---

Message from the Azure Status Page: https://azure.status.microsoft/en-gb/status

planewave 1 hours ago [-]
Azure Network Availability Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. We suspect that an inadvertent configuration change as the trigger event for this issue. We are taking two concurrent actions where we are blocking all changes to the AFD services and at the same time rolling back to our last known good state.

We have failed the portal away from Azure Front Door (AFD) to mitigate the portal access issues. Customers should be able to access the Azure management portal directly.

We do not have an ETA for when the rollback will be completed, but we will update this communication within 30 minutes or when we have an update.

This message was last updated at 17:17 UTC on 29 October 2025

croemer 1 minutes ago [-]
"We have initiated the deployment of our 'last known good' configuration. This is expected to be fully deployed in about 30 minutes from which point customers will start to see initial signs of recovery. Once this is completed, the next stage is to start to recover nodes while we route traffic through these healthy nodes."

"This message was last updated at 18:11 UTC on 29 October 2025"

cyptus 1 hours ago [-]
AFD is down quite often regionally in Europe for our services. In 50%+ the cases they just don‘t report it anywhere, even if its for 2h+.
RajT88 1 hours ago [-]
Spam those Azure tickets. If you have a CSAM, build them a nice powerpoint telling the story of all your AFD issues (that's what they are there for).

> In 50%+ the cases they just don‘t report it anywhere, even if its for 2h+.

I assume you mean publicly. Are you getting the service health alerts?

tomashubelbauer 31 minutes ago [-]
CSAM apparently also means Customer Success Account Manager for those who might have gotten startled by this message like me.
linohh 27 minutes ago [-]
Thank you, not going to google that shit.
psunavy03 35 minutes ago [-]
Some really unfortunate acronyms flying around the Microsoft ecosystem . . .
RajT88 33 minutes ago [-]
Quite so. The acronym collision rate is high.
cyptus 44 minutes ago [-]
in many cases: no service health alerts, no status page updates and no confirmations from the support team in tickets. still we can confirm these issues from different customers accross europe. Mostly the issues are regional dependent.
cyberax 27 minutes ago [-]
> CSAM

Child Sex-Abuse Material?!? Well, a nice case of acronym collision.

SAI_Peregrinus 13 minutes ago [-]
They must really depend on their government contracts with this administration…
llama052 1 hours ago [-]
I got a service health alert an hour after it started, saying the portal was having issues. Pretty useless and misleading.
RajT88 56 minutes ago [-]
That should go into the presentation you provide your CSAM with as well.

Storytelling is how issues get addressed. Help the CSAM tell the story to the higher ups.

hallh 57 minutes ago [-]
Same experience. We've recently migrated fully away from AFD due to how unreliable it is.
8cvor6j844qw_d6 2 hours ago [-]
I'll be interested in the incident writeup since DNS is mentioned. It will be interesting in a way if it is similar to what happened at AWS.
Insanity 1 hours ago [-]
It's pretty unlikely. AWS published a public 'RCA' https://aws.amazon.com/message/101925/. A race condition in a DNS 'record allocator' causing all DNS records for DDB to be wiped out.

I'm simplifying a bit, but I don't think it's likely that Azure has a similar race condition wiping out DNS records on _one_ system than then propagates to all others. The similarity might just end at "it was DNS".

parliament32 1 hours ago [-]
That RCA was fun. A distributed system with members that don't know about each other, don't bother with leader elections, and basically all stomp all over each other updating the records. It "worked fine" until one of the members had slightly increased latency and everything cascade-failed down from there. I'm sure there was missing (internal) context but it did not sound like a well-architected system at all.
RajT88 1 hours ago [-]
Needs STONITH
kyrra 1 hours ago [-]
cdr420 1 hours ago [-]
It's always DNS
tempest_ 10 minutes ago [-]
It is a coin flip, heads DNS, tails BGP
layer8 43 minutes ago [-]
DNS has both naming and cache invalidation, so no surprise it’s among the hardest things to get right. ;)
jjp 1 hours ago [-]
Whilst the status message acknowledge's the issue with Front Door (AFD), it seems as though the rest of the actions are about how to get Portal/internal services working without relying on AFD. For those of us using Front Door does that mean we're in for a long haul?
llama052 57 minutes ago [-]
Please migrate off of front door. It's been a failure mode since it came out historically. Anything else is better at this point
everfrustrated 40 minutes ago [-]
Didn't the underlying vendor they used for Azure Front Door go bankrupt? It's probably on life support.
jdc0589 2 hours ago [-]
yea its not just the portal. microsoft.com is down too
PeterCorless 1 hours ago [-]
Seems all Microsoft-related domains are impacted in some way.

https://www.xbox.com/en-US also doesn't fully paint. Header comes up, but not the rest of the page.

https://www.minecraft.net/en-us is extremely slow, but eventually came up.

daxfohl 2 hours ago [-]
Downdetector says aws and gcp are down too. Might be in for a fun day.
rozenmd 1 hours ago [-]
From what I can tell, Downdetector just tracks traffic to their pages without actually checking if the site is down.

The other day during the AWS outage they "reported" OVH down too.

linhns 13 minutes ago [-]
Not sure if this is true. I just login to the console with no glitch.
jdc0589 2 hours ago [-]
yea I saw that, but im not sure on how accurate that is. a few large apps/companies I know to be 100% on AWS in us-east-1 are cranking along just fine.
NetMageSCW 1 hours ago [-]
AWS was performance issues and I believe is resolved.
mystcb 2 hours ago [-]
Yeah, I am guessing it's just a placeholder till they get more info. I thought I saw somewhere that internally within Microsoft it's seen as a "Sev 1" with "all hands on deck" - Annoyingly I can't remember where I saw it, so if someone spots it before I do, please credit that person :D

Edit: Typo!

chad_c 2 hours ago [-]
It was here https://news.ycombinator.com/item?id=45749054 but that comment has been deleted.
bossyTeacher 30 minutes ago [-]
It sure must be embarrassing for the website of the second richest company in the world to be down.
planewave 1 hours ago [-]
yes, and it seems that at least for some login.microsoftonline.com is down too, which is part of the Entra login / SSO flow.
NDizzle 11 minutes ago [-]
They briefly had a statement about using Traffic Manager to work with your AFD to work around this issue, with a link to learn.microsoft.com/...traffic-manager, and the link didn't work. Due to the same issue affecting everyone right now.

They quickly updated the message to REMOVE the link. Comical at this point.

eddie_catflap 58 minutes ago [-]
We saw issues before 16:00 UTC - approx 15:38
jonathanlydall 1 hours ago [-]
Yet another reason to move away from Front Door.

We already had to do it for large files served from Blob Storage since they would cap out at 2MB/s when not in cache of the nearest PoP. If you’ve ever experienced slow Windows Store or Xbox downloads it’s probably the same problem.

I had a support ticket open for months about this and in the end the agent said “this is to be expected and we don’t plan on doing anything about it”.

We’ve moved to Cloudflare and not only is the performance great, but it costs less.

Only thing I need to move off Front Door is a static website for our docs served from Blob Storage, this incident will make us do it sooner rather than later.

out_sider 59 minutes ago [-]
we are considering the same but because our website uses APEX domain we would need to move all DNS resolver to cloudfront right ? Does it have as a nice "rule set builder" as azure ?
jonathanlydall 33 minutes ago [-]
Unless you pay for CloudFlare’s Enterpise plan, you’re required to have them host your DNS zone, you can use a different registrar as long as you just point your NS records to Cloudflare.

Be aware that if you’re using Azure as your registrar, it’s (probably still) impossible to change your NS records to point to CloudFlare’s DNS server, at least it was for me about 6 months ago.

This also makes it impossible to transfer your domain to them either, as CloudFlare’s domain transfer flow requires you set your NS records to point to them before their interface shows a transfer option.

In our case we had to transfer to a different registrar, we used Namecheap.

However, transferring a domain from Azure was also a nightmare. Their UI doesn’t have any kind of transfer option, I eventually found an obscure document (not on their Learn website) which had an az command which would let you get a transfer code which I could give to Namecheap.

Then I had to wait over a week for the transfer timeout to occur because there is no way on Azure side that I could find to accept the transfer immediately.

I found CloudFlare’s way of building rules quite easy to use, different from Front Door but I’m not doing anything more complex than some redirects and reverse proxying.

I will say that Cloudflare’s UI is super fast, with Front Door I always found it painfully slow when trying to do any kind of configuration.

Cloudflare also doesn’t have the problem that Front Door has where it requires a manual process every 6 months or so to renew the APEX certificate.

out_sider 19 minutes ago [-]
Thanks :). We don't use Azure as our registrar. It seems I'll have to plan for this then, we also had another issue, AFD has a hard 500ms tls handshake timeout (doesn't matter how much you put on the origin timeout settings) which means if our server was slow for some reason we would get 504 origin timeout.
Figs 16 minutes ago [-]
CloudFlare != CloudFront
out_sider 1 minutes ago [-]
I meant cloudfare
ThatManulTheCat 2 hours ago [-]
DNS. Ofc.
rconti 30 minutes ago [-]
Sounds like they need to move their portal to a region with more capacity for the desired instance type. /s
bob1029 4 minutes ago [-]
For some reason an Azure outage does not faze me in the same way that an AWS outage does.

I have never had much confidence in Azure as a cloud provider. The vertical integration of all the things for a Microsoft shop was initially very compelling. I was ready to fight that battle. But, this fantasy was quickly ruined by poor execution on Microsoft's part. They were able to convince me to move back to AWS by simply making it difficult to provision compute resources. Their quota system & availability issues are a nightmare to deal with compared to EC2.

At this point I'd rather use GCP over Azure and I have zero seconds of experience with it. The number of things Microsoft gets right in 2025 can be counted single-handedly. The things they do get right are quite good, but everything else tends to be extremely awful.

Uehreka 2 hours ago [-]
I noticed that Starbucks mobile ordering was down and thought “welp, I guess I’ll order a bagel and coffee on Grubhub”, then GrubHub was down. My next stop was HN to find the common denominator, and y’all did not disappoint.
pants2 1 hours ago [-]
Good thing HN is hosted on a couple servers in a basement. Much more reliable than cloud, it seems!
dang 28 minutes ago [-]
airstrike 26 minutes ago [-]
I love that "Ask HN: What'd you do while HN was down?" was a thing
lysace 1 hours ago [-]
It was on AWS at least (for a while) in 2022.

https://news.ycombinator.com/item?id=32030400

jjice 24 minutes ago [-]
Yeah looks like they're back on M5.

dang saying it's temporary: https://news.ycombinator.com/item?id=32031136

    $ dig news.ycombinator.com

    ; <<>> DiG 9.10.6 <<>> news.ycombinator.com
    ;; global options: +cmd
    ;; Got answer:
    ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 54819
    ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 1

    ;; OPT PSEUDOSECTION:
    ; EDNS: version: 0, flags:; udp: 512
    ;; QUESTION SECTION:
    ;news.ycombinator.com.  IN A

    ;; ANSWER SECTION:
    news.ycombinator.com. 1 IN A 209.216.230.207

    ;; Query time: 79 msec
    ;; SERVER: 100.100.100.100#53(100.100.100.100)
    ;; WHEN: Wed Oct 29 13:59:29 EDT 2025
    ;; MSG SIZE  rcvd: 65
And that IP says it's with M5 again.
parliament32 1 hours ago [-]
Always has been.
hypeatei 1 hours ago [-]
Starbucks mobile was down during the AWS outage too...
SoftTalker 1 hours ago [-]
They are multi-cloud --- vulnerable to all outages!
mring33621 1 hours ago [-]
you wouldn't believe some of the crap enterprise bigco mgmt put in place for disaster recovery.

they think that they are 'eliminating a single point of failure', but in reality, they end up adding multiple, complicated points of mostly failure.

andoma 1 hours ago [-]
Go multi-cloud they said...
Hamuko 1 hours ago [-]
Gonna build my application to be multicloud so that it requires multiple cloud platforms to be online at the same time. The RAID 0 of cloud computing.
1 hours ago [-]
sergiotapia 1 hours ago [-]
Wow I just left a Starbucks drivethru line because it was just not moving. I guess it was because of this.
hedayet 14 minutes ago [-]
The sad thing is - $MSFT isn't even down by 1%. And IIRC, $AMZN actually went up during their previous outage.

So if we look at these companies' bottom lines, all those big wigs are actually doing something right. Sales and lobbying capacity is way more effective than reliability or good engineering (at least in the short term).

navane 6 minutes ago [-]
Look how important we are, is what these failures show
agency 59 minutes ago [-]
So that's why I can't check in for my Alaska Airlines flight... https://news.microsoft.com/source/features/digital-transform...
kurttheviking 29 minutes ago [-]
I am unable to load this article...presumably for related reasons
Jamie452 2 hours ago [-]
Currently standing in a half closed supermarket because the tills are down and they cant take payments
chasd00 1 hours ago [-]
There's a Family Dollar by my house that is down at least 2 full days per month because of bad inet connectivity. I live close enough that with a small tower on my roof i can get line of sight to theirs. I've thought about offering them a backup link off my home inet if they give me 50% of sales whenever its in use. It would be a pretty good deal for them, better some sales when their inet is down vs none.
jrodom 1 hours ago [-]
50% of sales? what do you think the gross margin is on average for each item sold?
consp 55 minutes ago [-]
2-3%, bit higher on perishables. Though i'd just ask lump sum payments in cash since it likely has to no go through corporate (as in, avoid the corporation).
ryandrake 1 hours ago [-]
You'd think any SeriousBusiness would have a backup way to take customers' money. This is the one thing you always want to be able to do: accept payment. If they made it so they can't do that, they deserve the hit to their revenue. People should just walk out of the store with the goods if they're not being charged.

Why doesn't someone in the store at least have one of those manual kachunk-kachunk carbon copy card readers in the back that they can resuscitate for a few days until the technology is turned back on? Did they throw them all away?

voidmain0001 36 minutes ago [-]
If they used standalone merchant terminals, then those typically use the local LAN which can rollover to cellular or PoT in the event of a network outage. The store can process a card transaction with the merchant terminal and then reconcile with the end of day chit. This article from 2008 describes their PoS https://www.retailtouchpoints.com/topics/store-operations/ca...
BenjiWiebe 44 minutes ago [-]
I think a lot of payment terminals have an option to record transactions offline and upload them later, but apparently it's not enabled by default - probably because it increases your risk that someone pays with a bad card.
ElevenLathe 1 hours ago [-]
The kachunk-kachunk credit card machines need raised digits on the cards, and I don't think most banks have been issuing those for years at this point. Mine have been smooth for at least 10 years.
58 minutes ago [-]
BenjiWiebe 43 minutes ago [-]
Pretty sure it'd be a lot better deal for them to have no sales than to pay out 50% of sales on stuff with single digit margins.
david422 1 hours ago [-]
IIRC, the grocery chain I worked for used to have an offline mode to move customers out the door. But it meant that when the system came back online, if the customers card was denied, the customer got free groceries.
cyberax 20 minutes ago [-]
I remember that banks will try to honor the transactions, even if the customer's balance/credit limit is exhausted. It doesn't apply only to some gift cards.
SoftTalker 2 hours ago [-]
Mind-boggling that any retailer would not have the capability to at least run the checkout stations offline.
withinboredom 1 hours ago [-]
I knew an old guy in the '00s who specialized in cobal/fortran for working on tiller software. Guess he retired and they couldn't maintain it
computerdork 1 hours ago [-]
Anyone remember Bob's number?? Bob?! Oh the humanity! We're all gonna be canned!
gianpaj 1 hours ago [-]
Can't download VSCode :D

Error: visual-studio-code: Download failed on Cask 'visual-studio-code' with message: Download failed: https://update.code.visualstudio.com/1.105.1/darwin-arm64/st...

vachina 1 hours ago [-]
microsoft.com and some subdomains (answers.microsoft.com) has no A and AAA records. They screwed up big time.

https://archive.is/Q4izZ

0xbadcafebee 1 hours ago [-]
That specific subdomain has issues with propagation: https://dnschecker.org/#A/answers.microsoft.com (only four resolvers return records)

The root zone and www. do not: https://dnschecker.org/#A/microsoft.com (all resolvers return records)

And querying https://www.microsoft.com/ results in HTTP 200 on the root document, but the page elements return errors (a 504 on the .css/.js documents, a 404 on some fonts, Name Not Resolved on scripts.clarity.ms, Connection Timed Out on wcpstatic.microsoft.com and mem.gfx.ms). That many different kinds of errors is actually kind of impressive.

I'm gonna say this was a networking/routing issue. The CDN stayed up, but everything else non-CDN became unroutable, and different requests traveled through different paths/services, but each eventually hit the bad network path, and that's what created all the different responses. Could also have been a bad deploy or a service stopped running and there's different things trying to access that service in different ways, leading to the weird responses... but that wouldn't explain the failed DNS propagation.

Aperocky 42 minutes ago [-]
wow, right after AWS suffered a similar thing.

I wonder if this is microsoft "learning" to "prevent" such an issue and instead triggered it...

"One often meets his destiny on the path he takes to avoid it" -- Master Oogway

0000000000100 48 minutes ago [-]
Yeah just took down the prod site for one of our clients since we host the front-end out of their CDN. Just got wrapped up panic hosting it somewhere else for the past hour, very quickly reminds you about the pain of cookies...
alt227 32 minutes ago [-]
... and DNS caching, and browser file cache, and sessions...

Moving a website quickly is never fun.

irusensei 8 minutes ago [-]
I was working when I saw the portal page showing only resource groups and lots of items missing. I thought it was a weird browser cache issue.

The actual stuff I was working on (App Insights, Function App) that was still open was operational.

move-on-by 18 minutes ago [-]
Instead of cyber security awareness month, we should rename it to cloud availability awareness month.
kierenj 2 hours ago [-]
Ouch, and login.microsoftonline.com too - i.e. SSO using MS accounts. We'd just rolled that out across most (all?) of our internal systems...

And microsoft.com too - that's gotta hurt

parliament32 2 hours ago [-]
SSO and 365 are working fine for us, but admin portals for Azure/365 are down. Our workloads in Azure don't seem to be impacted.
planewave 1 hours ago [-]
It is interesting to see the differential across different tenants in different geographies:

- on a US tenant I am unable to access login.microsoftonline.com and the login flow stalls on any SSO authentication attempt.

- on a European tenant, probably germany-west, I am able to login and access the Azure portal.

juancroldan 1 hours ago [-]
Guess you have NASSO now (Not A Single Sign On)
ocdtrekkie 2 hours ago [-]
I am still stunned people choose to do this, considering major Office 365 outages are basically a weekly thing now.
NetMageSCW 1 hours ago [-]
We are very dependent on Azure and Microsoft Authentication and Microsoft 365 and haven’t had weekly or even monthly issues. I can think of maybe three issues this year.
gmassman 59 minutes ago [-]
I’ve been migrating our services off of Azure slowly for the past couple of years. The last internet facing things remaining are a static assets bucket and an analytics VM running Matomo. Working with Front Door has been an abysmal experience, and today was the push I needed to finally migrate our assets to Cloudflare.

I feel pretty justified in my previous decisions to move away from Azure. Using it feels like building on quicksand…

alt227 29 minutes ago [-]
All the clouds hav had major outages this year.

At this point I dont believe that any one of them is any better or reliable than the others.

chemodax 2 hours ago [-]
For me the same. It's very confusing that status page [1] is green

[1]: https://azure.status.microsoft/en-us/status

martini333 2 hours ago [-]
That status page is never red. Absolutely useless.

> There are currently no active events. Use Azure Service Health to view other issues that may be impacting your services.

Links to a page on Azure Portal which is down...

endianswap 2 hours ago [-]
It's red right now.
Sharparam 1 hours ago [-]
Only for the Azure Portal, despite Front Door also being down but showing as green on the status page.
12_throw_away 13 minutes ago [-]
Heh, now it says Front Door and "Network Infrastructure" are down. That second one seems bad.
kylecazar 2 hours ago [-]
They added a message at the same time as your comment:

"We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly."

port11 25 minutes ago [-]
So much of Belgium runs on Azure… it's honestly baffling how many services are down, there's no resilience built into (even large) companies anymore.
Steven_Vellon 2 hours ago [-]
For us, it looks like most services are still working (eastus and eastus2). Our AKS cluster is still running and taking requests. Failures seem limited to management portal.
kierenj 2 hours ago [-]
Sorry - my bad. I literally just connected an old XP VM to the internet to activate it.
MangoCoffee 1 hours ago [-]
The Internet is supposed to be decentralized. The big three seem to have all the power now (Amazon, Microsoft, and Google) plus Cloudflare/Oracle.

How did we get here? Is it because of scale? Going to market in minutes by using someone else's computers instead of building out your own, like co-location or dedicated servers, like back in the day.

deaux 4 minutes ago [-]
From today [0].

> Big Tech lobbying is riding the EU’s deregulation wave by spending more, hiring more, and pushing more, according to a new report by NGO’s Corporate Europe Observatory and LobbyControl on Wednesday (29 October).

> Based on data from the EU’s transparency register, the NGOs found that tech companies spend the most on lobbying of any sector, spending €151m a year on lobbying — a 33 percent increase from €113m in 2023.

Gee whizz, I really do wonder how they end up having all the power!

[0] https://news.ycombinator.com/item?id=45744973

kube-system 1 hours ago [-]
It still is very decentralized. We are discussing this via the internet right now.
Mr_Bees69 1 hours ago [-]
Yeah, but MyChart is down.
1 hours ago [-]
acedTrex 1 hours ago [-]
big, if true
1 hours ago [-]
mrinterweb 1 hours ago [-]
A lot of money and years of marketing the cloud as the responsible business decision led us here. Now that the cloud providers have vendor lock-in, few will leave, and customers will continue to wildly overpay for cloud services.
gwbas1c 1 hours ago [-]
Ahh, but you forget what it used to be like. Sites used to go down all the time.

Now, they go down a lot less frequently, but when they do, it's more widespread.

JoBrad 31 minutes ago [-]
It’s the Heisenberg cloud principal.
bossyTeacher 27 minutes ago [-]
Not sure how the current situation is better. Being stranded with no way whatsoever to access most/all of your services sounds way more terrifying than regular issues limited to a couple of services at a time
gwbas1c 5 minutes ago [-]
> no way whatsoever to access most/all of your services

I work on a product hosted on Azure. That's not the case. Except for front door, everything else is running fine. (Front door is a reverse proxy for static web sites.)

The product itself (an iot stormwater management system) is running, but our customers just can't access the website. If they need to do something, they can go out to the sites or call us and we can "rub two sticks together" and bypass the website. (We could also bypass front door if someone twisted our arms.)

Most customers only look at the website a few times a year.

---

That being said, our biggest point of failure is a completely different iot vendor who you probably won't hear about on Hacker News when they, or their data networks, have downtime.

alt227 1 hours ago [-]
Thats the whole point, big players like AWS and MS can go down, but here we are still talking on the internet.

Decentralisation is winning it seems.

jslaby 30 minutes ago [-]
Not everyone has moved over, but I'm sure there have been thoughts or plans to.
nzach 1 hours ago [-]
> How did we get here?

I think the response lies in the surrounding ecosystem.

If you have a company it's easier to scale your team if you use AWS (or any other established ecosystem). It's way easier to hire 10 engineers that are competent with AWS tools than it is to hire 10 engineers that are competent with the IBM tools.

And from the individuals perspective it also make sense to bet on larger platforms. If you want to increase your odds of getting a new job, learning the AWS tools gives you a better ROI than learning the IBM tools.

codethief 35 minutes ago [-]
Meredith Whittaker (of Signal) addressed your question the other day: https://mastodon.world/@Mer__edith/115445701583902092
anonymars 47 minutes ago [-]
Efficiency (aka cost) <---> Resiliency/redundancy

Pick your point on the scale

AndrewKemendo 47 minutes ago [-]
A natural monopoly is a monopoly in an industry in which high infrastructure costs and other barriers to entry relative to the size of the market give the largest supplier in an industry, often the first supplier in a market, an overwhelming advantage over potential competitors. Specifically, an industry is a natural monopoly if a single firm can supply the entire market at a lower long-run average cost than if multiple firms were to operate within it. In that case, it is very probable that a company (monopoly) or a minimal number of companies (oligopoly) will form, providing all or most of the relevant products and/or services.

https://en.wikipedia.org/wiki/Natural_monopoly

pphysch 1 hours ago [-]
Consolidation is the inevitable outcome of free unregulated markets.

In our highly interconnected world, decentralization paradoxically requires a central authority to enforce decentralization by restricting M&A, cartels, etc.

SoKamil 55 minutes ago [-]
Is there a theorem that models this behavior? Capital feels like a mass that attracts more mass the larger it becomes, like gravity.
SecretDreams 1 hours ago [-]
> How did we get here?

Stonks

TacticalCoder 52 minutes ago [-]
[dead]
ape4 2 hours ago [-]
2026: the year of your own metal in a rack
drewnick 1 hours ago [-]
I've been doing it since 1998 in my bedroom with a dual T1 (and on to real DCs later). While I've had some outages for sure it makes me feel better I am not that divergent in uptime in the long run vs big clouds.
dylan604 1 hours ago [-]
Are you still on a dual T1? that's gotta be expensive
daveguy 49 minutes ago [-]
(and on to real DCs later) would imply their bare metal is now located in a data center.
dylan604 42 minutes ago [-]
really should stop skimming the comment when i find a part to comment on <facepalm>
mythz 2 hours ago [-]
High availability is touted as a reason for their high prices, but I swear I read about major cloud outages far more than I experience any outages at Hetzner.
prmoustache 33 minutes ago [-]
I think the biggest features of the big cloud vendors is that when they are down, not only you but your customers and your competitors usually have issues at the same time so everybody just shrug and have a lazy/off day at the same time. Even on call teams reall just have to wait and stay on standby because there is very little they can do. Doing a failover can be slower than waiting for the recovery, not help at all if outage is spanned accross several region, or bring aditional risks.

And more importantly nobody lose any reputation except AWS/Azure/Google.

zavec 31 minutes ago [-]
It's like back in school when there was a snow day!
graemep 1 hours ago [-]
Ostensible reason.

The real reason is that outages are not your fault. Its the new version of "nobody ever got fired for buying IBM" - later it became MS, and now its any big cloud provider.

30 minutes ago [-]
jmaker 40 minutes ago [-]
For one it’s statistics - Hetzner simply runs far fewer major services than hyperscalers. And the services they run are also more affluent, with larger customer bases, so downtimes are systemically critical. Therefore it’s louder.

On the merits though, I agree, haven’t had any serious issues with Hetzner.

bad_haircut72 2 hours ago [-]
Same with DigitalOcean. I run one box and it hasnt gone down for like 2 years
yabones 58 minutes ago [-]
DO has been shockingly reliable for me. I shut down a neglected box almost 900 days uptime the other day. In that time AWS has randomly dropped many of my boxes with no warning requiring a manual stop/start action to recover them... But everybody keeps telling me that DO isn't "as reliable" as the big three are.
robotnikman 32 minutes ago [-]
Same here, I run a few droplets for personal projects and never had any issues with then.
bongodongobob 2 hours ago [-]
It's just the admin portal.
12_throw_away 22 minutes ago [-]
Nope, more than the portal. For instance, I just searched for "Azure Front Door" because I hadn't heard of it before (I now know it's a CDN), and neither the product page itself [1] nor the technical docs [2] are coming up for me.

[1] https://azure.microsoft.com/en-us/products/frontdoor

[2] https://learn.microsoft.com/en-us/azure/frontdoor/front-door...

Foobar8568 47 minutes ago [-]
Plenty of sites are down and/or login not available. It's just really a mess.
ikamm 24 minutes ago [-]
The bank I work at is reporting all Power Apps applications are down.
pocketman 19 minutes ago [-]
It looks like it is just the 365 admin panels for us. Admittedly, we don't currently host any other services on Azure though.
NDizzle 2 hours ago [-]
It's absolutely not only the admin portal.

It's CDN and FrontDoor at least.

bongodongobob 2 hours ago [-]
Interesting, everything else is working just fine for us. Offices across the US.
Jarwain 1 hours ago [-]
Do you use on front door? Our VMs that don't are working fine, but our app services that do aren't.
out_sider 1 hours ago [-]
we use front door (as does miccrosoft.com) and our website was down, I was able to change the DNS records to point directly to our server and will leave it like that for a few hours until everything is green
flumpcakes 2 hours ago [-]
Pretty much all Azure services seem to be down. Their status page says it's only the portal since 16:00. It would be nice if these mega-companies could update their status page when they take down a large fraction of the Internet and thousands of services that use them.
kierenj 2 hours ago [-]
FWIW, all of our databases, VMs, AKS clusters, services, jobs etc - are all working fine. Which services are down for you, maybe we can build a list?
reknih 2 hours ago [-]
Front Door is down for us (as Azure‘s Twitter account confirms)
parliament32 1 hours ago [-]
All of our Azure workloads are up, but we don't use Azure Front Door. That seems to be the only impacted product, apart from the management portal.
flumpcakes 1 hours ago [-]
We're using Application Gateway for ingress, that seems to be effected.
jayw_lead 1 hours ago [-]
Same playbook for AWS. When they admitted that Dynamo was inaccessible, they failed to provide context that their internal services are heavily dependent on Dynamo

It's only after the fact they are transparent about the impact

ApolloFortyNine 27 minutes ago [-]
Two hours after the initial outage, they have finally updated the Front Door status on their status page.
givemeethekeys 1 hours ago [-]
Surely more vibecoding will fix this problem. Time to fire more staff
martijnvds 18 minutes ago [-]
This probably explains why paying for street parking in Cologne by phone/web didn't work (eternal spinner) then
AdmiralAsshat 1 hours ago [-]
Some exec at Microsoft told the Azure guys to ape everything Amazon does and they took it literally.
Telemakhos 57 minutes ago [-]
Or, the NSA needed to upgrade their access at both.
embedding-shape 48 minutes ago [-]
Do Microsoft still say "If the government has a broader voluntary national security program to gather customer data, we don't participate in it" today (which PRISM proved very false), or are they at least acknowledging they're participating in whatever NSA has deployed today?
terminalshort 40 minutes ago [-]
PRISM wasn't voluntary. Also there are 3 levels here:

1. Mandatory

2. "Voluntary"

3. Voluntary

And I suspect that very little of what the NSA does falls into category 3. As Sen Chuck Schumer put it "you take on the intelligence community, they have six ways from Sunday at getting back at you"

cruffle_duffle 17 minutes ago [-]
“Voluntold”
jrochkind1 55 minutes ago [-]
I was gonna say that obv AWS hacked em to even things up.
dboreham 1 hours ago [-]
This is funny but also possibly true because: business/MBA types see these outages as a way to prove how critical some services are, leading to investors deciding to load up on the vendor's stock.
alt227 55 minutes ago [-]
I may or may not have been known to temporarily take a database down in the past to make a point to management about how unreliable some old software is.
cbovis 2 hours ago [-]
Looks to be affecting our pipelines that rely on Playwright as they download images from Azure e.g. https://playwright.azureedge.net/builds/chromium/1124/chromi... which aren't currently resolving.
mystcb 2 hours ago [-]
Updated 16:35 UTC

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025

----

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

-- From the Azure status page

blenderob 2 hours ago [-]
https://azure.status.microsoft/en-us/status says everything's fine! Any place I can read more about this outage?
reid 2 hours ago [-]
You're looking at it. I couldn't find any discussion elsewhere yet...
sbergot 2 hours ago [-]
official status pages are useless most of the time.
sbergot 2 hours ago [-]
now there is an information about "Azure Portal Access Issues". No word about front door being down.
The_President 2 hours ago [-]
[dead]
tartieret 2 hours ago [-]
Microsoft posted an update on X: https://x.com/AzureSupport/status/1983569891379835372?ref_sr...

"We’re investigating an issue impacting Azure Front Door services. Customers may experience intermittent request failures or latency. Updates will be provided shortly."

llama052 2 hours ago [-]
Always fun when you can't trust the main status page but have to go to some opinionated social medial website to see the actual problem.
drjasonharrison 54 minutes ago [-]
https://www.cbc.ca/news/investigates/tesla-grok-mom-9.695693...

This mom’s son was asking Tesla’s Grok AI chatbot about soccer. It told him to send nude pics, she says

xAI, the company that developed Grok, responds to CBC: 'Legacy Media Lies'

LouisLazaris 44 minutes ago [-]
The VS Code website is down: https://code.visualstudio.com/

And so is Microsoft: http://www.microsoft.com/

codethief 28 minutes ago [-]
https://www.microsoft.com works for me (with the www subdomain).
anon025 45 minutes ago [-]
It's the DNS https://dnschecker.org/#A/get.helm.sh is unreachable
I_am_tiberius 29 minutes ago [-]
Why are Azure App Services still working?
tyfon 2 hours ago [-]
Seems to be down in Norway.

Even the national digital id service is down.

hexbin010 1 hours ago [-]
> Even the national digital id service is down.

Can't help but smirk as my country is ramming through "Digital ID" right now

44 minutes ago [-]
Jarwain 2 hours ago [-]
On our end, our VMs are still working, so our gitlab instance is still up. Our services using Azure App Services are available through their provided url. However, Front Door is failing to resolve any domains that it was responsible for.
m_fayer 2 hours ago [-]
ApolloFortyNine 1 hours ago [-]
They admit in their update blurb azure front door is having issues but still report azure front door as having no issues on their status page.

And it's very clear from these updates that they're more focused on the portal than the product, their updates haven't even mentioned fixing it yet, just moving off of it, as if it's some third party service that's down.

consp 1 hours ago [-]
> as having no issues on their status page

Unsubstantiated idea: So the support contract likely says there is a window between each reporting step and the status page is the last one and the one in the legal documents giving them several more hours before the clauses trigger.

1 hours ago [-]
empath75 1 hours ago [-]
Friend of mine at MSFT says it's a Sev-0 outage and they can't even get to the ticket tracking system.
LaserToy 2 hours ago [-]
Azure portal still insists the issue is jsut with Console.

We had to bypass the Frontdoor

ukblewis 17 minutes ago [-]
GitHub also seems to be having trouble for me
speckx 1 hours ago [-]
chuckadams 1 hours ago [-]
Which itself is^H^H was down. Wow.
speckx 1 hours ago [-]
Boxersteavee 1 hours ago [-]
503 Service Unavailable
42 minutes ago [-]
SoftTalker 2 hours ago [-]
We're on Office 365 and so far it's still responding. At least Outlook and Teams is.
jeffdn 2 hours ago [-]
They don't run on Azure!
RajT88 52 minutes ago [-]
They definitely do run on Azure. Probably not 100%, but at least some footprint of those services do.
rcarmo 2 hours ago [-]
Are you absolutely sure?
jansper39 2 hours ago [-]
They don't, however authentication for those services relies on Entra ID which seems to be affected.
rcarmo 2 hours ago [-]
I'd say DNS/Front Door (or some carrier interconnect) is the thing affected, since I can auth just fine in a few places. (I'm at MS, but not looped into anything operational these days, so I'm checking my personal subscription).
rcarmo 2 hours ago [-]
Not seeing it. I have VMs in US East and Netherlands and they're up.
reid 2 hours ago [-]
Portal and Azure CDN are down here in the SF Bay Area. Tenant azureedge.net DNS A queries are taking 2-6 seconds and most often return nothing. I got a couple successful A response in the last 10 minutes.

Edit: As of 9:19 AM Pacific time, I'm now getting successful A responses but they can take several seconds. The web server at that address is not responding.

vpears87 2 hours ago [-]
At least MSFT is consistent: https://www.microsoft.com/en-us/ is down as well
CommanderData 1 hours ago [-]
Likely behind Azure Front Door.

Much of Xbox is behind that too.

jacquesclouseau 16 minutes ago [-]
My bet is on a bad config change.
hypeatei 59 minutes ago [-]
All of my employers things are hosted on Azure and running just fine and didn't go down at all. Portal access has been fixed.

Doesn't seem to be too bad of an outage unless you were relying on Azure Front Door.

randomsofr 56 minutes ago [-]
SSO is down, Azure Portal Down and more, seems like a major outage. Already a lot of services seem to be affected: banks, airlines, consumer apps, etc.
hypeatei 54 minutes ago [-]
The portal is up for me and their status page confirms they did a failover for it. Definitely not disputing that its reach is wide, but a lot of smaller setups probably aren't using Front Door.
rahkiin 52 minutes ago [-]
Both work for me in the Netherlands
everfrustrated 42 minutes ago [-]
GitHub runners (specifically the "larger" runner types) are all down for us. These are known to be hosted on Azure.
AtNightWeCode 7 minutes ago [-]
Earnings report today. A coincidence?

I can at least login to Azure. But several MS sites are down.

elFarto 2 hours ago [-]
We saw all incoming traffic to our app drop to zero at about 15:45. I wonder how long this one will take to fix.
sech8420 2 hours ago [-]
Same exact time for us as well.
vincebowdren 2 hours ago [-]
UK, and other regions too; our APAC installation in Australia is affected.
andhuman 2 hours ago [-]
I bet it’s DNS.
andhuman 2 hours ago [-]
“ Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025”

2 hours ago [-]
glzone1 54 minutes ago [-]
Wasn't the saying "It's always DNS" floating around somewhere?

Be interesting to understand cause here. Pretty big impact on services we use

a_f 2 hours ago [-]
Looks like MyGet is impacted too. Seems like they use Azure:

>What is required to be able to use MyGet? ... MyGet runs its operations from the Microsoft Azure in the West Europe region, near Amsterdam, the Netherlands.

ksec 2 hours ago [-]
>Last week AWS, now this.

This is not the first or second time this happened, multiple Hyperscaler failed one by one.

2 hours ago [-]
borg16 2 hours ago [-]
i guess folks in azure wanted to show some solidarity with aws brethren

(couldn't resist adding it. i acknowledge this comment adds no value to the discussion)

aurumque 1 hours ago [-]
Azure goes down all the time. On Friday we had an entire regional service down all day. Two weeks ago same thing different region. You only hear about it when it's something everyone uses like the portal, because in general nobody uses Azure unless they're held hostage.
Mr_Bees69 1 hours ago [-]
Yeah, im regretting my decision to buy an xbox now. Every once in a while, everything goes down.
syntaxing 1 hours ago [-]
I absolutely love the utility aspect of LLMs but part of me is curious if moving faster by using AI is going to make these sorts of failure more and more often.
monkaiju 58 minutes ago [-]
If true then what "utility" is there?
vs4vijay 2 hours ago [-]
ipsum2 2 hours ago [-]
Status page (first link) is down for me. Second one works
charv 1 hours ago [-]
oh the irony, the status link being down too
karateka01 1 hours ago [-]
status page being affected by the same issue is so lame
millzlane 39 minutes ago [-]
It begs the question from a noob like me... Where should they host the status page? Surely it shouldn't be on the same infra that it's supposed to be monitoring. Am I correct in thinking that?
aftergibson 2 hours ago [-]
Looks like the status page is overloaded...
alt227 2 hours ago [-]
Cant access certain banking websites in the UK, I am assuming it because of this.

https://www.natwest.com/

glzone1 53 minutes ago [-]
I remember the saying "It's always DNS". I'm old.

Kind of mindboggling it's still sometimes DNS maybe.

jacquesm 1 hours ago [-]
It is much more than azure. One of my kids needs a key for their laptop and can't reach that either. Great excuse though, 'Azure ate my homework'. What a ridiculous world we are building. Fuck MS and their account requirements for windows.
2 hours ago [-]
reid 2 hours ago [-]
This is impacting the Azure CDN at azureedge.net. DNS A records for azureedge.net tenants are taking 2-6 seconds and often return nothing.
etyhhgfff 29 minutes ago [-]
It's always DNS, unless it's not DNS.
djeastm 2 hours ago [-]
I'm mid-deployment, but thankfully it seems to be running ok so far. Just the portal is not working so my visibility is not good.
nartaczact 1 hours ago [-]
Sounds like Shrodinger's Deploy
thimkerbell 1 hours ago [-]
Does (should, could) DownDetector also say what customer-facing services are down, when some infrastructure is unworking? Or is that the info that the malefactors are seeking?
bronco21016 2 hours ago [-]
Unable to access the portal and any hit to SSO for other corporate accesses is also broken. Seems like there's something wrong in their Identity services.
ZeroConcerns 2 hours ago [-]
Oh, well, I'm sure Azure will be given the same pass that AWS got here recently when they had their 12-hour outage...
taeric 2 hours ago [-]
I didn't realize AWS got a pass?
graemep 1 hours ago [-]
Have repeated outages lost them customers? has it lost them any money in any way?

That is a pass.

taeric 51 minutes ago [-]
Apologies, but this just reads like a low effort critique of big things.

To be clear, they should get criticism. They should be held liable for any damage they cause.

But that they remain the biggest cloud offering out there isn't something you'd expect to change from a few outages that, by most all evidence, potential replacements have, as well? More, a lot of the outages potential replacements have are often more global in nature.

philipallstar 28 minutes ago [-]
Have people left GitHub due to the multiple post-acquisition outages? That is a pass if you don't judge it the same way.
prmoustache 31 minutes ago [-]
Well, they have successfully locked their customers captive thanks to huge egress fees.
jimmyl02 1 hours ago [-]
pretty interesting how datadog's uptime tracker (https://updog.ai/) says all the sites are fully available.

if that's true then it's a sign that Azure's control / data plane separation is doing it's job! at least for now

jonathanlydall 51 minutes ago [-]
Our Azure hosted dotnet App Service is working fine, but our docs site served via Front Door went down. Can’t access anything through the Portal.
layer8 47 minutes ago [-]
Maybe they need a downtime tracker. ;)
tecleandor 1 hours ago [-]
LinkedIn has been acting funny for an hour or so, and some pages in the learn.microsoft.com domain have been failing for me too...
baconbrand 2 hours ago [-]
Our Azure DevOps site is still functioning and our Azure hosted databases are accessible. Everything else is cooked.
dlcarrier 2 hours ago [-]
Yesterday Amazon, today Microsoft. Are Google's cloud services going down tomorrow?
gtowey 1 hours ago [-]
This is because Azure just copies everything AWS does. Google is a bit more innovative, they will have something else unexpected happen.
dkdcio 1 hours ago [-]
throwback to when they deleted a customer's entire account! https://arstechnica.com/gadgets/2024/05/google-cloud-acciden...
Insanity 1 hours ago [-]
Maybe they are and no one realized yet.. :P

That said, I don't hear about GCP outages all that often. I do think AWS might be leading in outages, but that's a gut feeling, I didn't look up numbers.

luhn 1 hours ago [-]
They had a pretty massive one earlier this year. https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1S...

This isn't GCP's fault, but the outage ended up taking down Cloudflare too, so in total impact I think that takes the cake.

xenolithis 1 hours ago [-]
fairly certain they had a significant multi region outage within the past few years. I'll try to find some details to link.

Few customers....few voices to complain as well.

Mr_Bees69 1 hours ago [-]
as a victim of xbox, azure is down 'bout as often as its up
briffle 1 hours ago [-]
here's hoping its Oracle's cloud instead....
m_fayer 2 hours ago [-]
And if they don't, we'll know who the culprit is.
shishcat 1 hours ago [-]
Who?
1 hours ago [-]
opengrass 1 hours ago [-]
Github Actions and Codespaces degraded.
avgDev 2 hours ago [-]
I am having a bunch of issues. It looks like their sites and azure are both affected.

I also got weird notification in VS2022 that my license key was upgraded to Enterprise, but we did not purchase anything.

Mr_Bees69 1 hours ago [-]
Might be a failsafe, if you cant get a license status, and you're aware that MS is down, just default to the highest tier.
ThatManulTheCat 2 hours ago [-]
Free upgrade
amluto 56 minutes ago [-]
vscode.dev appears to be down. I think this will be my excuse to find an alternative -- I never really liked vscode.dev anyway.

(Coder is currently at the top of the experiment list. Any other suggestions?)

AtNightWeCode 2 minutes ago [-]
From Azure status page: "Customers can consider implementing failover strategies with Azure Traffic Manager, to fail over from Azure Front Door to your origins".

What a terrible advise.

Mr_Bees69 1 hours ago [-]
MS website seems to be up but really slow. Think xbox might still be down, Bing works for some reason tho!?
baconbrand 2 hours ago [-]
All of our sites went down. This is my company’s busiest time of year. Hooray.
8cvor6j844qw_d6 2 hours ago [-]
Quite close to the recent AWS outage. Let me take a look if its a major one similar to AWS.

Any guess on what's causing it?

In hindsight, I guess the foresight of some organizations to go multi-cloud was correct after all.

jcims 1 hours ago [-]
We're multi-cloud and it really saved a few workloads last week with the AWS issue.

It's not easy though.

stuff4ben 2 hours ago [-]
It's always freakin DNS...
conroydave 2 hours ago [-]
cost cutting attempts
iAMkenough 2 hours ago [-]
Trusting AI without sufficient review and oversight of changes to production.
whynotminot 2 hours ago [-]
Yeah, these things never happened when humans were trusted without sufficient review and oversight of changes to production.
shepherdjerred 2 hours ago [-]
Do you have any insight or do you just dislike AI? Incidents like this happened long before AI generated code
Capricorn2481 2 hours ago [-]
I don't think it's meant to be serious. It's a comment on Microsoft laying off their staff and stuffing their Azure and Dotnet teams with AI product managers.
Sharparam 2 hours ago [-]
The learning modules on https://learn.microsoft.com/ also seem to have a lot of issues properly loading.
rsolva 14 minutes ago [-]
So that's why all of our municipality's digital services are down ... utter chaos at the political meeting I attended just now.
chemodax 2 hours ago [-]
It seems Azure FrontDoor is affected, because our private VM works fine in different regions.
howard941 44 minutes ago [-]
Took out the archive.ph and .is sites too?
I_am_tiberius 55 minutes ago [-]
Shouldn't regions be completely independent?
ThatManulTheCat 2 hours ago [-]
Azure portal currently mostly not working (UK)... Downdetector reporting various Microsoft linked services are out (Minecraft, Microsoft 365, Xbox...)
kryogen1c 2 hours ago [-]
downdetector reports coincident cloudflare outage. is microsoft using cloudflare for management plane, or is there common infra? data center problem somewhere, maybe fiber backbone? BGP?
wingless_angel 56 minutes ago [-]
Please sort it out, I'll be out of a job tomorrow.
almosthere 1 hours ago [-]
Reports of Azure and AWS down on the same day? Infrastructure terrorism?
12_throw_away 5 minutes ago [-]
> Infrastructure terrorism?

Unless that's a euphemism for "vibe coding", no.

reaperducer 29 minutes ago [-]
Reports of Azure and AWS down on the same day? Infrastructure terrorism?

> We have confirmed that an inadvertent configuration change as the trigger event for this issue.

Save the speculation for Reddit. HN is better than that.

macshome 2 hours ago [-]
I just tried to check the Xbox services status page and it never even loaded.
chokolad 1 hours ago [-]
Majority of actual Xbox services are working fine, xbox.com itself is busted.
twodave 2 hours ago [-]
Appears to be an issue in Front Door. Our back end stuff is fine but FD is bouncing everything.
NDizzle 2 hours ago [-]
Yeah, I have non prod environments that don't use FD that are functioning. Routing through FD does not work. And a different app, nonprod doesn't use FD (and is working) but loads assets from the CDN (which is not working).

FD and CDN are global resources and are experiencing issues. Probably some other global resources as well.

Hate to say it, but DNS is looking like it's still the undisputed champ.

vanviegen 2 hours ago [-]
Many (all?) LinkedIn profiles are also down for me. Luckily the frontpage still works. ;-)

Go cloud!

AznHisoka 2 hours ago [-]
Luckily?
ThatManulTheCat 2 hours ago [-]
Yudkowsky's feared Superintellignece holding Azure hostage
vinyl7 2 hours ago [-]
Vibe coded internet keeps getting better
avgDev 2 hours ago [-]
Quick find someone who can actually read documentation and code!
the_af 1 hours ago [-]
You just paste the outage error codes back to the LLM and pray it's still working and can fix whatever went wrong!
m_fayer 1 hours ago [-]
When all the people forget to code for themselves, every LLM will code itself out of existence with that one last bug. One, after another.
philipallstar 1 hours ago [-]
Can't get to microsoft.com even.
pred8er 41 minutes ago [-]
on the line with msft, they said 4 hours is what they are thinking. a workaround they are saying is to use traffic manager,
dlcarrier 2 hours ago [-]
We're quickly learning who's relying on a single cloud provider.
shagie 1 hours ago [-]
MiguelHudnandez 29 minutes ago [-]
When you look at the scale of the reports, you find they are much lower than Azure's. seeing a bunch of 24-hour sparkline type graphs next to each other can make it look like they are equally impacted, but AWS has 500 reports and Azure has 20,000. The scale is hidden by the choice of graph.

In other words, people reporting outages at AWS are probably having trouble with microsoft-run DNS services or caching proxies. It's not that the issues aren't there, it's that the internet is full of intermingled complexity. Just that amount of organic false-positives can make it look like an unrelated major service is impacted.

Insanity 1 hours ago [-]
Multi cloud is really hard to get right at scale, and honestly not worth the effort for the majority of companies and use-case.
llimos 2 hours ago [-]
Yep, down from here too (in Israel).

Services too, not just the portal.

andoma 2 hours ago [-]
Can confirm
zelias 1 hours ago [-]
Anyone have betting odds on when Google will go down next? Are we looking at all 3 providers having outages in the span of 3 weeks?
uuuubbbb 2 hours ago [-]
Intune, Azure, Entra down in Switzerland
patching-trowel 2 hours ago [-]
As of now Azure Status page still shows no incident. It must be manually updated, someone has to actively decide to acknowledge an issue, and they're just... not. It undermines confidence in that status page.
baconbrand 2 hours ago [-]
I have never noticed that page being updated in a timely manner.
charles_f 2 hours ago [-]
It shows that some people have issues accessing the portal.
thewisenerd 2 hours ago [-]
they recently had an incident with front door reachability, wonder if it's back.

QNBQ-5W8

kryogen1c 2 hours ago [-]
downdetector reports coincident cloudflare outage. is microsoft using cloudflare for management plane, or is there common infra? data center problem somewhere, maybe fiber backbone? BGP?
Mr_Bees69 1 hours ago [-]
nope, dont see any cf issues.
rluhar 2 hours ago [-]
Looks like AWS is also impacted?
zavec 2 hours ago [-]
Yeah the graph for that one looks exactly the same shape. I wonder if they were depending on some azure component somehow, or maybe there were things hosted on both and the azure failure made enough things failover to AWS that AWS couldn't cope? If that was the case I'd expect to see something similar with GCP too though.

Edit: nope looks like there's actually a spike on GCP as well

estel 2 hours ago [-]
It's possibly more likely that people mis-attribute the cause of an outage to the wrong providers when they use downdetector.
zavec 1 hours ago [-]
Definitely also a strong possibility. I wish I had paid more attention during the AWS one earlier to see what other things looked like on there at the time.
okokwhatever 59 minutes ago [-]
This cannot be a coincidence
pred8er 1 hours ago [-]
looks like MS completed a failover and things are be recovering slowly
somerandomness 2 hours ago [-]
yep having trouble logging into https://entra.microsoft.com/ as well
kierenj 2 hours ago [-]
microsoft.com is back -

edit: it worked once, then died again. So I guess - some resolvers, or FD servers may be working!

xuf 2 hours ago [-]
Down here too (region West Europe)
bossyTeacher 42 minutes ago [-]
I noticed issues on Azure so I went to the status page. It said everything was fine even though the Azure Portal was down. It took more than 10 minutes for that status page to update.

How can one of the richest companies in the world not offer a better service?

voidpointer2000 2 hours ago [-]
Down in Sweden Central as well (all our production systems are down)
tonyhart7 7 minutes ago [-]
Wtf happen with US east????
majnata 2 hours ago [-]
The Azure API is still working though.
rvz 2 hours ago [-]
Looking forward to the post mortem.
joaomoreno 2 hours ago [-]
Yup, see it as well.
pred8er 1 hours ago [-]
things seem to be coming back up now
giantg2 2 hours ago [-]
Compare the comments and news coverage on this compared to the AWS outage... pretty telling.
2 hours ago [-]
1 hours ago [-]
improbableinf 1 hours ago [-]
What a time to be alive!
NDizzle 2 hours ago [-]
My best guess at the moment is something global like the CDN is having problems affecting things everywhere. I'm able to use a legacy application we have that goes directly to resources in uswest3, but I'm not able to use our more modern application which uses APIM/CDN networks at all.
siva7 2 hours ago [-]
auth services are down
barpol 2 hours ago [-]
still down
zzake 39 minutes ago [-]
Portal is now accessible, bypassing FDN
llama052 2 hours ago [-]
Just another day with microsoft. Honestly pretty tiring as something is always generally broken.
PacketPundit 59 minutes ago [-]
[dead]
bernardo786 1 hours ago [-]
now aws down again?
rawgabbit 2 hours ago [-]
ctoth 2 hours ago [-]
Layoffs will continue until uptime improves!
onraglanroad 29 minutes ago [-]
Now that is actually funny!
FeteCommuniste 1 hours ago [-]
> [Satya Nadella] said that the company’s future opportunity was to bring AI to all eight billion people on the planet.

But what if I don't want AI brought to me?

mring33621 1 hours ago [-]
Sounds like someone has a case of the 'Mondays'...
binarymax 1 hours ago [-]
The mondAIs
bostik 47 minutes ago [-]
You'll have to find another planet.

Although judging by the available transports it will likely be colonized by nazis.

gcanyon 59 minutes ago [-]
ryandrake 1 hours ago [-]
Like most technology initiative these tech CEOs dream up: You're going to get it and swallow it, whether you want it or not.
the_af 1 hours ago [-]
I especially like how Nadella speaks of layoffs as some kind of uncontrollable natural disaster, like a hurricane, caused by no-one in particular. A kind of "God works in mysterious ways".

    > “Microsoft is being recognized and rewarded at levels never seen before,” Nadella wrote. “And yet, at the same time, we’ve undergone layoffs. This is the enigma of success in an industry that has no franchise value.”
     
    > Nadella explained the disconnect between thriving financials and layoffs by stating that “progress isn’t linear” and that it is “sometimes dissonant, and always demanding.”
I've read the whole memo and it's actually worse than those excerpts. Nadella doesn't even claim these were low performers:

    > These decisions are among the most difficult we have to make. They affect people we’ve worked alongside, learned from, and shared countless moments with—our colleagues, teammates, and friends.
Ok, so Microsoft is thriving, these were friends and people "we've learned from", but they must go because... uh... "progress isn't linear". Well, thanks Nadella! That explains so much!
inquirerGeneral 1 hours ago [-]
[dead]
2 hours ago [-]
paj33t 2 hours ago [-]
[dead]
improbableinf 1 hours ago [-]
According to downtector.com - both AWS and GCP are down as well. Interesting
jasonjmcghee 1 hours ago [-]
Don't visit this address.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 18:25:16 GMT+0000 (Coordinated Universal Time) with Vercel.