Ask HN: Is AWS down again?

There's nothing on health.aws.amazon.com but I'm getting reports of performance failure on systems that utilize AWS...

Downdetector is also recording failure reports: https://downdetector.com/status/aws-amazon-web-services/

▲

tdubey 6 days ago | parent | next [-]

Disclosure, I work for Datadog:

https://updog.ai/status/amazonaws

Looks fine for now.

▲

AkshatM 6 days ago | parent | next [-]

Piece of UX feedback for the product team behind Updog: company logos are not searchable. It should be easy to Ctrl-F and find a relevant cloud on that detector, instead of scrolling alphabetically.

▲

jacobwg 6 days ago | parent | prev | next [-]

You all should add EC2 - extra bonus if you have some way of tracking performance in addition to errors (right now we're seeing EC2 instances in us-east-1c not transition out of Pending status).

▲

nodesocket 6 days ago | parent | prev | next [-]

This is cool, does this actually hit all the services directly (in each region) instead of pulling from AWS Status?

▲

dewey 6 days ago | parent | next [-]

Which uptime checker tool would be based on status pages (owned by the marketing department)? That defeats the whole purpose.

	▲	rozenmd 6 days ago \| parent \| next [-]
		I've run a business in this space since 2021, I am yet to meet a business that lets their marketing team own their status page. You'll find most engineering teams will start owning a status page to centralise updates to their stakeholders, before eventually growing into the customer success/support org owning it to minimise support tickets during incidents. Marketing has nothing to do with status pages.
	▲	kylecazar 6 days ago \| parent \| prev [-]
		I highly doubt AWS health dashboards are owned by marketing

▲

port3000 6 days ago | parent | prev | next [-]

https://updog.ai/status/openai issue history looks terrible. Wonder how you ping openAI for this; with a completion attempt on a particular model?

	▲	arbll 6 days ago \| parent [-]
		It is based on the impact on Datadog's customers, not on synthetic queries / pings

▲

seedless-sensat 6 days ago | parent | prev [-]

From the page:

> API health is inferred by analyzing aggregated and anonymized telemetry from across the Datadog customer base.

▲

snarkyturtle 6 days ago | parent | prev [-]

What's updog?

▲

mrinterweb 6 days ago | parent | next [-]

Nothing much. How bout you?

▲

hughdbrown 6 days ago | parent | prev | next [-]

Ah, you have fallen for it:

https://www.youtube.com/watch?v=wa4VJobPBr4

▲

Imustaskforhelp 6 days ago | parent [-]

Wasn't the first reference to this joke in the office, also is it just me or do I remember this guy from either breaking bad or the office or (both??)

Was his name neil on breaking bad or the office, I think his name was neil in the office, one of the warehouse workers right?

	▲	almosthere 6 days ago \| parent \| next [-]
		2003 https://knowyourmeme.com/memes/updog Personally my favorite usage of the joke is from Kitboga: https://www.facebook.com/Twitch/videos/watch-kitboga-on-twit...
	▲	HelloUsername 6 days ago \| parent \| prev [-]
		Better Call Saul

▲

millerm 6 days ago | parent | prev | next [-]

I know I could sure use some updog right about now.

▲

amelius 6 days ago | parent | prev [-]

Isn't that a kind of yoga pose?

▲

jacobwg 6 days ago | parent | prev | next [-]

We've been observing EC2 instances launched in us-east-1c (use1-az2) remain in Pending status for a very long time / indefinitely, starting at around 16:00 UTC.

	▲	everfrustrated 6 days ago \| parent [-]
		We were seeing ECS Fargate capacity weirdness in us-east-1 earlier.

▲

ENGNR 6 days ago | parent | prev | next [-]

Had some very weird behaviour from cloudfront used purely to serve images from s3. Mostly huge slowdowns and outright failures on endpoints. Was about 15 hours ago that I noticed it by chance.

Was nothing on the aws status pages and no alerts/errors in my console. Eventually it sped up again.

	▲	taf2 5 days ago \| parent [-]
		We noticed massive latency from cloudfront spent the first part of my day migrating services out.

▲

bwb 5 days ago | parent | prev | next [-]

I'm Ben from https://downforeveryoneorjustme.com/

We are not seeing anything right now... keeping an eye out but things are normal.

▲

farseer 6 days ago | parent | prev | next [-]

I would think that data centers scale up horizontally and a failure of one node should only affect a limited number of customers. Barring any centralized DNS mess up of-course.

▲

danudey 6 days ago | parent | prev | next [-]

I'm currently in the process of spinning up a k8s in us-west-2 and no issues, but, as others have said, us-east-1 is the problem child so I guess we'll see.

▲

matt-p 6 days ago | parent | prev | next [-]

It's wonky for sure, but only to certain IP ranges.

▲

AbhiAmbad 6 days ago | parent | prev | next [-]

Yes, aws was done. I am very irritated with aws now. This is 3rd time. Pricing is also very high…

Thinking to switch to another platform.

▲

nodesocket 6 days ago | parent | prev | next [-]

Don't see any issues in us-east-2 (Ohio) with my infra, but typically issues arise in us-east-1.

▲

paulddraper 6 days ago | parent | prev | next [-]

Down detector is much much higher when there is a real problem.

There might be something, but wouldn’t be widespread.

	▲	Johnny555 6 days ago \| parent [-]
		Most of the reports on the downdetector heatmap are coming from the NYC area, that's probably more likely to be a network issue (or even, if you can imagine it, DNS) than a real AWS failure since it's well into business hours on the West Coast. https://downdetector.com/status/aws-amazon-web-services/map/

▲

myidealab 6 days ago | parent | prev | next [-]

I think it may be down, showing early signs based on location services (geofencing) warning.

▲

NoSalt 6 days ago | parent | prev | next [-]

I was thinking the same thing as some sites like Google were taking a LONG time to load.

	▲	Johnny555 6 days ago \| parent \| next [-]
		Google probably isn't using AWS for any of their infrastructure.
	▲	soupfordummies 6 days ago \| parent \| prev [-]
		Internet has been very sluggish for me today too. Something may be going on (not necessarily AWS)

▲

anshumankmr 6 days ago | parent | prev | next [-]

I got a down status on https://leetcode.com/..It may be related.

▲

silktson 6 days ago | parent | prev [-]

still slow down