| ▲ | benhoyt 4 days ago |
| My fly.io-hosted website went down for 5 minutes (6 hours ago), but then came right back up, and has been up ever since. I use a free monitoring service that checks it every 5 minutes, so it's possible it missed another short bit of downtime. But fly.io has been pretty reliable overall for me! |
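To put a rough number on how much downtime a 5-minute check can miss, here is a minimal sketch. It assumes each check is instantaneous, fails only if the site is down at that instant, and that an outage starts at a random point between checks. The function name `detection_probability` is hypothetical and used only for illustration.

```python
def detection_probability(outage_s: float, interval_s: float) -> float:
    """Chance that at least one periodic check lands inside an outage.

    Assumes instantaneous checks every `interval_s` seconds and an outage
    whose start time is uniformly random relative to the check schedule.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    # An outage at least as long as the interval always contains a check.
    return min(1.0, max(0.0, outage_s) / interval_s)


# A 1-minute outage against 5-minute checks (values chosen for illustration).
one_minute_blip = detection_probability(60, 300)
```

Under these assumptions a one-minute blip is caught only about one time in five, while anything lasting five minutes or longer is always seen.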
|
| ▲ | nomilk 4 days ago | parent | next [-] |
| Would be fascinated to see your data over a period of months. Application uptime was flaky, but what was worse was fly deploys failing for no clear reason. Sometimes layers would just hang and eventually fail; I'd run the same command an hour or two later without any changes and it would work as expected. I'd love to make a monitoring service to deploy a basic app (i.e. run the fly deploy command) every 5 minutes and see how often those deploys fail or hang. I'd guess ~5% inexplicably fail, which is frustrating unless you've got a lot of spare time. |
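The deploy prober described above could be sketched roughly like this. It is only an illustration under stated assumptions: the helper names `run_probe` and `failure_rate` are hypothetical, the outcome labels are made up for the example, and the command to run is passed in by the caller rather than hard-coded.

```python
import subprocess
from collections import Counter


def run_probe(cmd, timeout_s):
    """Run one probe command and classify it as 'ok', 'failed', or 'hung'."""
    try:
        result = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,  # the child is killed if it exceeds this
        )
    except subprocess.TimeoutExpired:
        return "hung"
    except OSError:
        # The command could not be started at all (e.g. not installed).
        return "failed"
    return "ok" if result.returncode == 0 else "failed"


def failure_rate(outcomes):
    """Fraction of recorded probe outcomes that were not 'ok'."""
    if not outcomes:
        return 0.0
    counts = Counter(outcomes)
    return (len(outcomes) - counts["ok"]) / len(outcomes)
```

A scheduler (cron, or a loop that sleeps 300 seconds) would call something like `run_probe(["fly", "deploy"], timeout_s=600)` from the app directory and append the result to a log; the 600-second timeout is an assumed value, not a recommendation. Separating "hung" from "failed" keeps the two symptoms mentioned above distinguishable in the data.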
| |
| ▲ | jrockway 4 days ago | parent | next [-] | | I used to run a service that created k8s clusters on GCP for our customers. We did want to check that that functionality kept working and had a prober test it periodically. It was actually broken a lot. Always good to monitor your dependencies if you have the time. Then when someone complains about an issue in your service, you can check your monitoring to see if your upstream services are broken. If they are, at least you know where to start debugging. | |
| ▲ | sanswork 4 days ago | parent | prev | next [-] | | My downtimes from fly are pretty rare but generally global when they happen. In this outage we had no downtime but couldn't deploy for a few hours. I have issues with deploying about once per quarter (I deploy most days across a few apps). | | |
| ▲ | nomilk 4 days ago | parent [-] | | If that’s the case I suspect fly is getting a lot more reliable. I stopped using them about a year ago so haven’t kept up on their reliability since. Glad to hear, it’s good for a competitive market to have many providers, and fly might have issues but hopefully has a bright future | | |
| ▲ | sanswork 4 days ago | parent [-] | | They are definitely getting more reliable. I was an early user and moved off them to self hosted for quite a while because of the frequent downtime in early days. Their support still leaves a lot to be desired even as someone that pays for it but the ease of running and deploying a distributed front end keeps bringing me back. |
|
| |
| ▲ | rozenmd 4 days ago | parent | prev | next [-] | | This may be of interest to you: https://news.ycombinator.com/item?id=42243282 | |
| ▲ | Joel_Mckay 4 days ago | parent | prev [-] | | [flagged] | | |
| ▲ | LorenzoGood 4 days ago | parent [-] | | What does rust have to do with fly.io? | | |
| ▲ | aobdev 4 days ago | parent | next [-] | | Snark aside, Joel is suggesting that because Fly uses rust-based virtualization software they should have a more reliable deployment process. | | |
| ▲ | LorenzoGood 4 days ago | parent | next [-] | | Thanks for clarifying. | |
| ▲ | Joel_Mckay 4 days ago | parent | prev [-] | | [flagged] | | |
| ▲ | nomilk 4 days ago | parent [-] | | By asking directly and someone answering, it solves the problem for the person wondering, but also anyone else wondering (i.e. asking directly scales very nicely). | | |
|
| |
| ▲ | Joel_Mckay 4 days ago | parent | prev [-] | | [flagged] |
|
| ▲ | rozenmd 4 days ago | parent | prev | next [-] |
| I externally monitor fly.io and its docs here: https://flyio.onlineornot.com/ Looks like it lasted 16 minutes for them. |
| |
| ▲ | tptacek 3 days ago | parent [-] | | It wasn't a request routing outage; apps running on Fly.io didn't stop running. It was a deployments outage. For reasons passing understanding (I am reliably informed I'm wrong to complain about this), our website is the same Elixir app as our dashboard, and the dashboard got redeployed at one point. Our website being down is not the same as the whole service being down, though I guess there's a truth-in-advertising poetry to it being down when deployments are busted. | | |
| ▲ | sevenseacat 3 days ago | parent | next [-] | | A lot of apps did stop running - https://community.fly.io/t/fly-io-site-is-currently-inaccess... The entire API was also unusable, not just deployments. | | |
| ▲ | tptacek 2 days ago | parent [-] | | Sorry, you're right: pretty much any time I'm saying deployments are blocked, I'm really saying the API was down. |
| |
| ▲ | itbeho 3 days ago | parent | prev [-] | | I'm not sure if your explanation is comforting or disconcerting. | | |
| ▲ | tptacek 3 days ago | parent | next [-] | | Why not both? Tell me what's comforting and I'll tell you why you shouldn't be comforted; tell me why you're disconcerted and I'll tell you maybe something else. All we can do is be straight about things. | |
| ▲ | pajeetz 3 days ago | parent | prev [-] | | [flagged] | | |
| ▲ | tptacek 3 days ago | parent [-] | | I'm an HN person before I'm a Fly.io person, and as an HN person I find the points you're trying to make --- anybody can see them throughout the thread simply by searching your name --- tedious. As a businessperson, I don't think I have much to gain by genuflecting to the importance of reliability; everybody I care about on this site shares an understanding with us that reliability is important, though apparently not with you that all these systems are fallible. So I'm making the decision not to genuflect, and instead call you out --- you in particular, anonymous, venomous, green-named commenter --- as a writer of boring and facile attempted dunks. | | |
| ▲ | pajeetz 3 days ago | parent [-] | | Are we not allowed to expect reliable uptimes from a cloud provider? What part of "fly.io has a documented history of prolonged downtimes and data redundancy issues" do you disagree with? Are you calling everybody liars who have had bad experiences with fly.io and, frankly, business and reputation loss that came as a result of trusting fly.io? | | |
| ▲ | tptacek 3 days ago | parent [-] | | Nobody has called anybody a liar. I'm very comfortable with what I've said thus far on this thread, so maybe we're fine leaving it here. | | |
| ▲ | pajeetz 2 days ago | parent [-] | | Is that why you are going through all my comments and flagging and downvoting? You know this just makes you look even worse, right? | | |
| ▲ | tptacek 2 days ago | parent [-] | | That's not how flags work on HN: I can't flag your responses to me. For a variety of reasons, and for better or worse, I'm very comfortable with how I come across on HN. To twist the knife just a little bit, I'll refer you though to the guidelines: "Please don't comment about the voting on comments. It never does any good, and it makes boring reading." https://news.ycombinator.com/newsguidelines.html |
|
| ▲ | davidgl 4 days ago | parent | prev | next [-] |
| Same for us, down for ~5 mins, back up and fine, error was 501 |
| |
|
| ▲ | beezlewax 4 days ago | parent | prev | next [-] |
| Do you mind if I ask what monitoring service that is? |
| |
|
| ▲ | dprotaso 3 days ago | parent | prev | next [-] |
| What free monitoring tool do you use? |
|