Remix.run Logo
oarsinsync 2 days ago

Huge respect to the author for the details that have gone into this. I'd spent a week hammering at a Claude max 20x plan to try and build schengen 90/180 rolling window + tax residency in a couple of countries tracker... and that was hard work. I can only imagine how much effort has gone into this, to get all the details right.

It's unclear whether the author wrote all of this themselves, or if they outsourced a bunch of it to Claude. My experience with Claude was that it was terrible at writing code to do the math, even when I explained what the calculation needed to be, what the input was, and what the expected result was. It ultimately took starting a whole new project just to do the rolling window calculation, and then have that fed back in.

My biggest question for the author, if they happen to see this, is: how much manual testing validation did you do of the outputs the app produces? IE: Did you do the inputs + transformations = output calculations yourself as well, counting days on calendars, etc, to validate that the app is actually accurate? (That was the only way I developed any faith in solution I made for myself, which is way less impressive than your app). Regardless of whether you wrote the code yourself or not, a thorough test harness feels vitally important for an app like this.

gommm 2 days ago | parent | next [-]

I tend to find that for things like this that are really math heavy, it's usually better to create a DSL (or create easily readable function calls, etc) that you can easily write yourself instead of relying on AI to understand math heavy rules. Bonus points, if the rules are in an easily editable format, you can change them easily when they need to. It seems that was the path the author took...

And yes this kind of use-case is exactly where unit tests shine...

zahlman 2 days ago | parent | next [-]

> create a DSL (or create easily readable function calls, etc)

These aren't really that different. Consider the history of the earliest (non-assembly) programming languages, particularly https://en.wikipedia.org/wiki/Speedcoding , as well as the ideas expressed by Lisp.

embedding-shape 2 days ago | parent | prev [-]

I do the opposite, set up everything myself in terms of architecture/design of the software, so the AI can do the boring boilerplate like "math heavy rules". Always interesting to see how differently we all use LLMs.

davedx 2 days ago | parent | prev | next [-]

When I’ve worked on complex scheduling problems like that I use copious unit tests, they’re perfect for this kind of input->algo->output problem where algo has tons of edge cases.

Indeed, not using unit tests and instead trying to manually test all the cases sounds crazy to me!

skrebbel 2 days ago | parent | prev | next [-]

I'm not sure I’m reading this right but are you saying that an AI made you dumber and then you complain that the AI is too dumb? That sounds like a lose-lose deal tbh.

2 days ago | parent | prev | next [-]
[deleted]
lionkor 2 days ago | parent | prev [-]

> that was hard work

I'm sorry. I don't want to fight here, but you have literally just said you paid Claude to do the thinking for you (except for some math), yet you're talking about this like you're some kind of scientist; or that you've done this extensive, in-depth work.

You made an AI vibe-code an app in a week and now you're impressed someone else was able to do it better?

Am I missing something? Is it maybe just your writing style that makes it come across so "from your high horse"?

flumpcakes 2 days ago | parent | next [-]

This task seems like something a competent Excel user could create. I think the hard part is knowing the rules and the corner cases than any of the "math" (just addition and subtraction, surely) required.

woodson 2 days ago | parent | next [-]

But that’s the thing, there are no guarantees that the corner cases were actually handled correctly. Especially if it was AI coded without review by a subject matter expert.

andai 2 days ago | parent | prev [-]

How do you encode Gregg's sausage roll in .xls? ;)

jon-wood 2 days ago | parent [-]

You joke, but because it is actually possible - you have a regularly updated sheet which contains all your bank transactions (my bank will continually update a Google Sheet for me if I ask), and then you do a lookup for a Gregg's transaction on the relevant day.

zahlman 2 days ago | parent | prev [-]

I understand the sentiment, but I come to HN largely to avoid that tone in Internet commentary.