Remix.run Logo
toddmorey 6 hours ago

My taxes are rather complex, so I ran the same exercise to see if Claude agreed with my accountant. An automated second opinion, so to speak. Spent about 6 minutes analyzing all the PDFs and basically nailed it perfectly in one shot.

My only point here is it sure seems the same activity / use case can have wildly different results across sessions or users. Customer support and product development in the age of non-deterministic software is a strange, strange beast.

ozozozd 6 hours ago | parent [-]

What does nailing mean when you ask whether it agreed with your accountant?

toddmorey 6 hours ago | parent [-]

Given the same inputs but not provided the results (output) from our accountant, did it come to the same conclusions or have good analysis as to why it differed?

Obviously, accounting is "spreadsheet math" intensive, so Claude wrote some python scripts for that which kept the math very stable. But there were some complex nuances that had taken the accountant and I quite a bit of work to track down and clarify. Claude quickly had a very accurate read on the situation and knew all the right clarifying questions.

I'm not yet ready to ever sign a return that's been entirely AI prepared, but I left the exercise pretty impressed.