Remix.run Logo
vunderba 5 hours ago

Yeah I think that's a fair critique. It kind of looks like a bad cut-and-replace job (if you zoom in you can even see part of the neck is missing). I might give it some more attempts to see if it can do a better job.

I agree that Seedream could definitely be called out as a fail since it might just be a trick of perspective.

sefrost 3 hours ago | parent [-]

Have you ever considered a “partial pass”?

Perhaps it would be an easy cop out of making a decision if you had to choose something outside of pass/fail.

vunderba 2 hours ago | parent [-]

That's not a bad suggestion. I thought about adding a numerical score but it felt like it was bit overwhelming at the time. Maybe I should revisit it though in the form of:

  Fail = 0 points
  Partial = 0.5 points
  Success = 1 point
There's definitely a couple of pictures where I feel like I'm at the optometrist and somehow failing an eye exam (1 or 2, A... or B).
jofzar 39 minutes ago | parent [-]

I agree with this, some of those are "passing" and others are really passing. Specially with how much better some of the new model is compared to old ones.

I think the paws one is a good example where I think the new model got 100% while the other was more like 75%