| ▲ | Imustaskforhelp 4 hours ago | |
Thanks I have generated both beaver riding a scooter: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056... pelican riding a bicycle: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056... Personal opinion but the beaver one looks especially bad as compared to pelicans. Can we be for sure that this model of grok-4.3 hasn't been trained on pelican. Simonw in blog-post says that he will try with other creatures so I hope he does that but it does feel to me as the model/xAI is trying to cheat, Hope Simonw tests it out more. Edit: Also added turtle riding a scooter, something which literally has images online or heck even teenage mutant ninja turtles and I thought that it would be able to pass this but it wasn't even able to generate this: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056... This literally looks more avocado than turtle. Perhaps this could be a bug from arena.ai or something else too, not sure but at this point waiting for simon's analysis. | ||
| ▲ | gchamonlive 4 hours ago | parent [-] | |
We can never be sure of course, but I think this is a very strong indication that pelican riding a bike is indeed going into the training dataset. Thanks for generating those! | ||