bayindirh 2 hours ago
> For those claiming they rigged it.

I don't think they "rigged" it, but they might have given that part a bit more of a push, since this has been going on for a very long time now. Another benchmark is running at [0]. It's pretty interesting. A perfect-scoring model "borks" in the next iteration, for example.

> Rant: This is why AI is going to take over, folks are not even trying the least.

It might be drawing things all right, at least in some cases. I use it only seldom, when hours of my own research don't take me where I want to go, and guess what? AI can't get there, either. It hallucinates things, makes up stuff, etc. For a couple of things I asked, it managed to find a single reference, and it was the thing I was looking for, so it rarely works in my cases.

Rant: This is why people are delusional. They test the happy path and claim it knows all the paths, and then some.