| ▲ | onlyrealcuzzo 2 hours ago | |||||||
> - this gets reinvented/rediscovered constantly under different names What are the different names? I haven't seen this before. > - it cant be trained very well (right now, will change) If you're sure it will change, then why are you certain that it hasn't yet, and if it's proven a 5000x boost in reasoning... why aren't they exploring this path more aggressively? > the idea is 100% obvious to all the frontier labs and there is a good reason why it isn't used Surely someone is willing to take a 5000x boost in reasoning on a small research model... None of them have even tried anything resembling this AFAIK. It does not seem like something 100% obvious to them. | ||||||||
| ▲ | everforward an hour ago | parent [-] | |||||||
> Surely someone is willing to take a 5000x boost in reasoning on a small research model... None of them have even tried anything resembling this AFAIK. It does not seem like something 100% obvious to them. Without knowing anything about the technology at all, if it can't be aligned I could see no one pursuing it. As far as I know, alignment is where the "don't tell the user how to make meth or generate CP" instructions end up and the last I saw eliding all the unsavory training data made materially worse LLMs. It could maybe be post-evaluated by a non-GRAM LLM? Not being aligned is probably a fatal flaw or at least a very short runway into Congress. | ||||||||
| ||||||||