| ▲ | alansaber 5 hours ago | |||||||||||||||||||||||||||||||||||||||||||||||||
"Our models are more honest" honey the quarterly marketing spin for a ML term has come. Forget "task alignment" now we're going for "truth index". I suppose this is the only way to generate hype when you're selling/releasing the same product over and over again. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | TIPSIO 5 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
When doing some electrical, Opus 4.7 essentially told me to wiggle a wire to see if it was hot or not with my bare hand. I called it out. It then gave me one of the most super heartfelt honest and sincere apologies I have ever received. Glad the safety team was there for me and able to make such an honest model or I would have been very upset about it. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | doginasuit 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
Credit where it is due, Claude is fantastic at pointing out potential flaws in how I understand the problem based on my question. I asked for this in the system instructions but it is the first model I've tried that does it regularly. It is also so tactful, I feel like I'm learning social skills from a language model. Half of the time it is a false positive due to insufficient context but I still appreciate the additional check. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | mrdependable 5 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
Gave me wrong information on my very first question. Wasn’t even complicated, and I wasn’t trying to trick it. | ||||||||||||||||||||||||||||||||||||||||||||||||||