| ▲ | redfloatplane 3 hours ago | ||||||||||||||||||||||||||||||||||
> I would like to reach out and talk to biologists - do you find these models to be useful and capable? Can it save you time the way a highly capable colleague would? Well, I would say they have done precisely that in evaluating the model, no? For example section 2.2.5.1: >Uplift and feasibility results >The median expert assessed the model as a force-multiplier that saves meaningful time (uplift level 2 of 4), with only two biology experts rating it comparable to consulting a knowledgeable specialist (level 3). No expert assigned the highest rating. Most experts were able to iterate with the model toward a plan they judged as having only narrow gaps, but feasibility scores reflected that substantial outside expertise remained necessary to close them. Other similar examples also in the system card | |||||||||||||||||||||||||||||||||||
| ▲ | torginus 2 hours ago | parent [-] | ||||||||||||||||||||||||||||||||||
This is the exact logic people that was used to claim that GPT4 was a PhD level intelligence. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||